Python CSV files merger

washington chekata

Backend Engineer
scripting and automation
Software Engineer
GitHub
pandas
Python
Description: This project demonstrates my ability to work with file systems, handle CSV files, and efficiently merge multiple datasets into a single unified file. The core functionality involves reading multiple CSV files from a specified directory, validating their structure, and merging them into a single output file.
Key Features:
CSV File Validation: The project uses robust error handling to check if files are in the correct CSV format and ensures proper structure with validation for delimiter consistency and file content.
Efficient File Merging: It merges CSV files even with large datasets by handling multiple files seamlessly, appending them into a single output file.
Logging & Error Handling: Detailed logging is included to ensure transparency in processing and to capture errors, with specific error messages for file validation failures.
Customizable for Various Use Cases: Although tailored for CSV files, the approach can be easily adapted for other file types and formats, making it highly flexible.
Technologies Used:
Python
Pandas
CSV module
Logging
This project is perfect for anyone dealing with large datasets across multiple files, such as for data analysis, reporting, or ETL processes. It showcases my skills in data handling, file manipulation, and problem-solving with a clean and maintainable codebase.
Partner With washington
View Services

More Projects by washington