Project Description
Read the README description at the repository to better understand how it works.
However, a simple description is: When there is a need of using more than one dataset, there is a need to cross them at some point. This project is a Python Module capable of doing such merge automatically. It is an enabling tool to allow data scientists (for instance) to avoid the effort of doing it. When it merges all the structured data the user wants, it returns a report and a sqlite database file for the user to then proceed on their analysis.
This project proposes a solution for multiple challenges by creating a Python module capable of taking multiple .csv or .xslx files and combining them into one single database, on a sqlite file. In addition, it provides a report with all files that were able to merge into such database.
It is an enabling tool, not a data analysis and information giving software. However, it is capable of checking all possible combinations inside the total range of files given. For instance, if 10 files are given and two different clusters of data exists on them, it will generate 2 different databases alongside with their respective reports.