Tanzania, as a developing country, struggles to provide clean water to its population of over 57,000,000.
Business Understanding
There are many water points already established in the country, but some are in need of repair while others have failed altogether. The goal is to help the NGO locate non-functional water wells and find patterns in them that can influence how new wells are built.
Classifications include:
-Functional wells
-Functional wells that need repair
-Non-functional wells
Business Questions
Where are the wells that need repair located?
What patterns in non-functional wells should influence how new wells are built?
What is the predicted condition of a water well?
Data Understanding and Analysis
Requirements
An Anaconda environment to run the models.
A Jupyter Notebook that demonstrates an iterative approach to modeling. It begins with a basic model and then provides justification for the changes that follow. The notebook also provides 1-3 paragraphs discussing the final model. The deliverables should explicitly address each step of the data science process.
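As a sketch of what the "basic model" could look like, the snippet below computes a majority-class baseline: it always predicts the most common label and reports the accuracy that any later model has to beat. The function name `majority_baseline`, the label spellings, and the sample labels are illustrative assumptions, not taken from the project's notebook.

```python
from collections import Counter

def majority_baseline(train_labels, test_labels):
    """Predict the most frequent training label for every test row.

    Returns (predicted_label, accuracy_on_test).
    """
    if not train_labels or not test_labels:
        raise ValueError("both label lists must be non-empty")
    # Most common label in the training data
    predicted = Counter(train_labels).most_common(1)[0][0]
    # Count how many test rows the constant prediction gets right
    hits = sum(1 for label in test_labels if label == predicted)
    return predicted, hits / len(test_labels)

# Made-up labels for illustration only (not real data)
train = ["functional"] * 5 + ["non functional"] * 3 + ["functional needs repair"]
test = ["functional", "non functional", "functional", "functional needs repair"]
label, accuracy = majority_baseline(train, test)
print(label, accuracy)
```

Any later model in the notebook would be expected to beat this accuracy before its added complexity is justified.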
Description of data
*amount_tsh : Total static head (amount of water available to the waterpoint)
*date_recorded : The date the row was entered
*funder : Who funded the well
*gps_height : Altitude of the well
*installer : Organization that installed the well
*longitude : GPS coordinate
*latitude : GPS coordinate
*wpt_name : Name of the waterpoint if there is one
*num_private : Private use or not
*basin : Geographic water basin
*subvillage : Geographic location
*region : Geographic location
*region_code : Geographic location (coded)
*district_code : Geographic location (coded)
*lga : Geographic location
*ward : Geographic location
*population : Population around the well
*public_meeting : True/False
*recorded_by : Group entering this row of data
*scheme_management : Who operates the waterpoint
*scheme_name : Who operates the waterpoint
*permit : If the waterpoint is permitted
*construction_year : Year the waterpoint was constructed
*extraction_type : The kind of extraction the waterpoint uses
*extraction_type_group : The kind of extraction the waterpoint uses
*extraction_type_class : The kind of extraction the waterpoint uses
*management : How the waterpoint is managed
*management_group : How the waterpoint is managed
*payment : What the water costs
*payment_type : What the water costs
*water_quality : The quality of the water
*quality_group : The quality of the water
*quantity : The quantity of water
*quantity_group : The quantity of water
*source : The source of the water
*source_type : The source of the water
*source_class : The source of the water
*waterpoint_type : The kind of waterpoint
*waterpoint_type_group : The kind of waterpoint
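Several columns above share a description (for example `extraction_type`, `extraction_type_group`, and `extraction_type_class`), which suggests that some may be coarser groupings of others. One way to check this, sketched below under that assumption, is to test whether every value of the finer column maps to exactly one value of the coarser column. The helper `is_coarser_grouping` and the sample rows are hypothetical.

```python
def is_coarser_grouping(rows, fine, coarse):
    """Return True if each value of `fine` maps to exactly one value of `coarse`."""
    mapping = {}
    for row in rows:
        # Remember the first coarse value seen for this fine value
        seen = mapping.setdefault(row[fine], row[coarse])
        if seen != row[coarse]:
            return False
    return True

# Made-up rows for illustration only (not real data)
rows = [
    {"extraction_type": "nira/tanira", "extraction_type_class": "handpump"},
    {"extraction_type": "swn 80", "extraction_type_class": "handpump"},
    {"extraction_type": "gravity", "extraction_type_class": "gravity"},
    {"extraction_type": "nira/tanira", "extraction_type_class": "handpump"},
]
fine_to_coarse = is_coarser_grouping(rows, "extraction_type", "extraction_type_class")
coarse_to_fine = is_coarser_grouping(rows, "extraction_type_class", "extraction_type")
print(fine_to_coarse, coarse_to_fine)
```

If the check holds in one direction only, the coarser column adds no information beyond the finer one, so a model may need just one of the two.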
Visualizations
Conclusion
*Many wells need repair compared to non-functional ones.
*The best construction work was done in 2000-2005.
*Regions such as Mbeya, Shinyanga, Kigoma, and Kilimanjaro need their wells repaired.
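Conclusions like the regional one above come from comparing well status across groups. The snippet below is a minimal sketch of that tally, assuming each record carries a region and a status label. The field name `status_group`, the label strings, the helper `share_by_group`, and the sample records are assumptions for illustration, not results from the data.

```python
from collections import Counter

def share_by_group(records, group_key, status_key, target_status):
    """Fraction of records in each group whose status equals target_status."""
    totals = Counter()
    matches = Counter()
    for rec in records:
        totals[rec[group_key]] += 1
        if rec[status_key] == target_status:
            matches[rec[group_key]] += 1
    # Every group in `totals` has at least one record, so no division by zero
    return {group: matches[group] / totals[group] for group in totals}

# Made-up records for illustration only (not real data)
records = [
    {"region": "Kigoma", "status_group": "functional needs repair"},
    {"region": "Kigoma", "status_group": "functional"},
    {"region": "Shinyanga", "status_group": "non functional"},
    {"region": "Shinyanga", "status_group": "functional needs repair"},
    {"region": "Shinyanga", "status_group": "functional"},
    {"region": "Shinyanga", "status_group": "functional"},
]
shares = share_by_group(records, "region", "status_group", "functional needs repair")
print(shares)
```

The same helper could be pointed at a construction-year bin instead of `region` to compare well status across building periods.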