Wrangle-OpenStreetMap-Databases

View project on GitHub

Welcome to my “Wrangle OpenStreetMap Databases” Project Page!

I completed this project for Udacity’s Data Analyst Nanodegree “Data Wrangling” and “SQL for Data Analysis” courses.

In these courses, we learned how to do the following:

  • Assess the quality of data for validity, accuracy, completeness, consistency, and uniformity.
  • Parse and gather data from popular file formats such as .json, .xml, .csv, and .html.
  • Process data from many files, and from files too large to be cleaned with spreadsheet programs.
  • Store, query, and aggregate data using SQL.

For this project, I wrangled the XML OpenStreetMap data for Los Angeles, CA. I then used data munging techniques to assess the quality of the data for validity, accuracy, completeness, consistency and uniformity. To finish it off, I used SQL to store, query, and aggregate the data.
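As a rough illustration of that workflow (not the notebook's actual code), here is a minimal sketch that streams OSM XML, cleans one field, and aggregates it with SQL. The inline sample data, the `STREET_FIXES` mapping, the `node_streets` table, and the function names are all hypothetical and chosen for the example; the real project works on the full Los Angeles extract.

```python
import sqlite3
import xml.etree.ElementTree as ET
from io import BytesIO

# Tiny inline sample standing in for the real Los Angeles extract (made-up data).
SAMPLE_OSM = b"""<osm>
  <node id="1" lat="34.05" lon="-118.24">
    <tag k="addr:street" v="Sunset Blvd"/>
  </node>
  <node id="2" lat="34.06" lon="-118.25">
    <tag k="addr:street" v="Main Street"/>
  </node>
  <node id="3" lat="34.07" lon="-118.26">
    <tag k="addr:street" v="Main St"/>
  </node>
</osm>"""

# Hypothetical mapping from abbreviations to the preferred full form.
STREET_FIXES = {"Blvd": "Boulevard", "St": "Street", "Ave": "Avenue"}


def clean_street(name):
    """Expand a trailing abbreviation such as 'Blvd' to its full form."""
    parts = name.split()
    if parts and parts[-1] in STREET_FIXES:
        parts[-1] = STREET_FIXES[parts[-1]]
    return " ".join(parts)


def iter_streets(source):
    """Stream the XML and yield (node_id, cleaned street name) pairs."""
    for _, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == "node":
            for tag in elem.iter("tag"):
                if tag.get("k") == "addr:street":
                    yield int(elem.get("id")), clean_street(tag.get("v"))
            elem.clear()  # release the node's children; matters for large files


def load_and_count(source):
    """Store streets in an in-memory SQLite table and count nodes per street."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE node_streets (node_id INTEGER, street TEXT)")
    conn.executemany("INSERT INTO node_streets VALUES (?, ?)", iter_streets(source))
    rows = conn.execute(
        "SELECT street, COUNT(*) AS n FROM node_streets "
        "GROUP BY street ORDER BY n DESC, street"
    ).fetchall()
    conn.close()
    return rows


street_counts = load_and_count(BytesIO(SAMPLE_OSM))
print(street_counts)
```

Streaming with `iterparse` rather than loading the whole tree is one reasonable choice for a city-sized extract, since the file never has to fit in memory at once. Passing a file path instead of the `BytesIO` object should work the same way.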

Both the original .ipynb notebook and the generated .html file are available through the “View project on GitHub” link above.

Enjoy!