Data Analysis Using Python: A Beginner’s Guide Featuring NYC Open Data.
-
Updated
Nov 23, 2024 - Jupyter Notebook
Data Analysis Using Python: A Beginner’s Guide Featuring NYC Open Data.
Buying a home in NYC, what Neighborhoods are the best value? This project seeks to understand the fundamental factors that explain differences in residential real estate prices across NYC.
[WIP] Building a New York City job portal using the NYC OpenData Jobs API:
Conducting geodemographic classification for ethnic groups in NYC using K-means algorithm available in sklearn.cluster module.
Identified data types for each distinct column value on 1900 data sets. For each column, summarized semantic types present in the column, using Fuzzy Logic, Levenshtein distance. Identified & derived inference the 3 most frequent 311 complaint types by borough.
Developed a comprehensive exploratory data analysis (EDA) of a vehicle repairs dataset, uncovering patterns in repair types, costs, and vehicle platforms. Includes data cleaning, insights extraction, tag generation from free-text fields, and saving of cleaned datasets for further analysis.
Add a description, image, and links to the nyc-open-data topic page so that developers can more easily learn about it.
To associate your repository with the nyc-open-data topic, visit your repo's landing page and select "manage topics."