Skip to content
#

nyc-open-data

Here are 6 public repositories matching this topic...

Identified data types for each distinct column value on 1900 data sets. For each column, summarized semantic types present in the column, using Fuzzy Logic, Levenshtein distance. Identified & derived inference the 3 most frequent 311 complaint types by borough.

  • Updated Apr 15, 2020
  • Python

Developed a comprehensive exploratory data analysis (EDA) of a vehicle repairs dataset, uncovering patterns in repair types, costs, and vehicle platforms. Includes data cleaning, insights extraction, tag generation from free-text fields, and saving of cleaned datasets for further analysis.

  • Updated Apr 30, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the nyc-open-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nyc-open-data topic, visit your repo's landing page and select "manage topics."

Learn more