Compute Natural Breaks in Python (Fisher-Jenks algorithm)
Updated Feb 14, 2025 - Python
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
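📊 Common data mining algorithms: the Apriori algorithm for association analysis, the decision tree algorithm for data classification, and the K-means algorithm for data clustering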
Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data (Nature Communications, 2023)
Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.
Visual Knowledge Discovery demo tools for interactively visualizing, exploring, and identifying complex n-D data patterns in multivariate CSV data, to visualize machine learning classifier models.
Two different approaches to predicting customer churn and identifying the important variables that drive it
Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.
Discover ROPAC, a novel rule-based classifier we proposed. Here, you'll find the code, data, and original paper detailing this data classification algorithm.
Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...
Machine learning classifier comparison GUI application. Choose 21 classifiers, evaluation data (optional, for evaluating synthetic data), hyperparameters, cross-validation splits, and RNG seed; the application tabulates the best, worst, average, and standard deviation of Accuracy/F1/Recall and visualizes them in Parallel Coordinates.
Developed a Python-based web scraper leveraging generative AI with LangChain and GPT-4o-mini to extract and classify FDA drug approval data. Processed over 1,770 records, dynamically categorizing medications and treatment areas using LLMs to simplify complex medical information into actionable insights.
The model predicts next month's credit card defaulters based on demographic data and the last six months of behavioral data
This project is a simplified version of TensorFlow, which uses a neural network to predict the price of homes in the Boston area
Repository containing projects and algorithms developed for the INF01124 - Data Classification and Search Algorithms course at UFRGS.
📂 Splits image datasets into training and testing sets for classification tasks. Useful for preparing data for machine learning models.
This repository contains Machine Learning projects from a Virtual Internship.
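For readers unfamiliar with the natural breaks classification named in the first entry of this list, the idea can be sketched in a few lines: sort the values, then pick class boundaries that minimize the total within-class sum of squared deviations. The code below is only an illustrative dynamic-programming sketch, not the code of any repository listed here; the function name `natural_breaks` and its return convention (minimum, inner breaks, maximum) are assumptions made for this example.

```python
from typing import List, Sequence


def natural_breaks(values: Sequence[float], n_classes: int) -> List[float]:
    """Hypothetical helper: return [min, inner breaks..., max] that minimize
    the total within-class sum of squared deviations."""
    data = sorted(values)
    n = len(data)
    if not 1 <= n_classes <= n:
        raise ValueError("n_classes must be between 1 and len(values)")

    # Prefix sums give the squared deviation of any slice in O(1).
    prefix = [0.0] * (n + 1)
    prefix_sq = [0.0] * (n + 1)
    for i, v in enumerate(data):
        prefix[i + 1] = prefix[i] + v
        prefix_sq[i + 1] = prefix_sq[i] + v * v

    def ssd(i: int, j: int) -> float:
        """Sum of squared deviations from the mean for data[i:j]."""
        s = prefix[j] - prefix[i]
        return (prefix_sq[j] - prefix_sq[i]) - s * s / (j - i)

    inf = float("inf")
    # cost[k][j]: best cost of splitting data[:j] into k classes.
    # split[k][j]: start index of the last class in that best split.
    cost = [[inf] * (n + 1) for _ in range(n_classes + 1)]
    split = [[0] * (n + 1) for _ in range(n_classes + 1)]
    cost[0][0] = 0.0
    for k in range(1, n_classes + 1):
        for j in range(k, n + 1):
            for i in range(k - 1, j):
                c = cost[k - 1][i] + ssd(i, j)
                if c < cost[k][j]:
                    cost[k][j] = c
                    split[k][j] = i

    # Walk back through the split table to recover each class's upper bound.
    breaks = [data[-1]]
    j = n
    for k in range(n_classes, 1, -1):
        j = split[k][j]
        breaks.append(data[j - 1])
    breaks.append(data[0])
    return breaks[::-1]
```

Calling `natural_breaks([1, 2, 3, 10, 11, 12, 20, 21, 22], 3)` groups the three visible clusters. This sketch runs in O(k·n²) time, which is fine for small inputs; dedicated libraries are the better choice for large datasets.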