This repo contents the two following sections:
Dataset analysis, preprocessing, metrics and other analyses are shown in Jupyter notebook binary_classification.ipynb.
Three Python files can be found in src/ folder:
- train.py - contains logic for neural network training,
- inference.py - contains logic for predicting classes after training is completed,
- helpers.py - contains some useful functions and variables, which are used by train.py and inference.py.
NOTE: after train.py is completed, the output/ folder is created. It contains three files:
- neural_network.keras (TRAINED_MODEL variable in helpers.py) - contains trained neural network,
- standard_scaler.json (STANDARD_SCALER variable in helpers.py) - contains Standard Scaler configs,
- encoder.json (ENCODER variable in helpers.py) - contains Encoder configs.
All requirements need for running scripts can be found in requirements.txt.
To train the neural network run:
python src/train.py data/p1_train.csv
To predict classes with trained neural network run:
python src/inference.py data/p1_test_student.csv