unit_test_rl

Project code for training LLMs to write better unit tests + code

Warning: This is experiment code and isn't necessarily production ready or easy to use (E.g. There may be hardcoded paths to local data files). I'm just sharing some experiments that I thought were fun.

You may need this fork of OpenRLHF, I made some modifications to support guided decoding and pass arguments to the reward model: https://github.com/rosmineb/OpenRLHF branch custom_changes_main

To run:

Start the reward server, python unit_tests_server/unit_test_reward_server.py --workers 8 --port 5432
Call the run.sh experiment source experiments/u6_7B_form/run.sh

These experiments support using a grammar for guided decoding. See u6_7B_gram experiment. You can check that a grammar parses an output with the following code (but a warning: using an SFT'd model is much better than using a grammar and potentially less work)

python unit_tests_server/check_grammar.py --grammar_file experiments/unit_test/grammar/unit_test_grammar.txt --string_file experiments/unit_test/grammar/sample_output.txt --model_id Qwen/Qwen2.5-Coder-7B-Instruct

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
experiments		experiments
unit_tests_server		unit_tests_server
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
check_grammar.py		check_grammar.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

unit_test_rl

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

rosmineb/unit_test_rl

Folders and files

Latest commit

History

Repository files navigation

unit_test_rl

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages