New to Onnx

Official ORT documentation: https://www.onnxruntime.ai/
Official ORT GitHub Repo: https://github.com/microsoft/onnxruntime
Official ORT Samples Repo: https://github.com/microsoft/onnxruntime-training-examples

What is ONNX Runtime for PyTorch

ONNX Runtime for PyTorch gives you the ability to accelerate training of large transformer PyTorch models. The training time and cost are reduced with just a one line code change.

One line code change: ORT provides a one-line addition for existing PyTorch training scripts allowing easier experimentation and greater agility.

    from torch_ort import ORTModule
    model = ORTModule(model)

Flexible and extensible hardware support: The same model and API works with NVIDIA and AMD GPUs; the extensible "execution provider" architecture allow you to plug-in custom operators, optimizer and hardware accelerators.
Faster Training: Optimized kernels provide up to 1.4X speed up in training time.
Larger Models: Memory optimizations allow fitting a larger model such as GPT-2 on 16GB GPU, which runs out of memory with stock PyTorch.
Composable with other acceleration libraries such as Deepspeed, Fairscale, Megatron for even faster and more efficient training
Part of the PyTorch Ecosystem. It is available via the torch-ort python package.
Built on top of highly successful and proven technologies of ONNX Runtime and ONNX format.

ONNX Runtime Training Examples

This repo has examples for using ONNX Runtime (ORT) for accelerating training of Transformer models. These examples focus on large scale model training and achieving the best performance in Azure Machine Learning service. ONNX Runtime has the capability to train existing PyTorch models (implemented using torch.nn.Module) through its optimized backend. The examples in this repo demonstrate how ORTModule can be used to switch the training backend.

Examples

Outline the examples in the repository.

Example	Performance Comparison	Model Change
HuggingFace BART	See BART	No model change required
HuggingFace BERT	See BERT	No model change required
HuggingFace DeBERTa	See DeBERTa	See this commit
HuggingFace DistilBERT	See DistilBERT	No model change required
HuggingFace GPT2	See GPT2	No model change required
HuggingFace RoBERTa	See RoBERTa	See this commit
t5-large	See T5	See this PR

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Name	Name	Last commit message	Last commit date
Latest commit dependabot[bot] Bump torch from 2.0.1 to 2.2.0 in /on_device_training/mobile/ios (#195 ) Jul 25, 2024 05c70f7 · Jul 25, 2024 History 202 Commits
DragGAN	DragGAN	Blog came live - updating the link (#168 )	Jan 8, 2024
ImageClassification-finetune	ImageClassification-finetune	add readme for aml-vision/image-classification examples (#146 )	Jun 12, 2023
QnA-finetune	QnA-finetune	nebula working (#117 )	Apr 26, 2023
StableDiffusion-finetune	StableDiffusion-finetune	Update train_text_to_image.py	Jun 22, 2023
T5	T5	bug fixes to T5 demo (#150 )	Jun 26, 2023
huggingface	huggingface	removed GPT2 special treatment (#89 )	Apr 21, 2023
mistral-finetune	mistral-finetune	Bump aiohttp from 3.9.0 to 3.9.4 in /mistral-finetune/environment (#188 )	Jun 18, 2024
on_device_training	on_device_training	Bump torch from 2.0.1 to 2.2.0 in /on_device_training/mobile/ios (#195 )	Jul 25, 2024
optimum @ 4056d24	optimum @ 4056d24	Add Optimum submodule support + add working Dockerfiles (#81 )	Jul 22, 2022
phi2-finetune	phi2-finetune	add phi2 example (#184 )	Feb 9, 2024
transformers @ de46cde	transformers @ de46cde	Add Optimum submodule support + add working Dockerfiles (#81 )	Jul 22, 2022
whisper-finetune	whisper-finetune	Update README.md	May 17, 2023
.gitignore	.gitignore	[js/web/training] E2E MNIST demo (#177 )	Jan 29, 2024
.gitmodules	.gitmodules	Add Optimum submodule support + add working Dockerfiles (#81 )	Jul 22, 2022
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Initial CODE_OF_CONDUCT.md commit	May 11, 2020
LICENSE	LICENSE	Initial LICENSE commit	May 11, 2020
README.md	README.md	Decommission RoBERTa & DeBERTa; amend README's (#87 )	Aug 1, 2022
SECURITY.md	SECURITY.md	Initial SECURITY.md commit	May 11, 2020
cgmanifest.json	cgmanifest.json	Add `$schema` to `cgmanifest.json` (#90 )	Apr 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

New to Onnx

What is ONNX Runtime for PyTorch

ONNX Runtime Training Examples

Examples

Contributing

About

Releases

Packages

Contributors 30

Languages

License

microsoft/onnxruntime-training-examples

Folders and files

Latest commit

History

Repository files navigation

New to Onnx

What is ONNX Runtime for PyTorch

ONNX Runtime Training Examples

Examples

Contributing

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 30

Languages

Packages