Australians developing open-source large language models for Australia.
We’re a group of builders, researchers, and tinkerers working on open-source large language models.
No government spin, no corporate gatekeeping, no uni red tape. Just people who want to get hands-on, share knowledge, and make something real.
Everyone’s welcome — whether you’re brand new or already deep in the field. You can:
- Jump in with code, data, or documentation
- Help out with testing and feedback
- Join the chat, throw around ideas, or learn as you go
This is a community for people who want to make AI without the nonsense.
To keep things organised, every repository follows:
- 🧠 Models →
models-
for language models - 📊 Data →
data-
for datasets and pipelines - 📚 Docs →
docs-
for documentation, papers, admin - 🌐 Web →
web-
for web apps and portals - 🚀 Projects →
projects-
for new ideas and prototypes - 🛠 Infra →
infra-
for infrastructure and tooling
`
We’re rebranding from Southern Cross AI to Joey LLM.
This is a fresh start with clearer focus, real resources (GPUs + data pipelines), and stronger foundations.
You don’t need to be an expert — just willing to learn and contribute.
- 🖥 System Maintainer → infra, servers, deployments
- 🤖 Model Maintainer → training, fine-tuning, inference
- 🌐 Web Maintainer → apps, APIs, demos
- 📊 Data Maintainer → collection, cleaning, pipelines
- 📖 Docs Maintainer → guides, tutorials, onboarding
- 🎤 Community Maintainer → meetups, Discord, Hugging Face
📧 Contact:
- Matthew Altenburg: [email protected]
- Dale Rogers: [email protected]
- Andy Smith: [email protected]
Upcoming gatherings will be announced soon.
Check:
Expand to Explore
- LLM Visualization by Brendan Bycroft
- 3Blue1Brown: What is a GPT? • Attention Explained
- Articles: Embedding Spaces • Positional Encoding
- Karpathy tutorials: GPT-2 Reproduction • Tokenizer • GPT From Scratch
- Repos: minGPT • nanoGPT • build-nanogpt • nano-llama31
- ChatGPT: 30 Year History by Art of the Problem
- The moment we stopped understanding AI (AlexNet) by Welch Labs
- CNN Explainer
- NeuroCartography & Summit