A Linux-first automation toolkit for managing distributed project metadata. It combines the efficiency of Bash scripting with the data-processing power of Python for enterprise-scale metadata operations.
- **Distributed Metadata Crawling** - Bash-powered parallel scraping of project metadata
- **Smart Data Processing** - Pandas/NumPy-based analysis pipelines
- **Cloud-Native** - Built-in AWS S3/GCS integration for metadata storage
- **YAML/JSON Schemas** - Type-safe metadata validation with Pydantic models
- **CLI Dashboard** - Rich-powered terminal interface for metadata exploration
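To give a feel for the dashboard idea, here is a minimal sketch of rendering crawled metadata as a terminal table with Rich; the `show_projects` helper and record fields are illustrative, not the shipped `metaflow` API:

```python
# Illustrative only: render crawled project records as a terminal table.
from rich.console import Console
from rich.table import Table

def show_projects(projects: list[dict]) -> None:
    table = Table(title="Project Metadata")
    table.add_column("Name", style="bold")
    table.add_column("Deps", justify="right")
    table.add_column("Contributors", justify="right")
    for p in projects:
        table.add_row(
            p["name"],
            str(len(p["dependencies"])),
            str(len(p["contributors"])),
        )
    Console().print(table)

show_projects([{"name": "Build-net", "dependencies": ["pandas"], "contributors": ["denezt"]}])
```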
- Linux environment (Ubuntu 22.04+ recommended)
- Python 3.10+ (`python3 -V`)
- pip 23.0+ (`pip3 --version`)
- Bash 5.1+ (`bash --version`)
- [Optional] Docker CE for containerized processing
```bash
# Clone with submodules (contains sample metadata schemas)
git clone --recurse-submodules https://github.com/denezt/build-net.git
cd build-net

# Create and activate a virtual environment
python3 -m venv .venv
source .venv/bin/activate

# Install production dependencies
pip3 install -r requirements.txt

# Symlink the CLI tool onto the global PATH
sudo ln -s "$(pwd)/metaflow" /usr/local/bin/metaflow
```
```bash
# Crawl projects (parallel execution)
metaflow crawl --projects ./projects/*.yaml --workers 8

# Generate analysis report
metaflow analyze --output report.html --format html

# Validate metadata schema
metaflow validate --schema schemas/project_meta.v1.json
```
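Under the hood, `analyze` boils down to a Pandas aggregation over the crawled records. A rough sketch of that kind of pipeline, assuming a `crawl-output/` directory of JSON records with `name` and `dependencies` fields (not the shipped implementation):

```python
# Sketch: aggregate crawled metadata records into an HTML summary.
import json
from pathlib import Path

import pandas as pd

records = [json.loads(p.read_text()) for p in Path("crawl-output").glob("*.json")]
df = pd.json_normalize(records)

# Example aggregation: dependency counts per project, largest first.
df["n_deps"] = df["dependencies"].str.len()
summary = df[["name", "n_deps"]].sort_values("n_deps", ascending=False)
summary.to_html("report.html", index=False)
```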
```bash
#!/bin/bash
# process_projects.sh - Parallel metadata ETL pipeline
set -euo pipefail

export AWS_BUCKET="my-metadata-store"
export PYTHONPATH="./src"

find ./projects -name "*.yaml" | parallel -j 4 '
  python3 -m pipeline.extract {} |
    python3 -m pipeline.transform |
    aws s3 cp - "s3://$AWS_BUCKET/processed/$(basename {} .yaml).json"
'
```
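The stages compose with pipes because each `pipeline.*` module acts as a stdin/stdout filter. A hypothetical `pipeline/transform.py` in that style (the field handling is an assumption, not the real module):

```python
# pipeline/transform.py (sketch): read one metadata record as JSON on
# stdin, normalize a few fields, and emit JSON on stdout.
import json
import sys

def main() -> None:
    record = json.load(sys.stdin)
    record["name"] = record.get("name", "").strip()
    record["dependencies"] = sorted(set(record.get("dependencies", [])))
    json.dump(record, sys.stdout)

if __name__ == "__main__":
    main()
```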
```mermaid
graph LR
    A[Raw Metadata] -->|Bash Crawler| B(JSON/YAML)
    B --> C[Pandas Cleanup]
    C --> D[Pydantic Validation]
    D --> E[(S3/GCS Storage)]
    E --> F[Analysis Dashboard]
```
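The last two stages of the diagram, Pydantic validation and object storage, could be wired together roughly as follows; the model fields are an illustrative subset, and Pydantic v2 plus boto3 are assumed:

```python
# Sketch: validate a record with Pydantic, then persist it to S3.
import boto3
from pydantic import BaseModel

class ProjectMeta(BaseModel):  # illustrative subset of the real schema
    name: str
    dependencies: list[str]
    contributors: list[str]

def store(record: dict, bucket: str = "my-metadata-store") -> None:
    meta = ProjectMeta(**record)  # raises ValidationError on bad input
    boto3.client("s3").put_object(
        Bucket=bucket,
        Key=f"processed/{meta.name}.json",
        Body=meta.model_dump_json().encode(),
    )
```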
```bash
# Run unit tests with coverage
pytest --cov=metaflow --cov-report=html

# Static analysis
flake8 src/ --max-complexity 10
mypy src/

# Benchmark metadata processing
python3 -m pytest benchmarks/ -m "perf"
```
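Benchmarks are selected via the `perf` pytest marker (register it in `pytest.ini` or `pyproject.toml` to silence warnings). A minimal test in that style, with a placeholder workload and time budget:

```python
# benchmarks/test_transform_perf.py (sketch): time a hot path and
# fail if it regresses past a generous budget.
import time

import pytest

@pytest.mark.perf
def test_dependency_dedup_is_fast() -> None:
    deps = [f"pkg-{i % 500}" for i in range(200_000)]
    start = time.perf_counter()
    deduped = sorted(set(deps))
    elapsed = time.perf_counter() - start
    assert len(deduped) == 500
    assert elapsed < 0.5  # placeholder budget, tune per machine
```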
```yaml
jobs:
  analysis:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: '3.10'
      - run: pip install -r requirements-dev.txt
      - run: |
          pytest --junitxml=test-results.xml
          flake8 src/ --exit-zero
          mypy src/
      - uses: codecov/codecov-action@v3
```
```yaml
# schemas/project_meta.v1.yaml
Project:
  type: object
  required:
    - name
    - dependencies
    - contributors
  properties:
    name:
      type: string
      pattern: "^[A-Z][a-z0-9_-]{3,}$"
    dependencies:
      type: array
      items:
        type: string
    contributors:
      type: array
      items:
        $ref: "#/Contributor"
```
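This schema maps naturally onto the Pydantic models used for validation. A sketch of that mapping with Pydantic v2; the `Contributor` fields are invented here, since its definition isn't shown above:

```python
# Sketch: Pydantic v2 models mirroring project_meta.v1.
from pydantic import BaseModel, Field

class Contributor(BaseModel):  # hypothetical fields; real ones live in the schemas submodule
    name: str
    email: str

class Project(BaseModel):
    name: str = Field(pattern=r"^[A-Z][a-z0-9_-]{3,}$")
    dependencies: list[str]
    contributors: list[Contributor]

Project.model_validate({
    "name": "Build-net",
    "dependencies": ["pandas"],
    "contributors": [{"name": "denezt", "email": "dev@example.com"}],
})
```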
- Install dev dependencies: `pip3 install -r requirements-dev.txt`
- Enable pre-commit hooks: `pre-commit install`
- Add new metadata schemas to the `schemas/` submodule
- Update integration tests in `tests/e2e/`
- Document new features in `man/metaflow.1.ronn`