Fine-Tuning Pipeline

Custom Training

Turn your codebase into a fine-tuned model. Automated parsing, dataset generation, and training — all in your infrastructure.

Pipeline Walkthrough

Step 1 of 6

Select Training Mode

Choose how you want to generate your dataset.

Architecture

Git RepoParserChunkerPrompt GenJSONLValidationFine-tune APILoRA Adapter

Code Examples

parse_repo.py
# Parse a Python repository into chunks
from training_pipeline import RepoParser

parser = RepoParser(
    repo_path="./my-project",
    extensions=[".py", ".ts", ".tsx"],
    ignore=["node_modules", ".git", "__pycache__"],
    chunk_size=512,
    overlap=64,
)

chunks = parser.parse()
print(f"Generated {len(chunks)} chunks")