Fine-Tuning Pipeline
Custom Training
Turn your codebase into a fine-tuned model. Parsing, dataset generation, and training are automated and run entirely in your infrastructure.
Pipeline Walkthrough
Step 1 of 6
Select Training Mode
Choose how you want to generate your dataset.
Code Examples
parse_repo.py
# Parse a repository's source files into overlapping chunks
from training_pipeline import RepoParser

parser = RepoParser(
    repo_path="./my-project",
    extensions=[".py", ".ts", ".tsx"],  # file types to include
    ignore=["node_modules", ".git", "__pycache__"],  # directories to skip
    chunk_size=512,
    overlap=64,
)
chunks = parser.parse()
print(f"Generated {len(chunks)} chunks")
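
To make the extensions, ignore, chunk_size, and overlap parameters more concrete, here is a minimal standalone sketch of what a parser like this might do internally. It is not the training_pipeline implementation: the helper names iter_source_files and chunk_with_overlap are hypothetical, and it assumes chunk_size and overlap count items in a sequence (for example tokens or lines), which the snippet above does not specify.

```python
# Hypothetical sketch: these helpers are illustrative and are not part of training_pipeline.
import os
from typing import Iterator, List, Sequence


def iter_source_files(
    repo_path: str, extensions: Sequence[str], ignore: Sequence[str]
) -> Iterator[str]:
    """Yield paths under repo_path with a listed extension, skipping ignored directories."""
    ignored = set(ignore)
    for root, dirs, files in os.walk(repo_path):
        # Prune ignored directories in place so os.walk does not descend into them.
        dirs[:] = sorted(d for d in dirs if d not in ignored)
        for name in sorted(files):
            if name.endswith(tuple(extensions)):
                yield os.path.join(root, name)


def chunk_with_overlap(
    items: Sequence[str], chunk_size: int, overlap: int
) -> List[List[str]]:
    """Split items into windows of chunk_size that share `overlap` items with the previous window."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap  # distance between the starts of consecutive windows
    chunks: List[List[str]] = []
    for start in range(0, len(items), step):
        chunks.append(list(items[start:start + chunk_size]))
        if start + chunk_size >= len(items):
            break  # this window already reaches the end of the input
    return chunks
```

Under these assumptions, chunk_size=512 and overlap=64 mean each window starts 448 items after the previous one, so the last 64 items of one chunk reappear at the start of the next. The overlap keeps context that would otherwise be cut at a chunk boundary, such as a function signature separated from its body.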