Dolphin 3.0 🐬: Versatile AI for coding, math, and more
Updated Mar 12, 2025 · Python
Chat data cleaning, filtering and deduplication pipeline.
A 138M-parameter ChatML training stack optimized for Apple Silicon via MLX. Features a curated Quality2K continuation curriculum and v18 SFT alignment.
Deepseek-Dataset-Generator creates conversational datasets for LLM fine-tuning via DeepSeek API. Supports various formats (ChatML, ShareGPT, Alpaca, JSON, CSV), easy configuration via YAML and detailed logs. Ideal for generating realistic and customized data quickly.
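As a minimal sketch of two of the layouts such generators emit (the function names here are hypothetical, not the generator's actual API), a single instruction/response pair might be rendered into Alpaca-style and ShareGPT-style records like this:

```python
import json

def to_alpaca(instruction, response, context=""):
    # Alpaca layout: flat record with instruction/input/output fields.
    return {"instruction": instruction, "input": context, "output": response}

def to_sharegpt(instruction, response):
    # ShareGPT layout: a list of {from, value} conversation turns.
    return {"conversations": [
        {"from": "human", "value": instruction},
        {"from": "gpt", "value": response},
    ]}

pair = ("What is the capital of France?", "Paris.")
print(json.dumps(to_alpaca(*pair)))
print(json.dumps(to_sharegpt(*pair)))
```

Each record would then be written as one line of a JSONL file, which is the shape most fine-tuning toolkits expect.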
Fine-tuned small language models (Qwen3-0.6B, Gemma3-1B) to detect prompt injection attacks using reasoning-augmented supervised fine-tuning with ChatML templates. Achieves 95-99% accuracy on adversarial prompts including goal hijacking, DAN jailbreaks, and obfuscation attacks.
Notes on prompting OpenAI models, also covering other common prompt patterns such as the Alpaca prompt and the [INST] prompt.
Paste your function, hit convert, and get a clean summary ready for use in LLM-based systems.
LLM Scribe is a toolkit for quickly and easily creating hand-written datasets for LLM fine-tuning. Automatically outputs to multiple common fine-tuning formats such as ChatML, Alpaca, and more.
Generate instruction-tuning datasets (JSONL) from structured data using Claude
Week 5 project: build a hybrid retriever that fuses FAISS dense vectors with SQLite FTS5/BM25 keyword search (RRF/weighted fusion), plus a Supervised Fine-Tuning (SFT) pipeline (Full FT vs LoRA/QLoRA) using TRL/PEFT/DeepSpeed.
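The RRF fusion step mentioned above is simple enough to sketch. This is a generic Reciprocal Rank Fusion implementation, not the project's code: each retriever contributes 1 / (k + rank) per document, and documents found by both the dense and the keyword retriever rise to the top (the doc IDs below are made up for illustration).

```python
def rrf_fuse(rankings, k=60):
    # Reciprocal Rank Fusion: score(d) = sum over result lists of 1 / (k + rank_d),
    # where rank is 1-based. k=60 is the value commonly used in the literature.
    scores = {}
    for ranked in rankings:
        for rank, doc_id in enumerate(ranked, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest combined score first.
    return sorted(scores, key=scores.get, reverse=True)

dense = ["d3", "d1", "d2"]   # e.g. FAISS dense-vector results
sparse = ["d1", "d3", "d4"]  # e.g. SQLite FTS5/BM25 keyword results
fused = rrf_fuse([dense, sparse])
# d1 and d3 appear in both lists, so they outrank d2 and d4.
```

A weighted variant simply multiplies each list's contribution by a per-retriever weight before summing.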
A Python-based interactive CLI interface for chatting with Hugging Face language models, optimized for casual, Discord-style conversation using ChatML. Supports both quantized and full-precision models, live token streaming with color formatting, and dynamic generation parameter adjustment.
Upload data to PostHog-LLM
A dataset toolbox for preparing and analyzing conversational datasets, including CSV splitting, CSV-to-Parquet conversion, dataset statistics, Parquet cleaning and sorting, HuggingFace-style metadata generation, and batched chain insertion into PostgreSQL — with Rich progress bars, multiprocessing, and batching sized to fit in 32 GB of RAM.
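The CSV-splitting step in a toolbox like this can be sketched with a small chunking generator (this is an illustrative sketch, not the toolbox's actual code): rows stream through in fixed-size chunks, each carrying a copy of the header, so an arbitrarily large file never has to fit in memory at once.

```python
def split_rows(rows, header, chunk_size):
    # Yield chunks of at most chunk_size data rows, each prefixed with the
    # header row, so every output chunk is itself a valid CSV table.
    chunk = []
    for row in rows:
        chunk.append(row)
        if len(chunk) == chunk_size:
            yield [header] + chunk
            chunk = []
    if chunk:  # final partial chunk
        yield [header] + chunk

header = ["id", "text"]
data = [[str(i), f"msg {i}"] for i in range(5)]
chunks = list(split_rows(data, header, chunk_size=2))
# 5 rows with chunk_size=2 -> three chunks of 2, 2, and 1 data rows.
```

In practice `rows` would come from `csv.reader` over the input file and each chunk would be written out with `csv.writer`, keeping peak memory bounded by `chunk_size`.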
A flexible TypeScript framework for ingesting, processing, and exporting chat/conversation data for LLM training and analysis.
Standardized spec and vendor-specific transforms for ChatML
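For readers new to the topic, the ChatML wire format itself is compact enough to show inline. This is a minimal serializer sketch (not the spec repo's code): each turn is wrapped in `<|im_start|>role` and `<|im_end|>` markers.

```python
def to_chatml(messages):
    # Serialize role-tagged messages into the ChatML text format:
    # <|im_start|>{role}\n{content}<|im_end|>, one block per turn.
    return "\n".join(
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>"
        for m in messages
    )

msgs = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]
print(to_chatml(msgs))
```

Vendor-specific transforms then map this standardized message list onto each provider's own template (e.g. Alpaca or [INST] layouts) instead of the ChatML markers.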
The Anti-Hallucination data layer for B2B Sourcing. Deep-verified global supply chain entities designed for RAG and LLM instruction tuning.