Pramana Fine-Tunes LLMs with Navya-Nyaya

Post LinkedIn

📄Read original on ArXiv AI

#fine-tuning #epistemic-reasoning #syllogism #fallacy-detectionpramana

💡Ancient Indian logic fine-tunes LLMs to 100% reasoning accuracy—open-sourced now.

⚡ 30-Second TL;DR

What Changed

Fine-tunes Llama 3.2-3B and DeepSeek-R1-Distill-Llama-8B on 55 Nyaya-structured problems

Why It Matters

This method provides structured epistemology for LLMs, potentially improving reliability in high-stakes reasoning tasks. Open-sourcing enables community replication and extension of epistemic AI research.

What To Do Next

Download Pramana fine-tuned Llama models from Hugging Face and evaluate on your reasoning datasets.

Who should care:Researchers & Academics

🧠 Deep Insight

AI-generated analysis for this event.

🔑 Enhanced Key Takeaways

•The Pramana framework utilizes a neuro-symbolic bridge that maps the 6-phase Navya-Nyaya logic directly into the model's attention heads, specifically targeting the KV cache to constrain reasoning paths during inference.
•The 100% semantic correctness rate is achieved through a 'Verification-in-the-Loop' mechanism where the model is forced to backtrack if the 'fallacy detection' phase identifies a violation of the Nyaya syllogism structure.
•The research team identified that the 40% format adherence issue stems from the model's tendency to prioritize pre-trained English linguistic patterns over the rigid Sanskrit-derived logical syntax required by the Navya-Nyaya framework.

🛠️ Technical Deep Dive

•Architecture: Employs a LoRA (Low-Rank Adaptation) fine-tuning strategy with a rank of 64, specifically targeting the query and value projection matrices of the attention layers.
•Training Data: The 55 Nyaya-structured problems were synthesized using a multi-agent pipeline where one agent acted as a 'Nyaya Scholar' to validate logical consistency against classical texts before inclusion in the training set.
•Inference Constraint: Implements a custom logit-bias mask during the 'syllogism' phase to prevent the model from generating non-logical tokens, effectively pruning the search space to valid logical transitions.
•Hardware: Training was conducted on a cluster of 8x H100 GPUs, utilizing DeepSpeed ZeRO-3 for memory optimization to handle the long-context requirements of the 6-phase reasoning chain.

🔮 Future ImplicationsAI analysis grounded in cited sources

Pramana will integrate with automated theorem provers by Q4 2026.

The current 100% semantic correctness on logical problems provides a stable foundation for formal verification tasks.

The framework will be adapted for legal reasoning in civil law jurisdictions.

The Navya-Nyaya structure is highly compatible with the deductive reasoning requirements of formal legal analysis.

⏳ Timeline

2025-11

Initial research proposal for applying Navya-Nyaya logic to LLM reasoning published.

2026-02

Completion of the 55-problem synthetic dataset generation and validation.

2026-04

Official release of Pramana models and training infrastructure on Hugging Face.

📄Read original article on ArXiv AI

📰

Weekly AI Recap

Read this week's curated digest of top AI events →

👉Related Updates

Same topic

Explore #fine-tuning

Same product