ARTFEED — Contemporary Art Intelligence

TSCG: Deterministic Tool-Schema Compilation Boosts LLM Tool-Use Accuracy

ai-technology · 2026-05-07

The newly developed deterministic compiler, TSCG, addresses the discrepancies between JSON tool schemas and small language models with 4B-14B parameters. Frameworks for production agents such as OpenAI Function Calling, Anthropic Tool Use, and MCP utilize JSON for tool schemas, which, while optimized for machine parsing, are not effectively interpreted by LLMs, leading to frequent tool-use failures in production catalog sizes. TSCG transforms JSON schemas into token-efficient structured text at the API boundary, eliminating the need for model access, fine-tuning, or runtime searches. It employs eight composable operators and guarantees a formal compression bound of at least 51% for well-formed schemas. In the TSCG-Agentic-Bench benchmark, which includes around 19,000 calls across 12 models and 5 scenarios, TSCG improved Phi-4 14B’s accuracy from 0% to 84.4% with 20 tools and 90.3% with 50 tools. It also achieved a 108-181% accuracy-retained ratio across three models on BFCL. The format-versus-compression decomposition (R²=0.88 → 0.03) indicates that representation is the key factor.

Key facts

  • TSCG is a deterministic tool-schema compiler for small LLMs (4B-14B parameters).
  • It resolves the protocol mismatch between JSON schemas and language models.
  • Works with OpenAI Function Calling, Anthropic Tool Use, and MCP frameworks.
  • Converts JSON to token-efficient structured text without fine-tuning or runtime search.
  • Combines eight composable operators with a formal compression bound (≥51%).
  • On TSCG-Agentic-Bench, Phi-4 14B accuracy rose from 0% to 84.4% at 20 tools and 90.3% at 50 tools.
  • Achieved 108-181% accuracy-retained ratio across three models on BFCL.
  • Format-versus-compression decomposition (R²=0.88→0.03) shows representation is the key factor.

Entities

Institutions

  • OpenAI
  • Anthropic
  • MCP
  • BFCL

Sources