LM-Kit.NET 新版本发表

Released: May 28, 2026

2026.5.5 中的更新

特性

  • Context hibernation policy (HibernationMode, IKVCache, Configuration, MultiTurnConversation, PdfChat, RagChat, AgentExecutor): new HibernationMode enum (Auto, None, Forced), per-conversation IKVCache.HibernationMode get/set property on every multi-turn conversation type, and a process-wide Configuration.DefaultContextHibernationMode that seeds the value on every new conversation. In Forced mode the runtime hibernates the KV-cache to disk in the background at the end of every turn...

Released: May 24, 2026

2026.5.4 中的更新

特性

  • Stream-based model loading (LMKit.Model): new LM(Stream) loads GGUF and LMK archives from a stream; LM.LoadEncryptedFromStream(...) does the same for encrypted GGUF (.lmke). No on-disk extraction.
  • Markdown attachment support (LMKit.Data.Attachment, LM-Kit.Server): .md / .markdown files are now recognized as plain-text attachments (text/markdown), handled identically to .txt end to end.
  • Multi-Token Prediction (MTP) self-speculative decoding: a new generation accelerator for models...

Released: May 18, 2026

2026.5.3 中的更新

特性

  • Added Windows ARM64 platform support: LM-Kit.NET now ships native binaries for Windows ARM64 alongside the existing Windows x64, Linux x64, Linux ARM64, and macOS Universal targets. Covers Snapdragon X / Copilot+ PCs, Surface Pro X / Surface Pro 9 (5G), Windows Dev Kit 2023, and other Qualcomm-powered Windows 11 devices. The base LM-Kit.NET package now contains a runtimes/win-arm64/native/ folder with the default CPU (ARM Neon) and Vulkan backends (Qualcomm Adreno acceleration via Vulkan...

Released: May 10, 2026

2026.5.2 中的更新

特性

  • Added model cards for Qwen 3.6 family: qwen3.6:27b (27B dense hybrid Gated DeltaNet + Gated Attention model, 64 blocks) and qwen3.6:35b-a3b (35B MoE, 3B active, 256 experts with 8 routed plus 1 shared, 40 blocks). Both support chat, vision, tool calling, code completion, math, and OCR with a native 262K context window extensible to 1M tokens via YaRN. The qwen3.5:27b and qwen3.5:35b-a3b model cards are now deprecated in favor of qwen3.6:27b and qwen3.6:35b-a3b respectively.
  • Added...

Released: May 6, 2026

2026.5.1 中的更新

特性

  • Extended encrypted GGUF support to metadata-only scenarios (LMKit.Cryptography, LMKit.Model, LMKit.Hardware): LM.LoadEncrypted now honors LoadingOptions.LoadTensors = false, mirroring the plaintext metadata-only path - only the metadata block is decrypted, no tensor bytes are read, and the resulting LM exposes architecture, vocabulary, context length, layer count, and other GGUF metadata. Use this for fast catalog inspection or pre-flight checks on protected .lmke containers.
  • Added...

Released: Apr 23, 2026

2026.4.4 中的更新

特性

  • Introduced IKVCache public interface (LMKit.Inference): exposes residency, warmup, and hibernation capabilities on objects that own an inference KV-cache. Implemented by MultiTurnConversation, PdfChat, RagChat, and AgentExecutor (cast the instance to IKVCache to access).
    • Members:
      • KVCacheContent (textual projection of the cache).
      • Residency (current ContextResidency).
      • Warmup() (eagerly initializes the context or rehydrates it from disk so the first user-facing call is not penalized by...

Released: Apr 17, 2026

2026.4.3 中的更新

特性

  • Added ImageDetail property to IConversation and VlmOcr (LMKit.TextGeneration, LMKit.Extraction.Ocr): controls the pixel budget used when processing images for vision models. Accepts a member of the ImageDetail enumeration (Minimal, Low, Standard, High, Maximal). Default is High. Available on MultiTurnConversation, SingleTurnConversation, AgentExecutor, RagChat, PdfChat, and VlmOcr.
  • Added TranslationProgress event to TextTranslation (LMKit.Translation): reports chunk-level...

Released: Apr 10, 2026

2026.4.2 中的更新

特性

  • Added thinking mode support for Gemma 4 models (E2B, E4B, 26B).
  • Added per-request language and orientation detection to PDF OCR: OcrParameters, PdfSearchableMakerOptions, and PdfGenerationOptions now accept optional Languages and EnableOrientationDetection overrides. The PDF OCR server endpoint exposes languages and detect_orientation fields.
  • Orchestration streaming now emits Thinking tokens for internal reasoning segments.

Released: Apr 4, 2026

2026.4.1 中的更新

特性

  • Added model cards for gemma4:e2b, gemma4:e4b, and gemma4:26b-a4b: Google Gemma 4 multimodal Mixture-of-Experts models with vision and tool calling support.
  • Inference speed improvements.
  • Improved overall extraction accuracy.

Released: Mar 28, 2026

2026.3.5 中的更新

特性

  • Added model cards for 'translategemma3:4b' and 'translategemma3:12b'. Google TranslateGemma 3 open translation models built on Gemma 3, supporting 55 languages with text and image inputs.
  • Added Translation model capability (ModelCapabilities.Translation): New flag identifying models specifically trained for multilingual translation.
  • Added model card for 'nemotron3-nano:4b'. NVIDIA Nemotron 3 Nano 4B, an edge-ready hybrid Mamba-2/Transformer model (3.97B parameters) with 262K context...