LM-Kit.NET
用于构建 LLM 应用程序的企业级 .NET SDK。
由以下开发商制作:LM-Kit
ComponentSource开始代理销售的日期:2025年
Released: May 28, 2026
HibernationMode, IKVCache, Configuration, MultiTurnConversation, PdfChat, RagChat, AgentExecutor): new HibernationMode enum (Auto, None, Forced), per-conversation IKVCache.HibernationMode get/set property on every multi-turn conversation type, and a process-wide Configuration.DefaultContextHibernationMode that seeds the value on every new conversation. In Forced mode the runtime hibernates the KV-cache to disk in the background at the end of every turn...Released: May 24, 2026
LMKit.Model): new LM(Stream) loads GGUF and LMK archives from a stream; LM.LoadEncryptedFromStream(...) does the same for encrypted GGUF (.lmke). No on-disk extraction.LMKit.Data.Attachment, LM-Kit.Server): .md / .markdown files are now recognized as plain-text attachments (text/markdown), handled identically to .txt end to end.Released: May 18, 2026
LM-Kit.NET package now contains a runtimes/win-arm64/native/ folder with the default CPU (ARM Neon) and Vulkan backends (Qualcomm Adreno acceleration via Vulkan...Released: May 10, 2026
qwen3.6:27b (27B dense hybrid Gated DeltaNet + Gated Attention model, 64 blocks) and qwen3.6:35b-a3b (35B MoE, 3B active, 256 experts with 8 routed plus 1 shared, 40 blocks). Both support chat, vision, tool calling, code completion, math, and OCR with a native 262K context window extensible to 1M tokens via YaRN. The qwen3.5:27b and qwen3.5:35b-a3b model cards are now deprecated in favor of qwen3.6:27b and qwen3.6:35b-a3b respectively.Released: May 6, 2026
LMKit.Cryptography, LMKit.Model, LMKit.Hardware): LM.LoadEncrypted now honors LoadingOptions.LoadTensors = false, mirroring the plaintext metadata-only path - only the metadata block is decrypted, no tensor bytes are read, and the resulting LM exposes architecture, vocabulary, context length, layer count, and other GGUF metadata. Use this for fast catalog inspection or pre-flight checks on protected .lmke containers.Released: Apr 23, 2026
IKVCache public interface (LMKit.Inference): exposes residency, warmup, and hibernation capabilities on objects that own an inference KV-cache. Implemented by MultiTurnConversation, PdfChat, RagChat, and AgentExecutor (cast the instance to IKVCache to access).
KVCacheContent (textual projection of the cache).Residency (current ContextResidency).Warmup() (eagerly initializes the context or rehydrates it from disk so the first user-facing call is not penalized by...Released: Apr 17, 2026
ImageDetail property to IConversation and VlmOcr (LMKit.TextGeneration, LMKit.Extraction.Ocr): controls the pixel budget used when processing images for vision models. Accepts a member of the ImageDetail enumeration (Minimal, Low, Standard, High, Maximal). Default is High. Available on MultiTurnConversation, SingleTurnConversation, AgentExecutor, RagChat, PdfChat, and VlmOcr.TranslationProgress event to TextTranslation (LMKit.Translation): reports chunk-level...Released: Apr 10, 2026
OcrParameters, PdfSearchableMakerOptions, and PdfGenerationOptions now accept optional Languages and EnableOrientationDetection overrides. The PDF OCR server endpoint exposes languages and detect_orientation fields.Thinking tokens for internal reasoning segments.Released: Apr 4, 2026
gemma4:e2b, gemma4:e4b, and gemma4:26b-a4b: Google Gemma 4 multimodal Mixture-of-Experts models with vision and tool calling support.Released: Mar 28, 2026
'translategemma3:4b' and 'translategemma3:12b'. Google TranslateGemma 3 open translation models built on Gemma 3, supporting 55 languages with text and image inputs.Translation model capability (ModelCapabilities.Translation): New flag identifying models specifically trained for multilingual translation.'nemotron3-nano:4b'. NVIDIA Nemotron 3 Nano 4B, an edge-ready hybrid Mamba-2/Transformer model (3.97B parameters) with 262K context...