Accelerating LLM Inference with Prompt Caching for Open‑Source Models on Databricks | Endigest