Learn how to add fast, meaning-aware caching to your local LLM workflows using pgvector, embeddings, and a clean proxy wrapper pattern. A practical, hands-on tutorial for Java developers.
The dynamic proxy approach is particularly elegant here. Using InvocationHandler to intercept AI service calls without modifying the interface keeps everything clean and maintainable. One thing I'd be curious about is how the system handles concurrent requests with the same prompt: do you see any benefit from a short-lived in-memory cache layer before hitting pgvector?
I mean, sure. Memory is just a lot faster.
Regarding concurrent requests, I haven't had a lot of time to load test it. Would indeed be interesting.
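A minimal sketch of what such a short-lived front cache could look like, layered on the same InvocationHandler pattern. The interface and cache names here are illustrative, not the article's actual classes, and the pgvector-backed semantic cache is reduced to an interface:

```java
import java.lang.reflect.InvocationHandler;
import java.lang.reflect.Method;
import java.lang.reflect.Proxy;
import java.time.Duration;
import java.time.Instant;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical AI service interface, standing in for the article's proxy target.
interface AiAssistant {
    String chat(String prompt);
}

// Hypothetical semantic cache backed by pgvector (embedding + similarity lookup).
interface SemanticCache {
    String lookup(String prompt);           // null on cache miss
    void store(String prompt, String answer);
}

// Dynamic proxy that checks a short-lived exact-match in-memory cache first,
// then the pgvector-backed semantic cache, then the real model.
class CachingInvocationHandler implements InvocationHandler {

    private record Entry(String value, Instant expiresAt) {}

    private final AiAssistant delegate;
    private final SemanticCache semanticCache;
    private final Duration ttl = Duration.ofSeconds(30);
    private final Map<String, Entry> memoryCache = new ConcurrentHashMap<>();

    CachingInvocationHandler(AiAssistant delegate, SemanticCache semanticCache) {
        this.delegate = delegate;
        this.semanticCache = semanticCache;
    }

    @Override
    public Object invoke(Object proxy, Method method, Object[] args) throws Throwable {
        if (!"chat".equals(method.getName())) {
            return method.invoke(delegate, args);
        }
        String prompt = (String) args[0];

        // 1. Short-lived exact-match cache: absorbs bursts of identical prompts.
        Entry cached = memoryCache.get(prompt);
        if (cached != null && Instant.now().isBefore(cached.expiresAt())) {
            return cached.value();
        }

        // 2. Semantic cache (pgvector similarity search).
        String semanticHit = semanticCache.lookup(prompt);
        if (semanticHit != null) {
            memoryCache.put(prompt, new Entry(semanticHit, Instant.now().plus(ttl)));
            return semanticHit;
        }

        // 3. Real model call; populate both layers afterwards.
        String answer = (String) method.invoke(delegate, args);
        semanticCache.store(prompt, answer);
        memoryCache.put(prompt, new Entry(answer, Instant.now().plus(ttl)));
        return answer;
    }

    static AiAssistant wrap(AiAssistant delegate, SemanticCache semanticCache) {
        return (AiAssistant) Proxy.newProxyInstance(
                AiAssistant.class.getClassLoader(),
                new Class<?>[] { AiAssistant.class },
                new CachingInvocationHandler(delegate, semanticCache));
    }
}
```

Note that two identical prompts missing at the same instant can still both reach the model; if that matters under load, the in-memory layer could hold a CompletableFuture per prompt (via computeIfAbsent) so concurrent callers share a single in-flight call.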
How do we externalize the system message? That would make it configurable without code changes or redeployment. I tried to intercept the requests to RegisterAiService, similar to what is done here, but unfortunately LangChain4j does not allow modifying the chat message request. Is a system message provider a viable option? I'd appreciate any insight you could share.
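For reference, a minimal sketch of what the system-message-provider route could look like with LangChain4j's plain AiServices builder (as opposed to the Quarkus @RegisterAiService path). The interface, environment variable, and file path are made up for illustration, and builder method names such as chatLanguageModel and systemMessageProvider have shifted across LangChain4j versions, so treat this as a direction rather than a drop-in:

```java
import dev.langchain4j.model.chat.ChatLanguageModel;
import dev.langchain4j.service.AiServices;

import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical AI service interface; note there is no hard-coded @SystemMessage.
interface Assistant {
    String chat(String userMessage);
}

class ExternalSystemMessage {

    // Load the system prompt from a file whose location comes from an environment
    // variable, so it can change without touching code or redeploying.
    static String load() {
        String path = System.getenv()
                .getOrDefault("SYSTEM_PROMPT_FILE", "config/system-prompt.txt");
        try {
            return Files.readString(Path.of(path));
        } catch (Exception e) {
            return "You are a helpful assistant."; // fallback default
        }
    }

    static Assistant build(ChatLanguageModel model) {
        return AiServices.builder(Assistant.class)
                .chatLanguageModel(model)
                // The provider is consulted when the request is assembled,
                // keyed by the chat memory id, so the text stays external.
                .systemMessageProvider(memoryId -> load())
                .build();
    }
}
```

Because the system message is resolved through a function rather than an annotation, the prompt text lives outside the compiled code and can be swapped by pointing the environment variable at a different file.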