Discussion about this post

Neural Foundry

Absolutely stellar breakdown of semantic caching as a production pattern. The example of "Who is the CEO?" and "Who runs the company?" getting the same cached response really nails why traditional key-value caching breaks down with LLMs. I've seen teams burn through API costs because they didn't realize similarity-based caching was even possible. It seems obvious in hindsight, but most devs still think in exact string matches.

Buhake Sindi

For question #2, LangChain4J-CDI (a LangChain4J project) was built to integrate LangChain4J with the Jakarta EE CDI ecosystem. So the integration isn't limited to Quarkus and Spring Boot: LangChain4J can be used with any Java enterprise framework that supports CDI too.
