docs: add NEWS_LLM_CONCURRENCY and NEWS_LLM_RATE_LIMIT env vars to README

Lukas Goldschmidt, 1 week ago
parent
commit
3b3b8ced25
1 changed file with 2 additions and 0 deletions
README.md (+2, -0)

@@ -88,6 +88,8 @@ Key variables:
 - `ENRICH_OTHER_TOPICS_ONLY` (default false; set true to only LLM-enrich "other" topic clusters)
 - `ENRICHMENT_MAX_PER_REFRESH` (default 0 = no limit; max clusters to LLM-enrich per refresh cycle)
 - `NEWS_LLM_DEBUG` (default false; enable debug logging for LLM calls)
+- `NEWS_LLM_CONCURRENCY_<PROVIDER>` (e.g. `NEWS_LLM_CONCURRENCY_GROQ`; max concurrent outbound LLM calls per provider; overrides the built-in defaults: groq=8, openai=5, openrouter=2)
+- `NEWS_LLM_RATE_LIMIT_<PROVIDER>` (e.g. `NEWS_LLM_RATE_LIMIT_GROQ`; max LLM calls per second per provider. Set to `0` to disable rate limiting. Built-in defaults: groq=1.0, openai=5.0, openrouter=2.0)
 - `NEWS_EMBEDDINGS_ENABLED` (default false; enables Ollama embeddings for clustering)
 - `OLLAMA_BASE_URL` / `OLLAMA_URL` (default `http://127.0.0.1:11434`)
 - `OLLAMA_EMBEDDING_MODEL` (default `nomic-embed-text`)
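
As a rough illustration of how the two new per-provider variables might be resolved, here is a minimal Python sketch. It is not taken from the project's code: the function name `resolve_llm_limits`, the fallback values for unknown providers, and the clamping of concurrency to at least 1 are all assumptions made for the example. Only the variable names and the built-in defaults come from the README lines above.

```python
import os

# Built-in defaults as listed in the README lines added by this commit.
DEFAULT_CONCURRENCY = {"groq": 8, "openai": 5, "openrouter": 2}
DEFAULT_RATE_LIMIT = {"groq": 1.0, "openai": 5.0, "openrouter": 2.0}


def resolve_llm_limits(provider, env=None):
    """Return (max_concurrent_calls, calls_per_second or None) for a provider.

    Hypothetical helper: a rate of None means rate limiting is disabled.
    """
    env = os.environ if env is None else env
    name = provider.lower()
    suffix = provider.upper()

    # Assumption: providers without a documented default fall back to 1.
    concurrency = DEFAULT_CONCURRENCY.get(name, 1)
    raw = env.get(f"NEWS_LLM_CONCURRENCY_{suffix}")
    if raw:
        # Assumption: values below 1 are clamped rather than rejected.
        concurrency = max(1, int(raw))

    # Assumption: providers without a documented default fall back to 1.0.
    rate = DEFAULT_RATE_LIMIT.get(name, 1.0)
    raw = env.get(f"NEWS_LLM_RATE_LIMIT_{suffix}")
    if raw:
        rate = float(raw)

    # Per the README, a rate limit of 0 disables rate limiting.
    return concurrency, (rate if rate > 0 else None)
```

For example, with `NEWS_LLM_CONCURRENCY_GROQ=3` and `NEWS_LLM_RATE_LIMIT_GROQ=0` set, `resolve_llm_limits("groq")` in this sketch would yield a concurrency of 3 with rate limiting disabled.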