Histórico de commits

Autor SHA1 Mensagem Data
  Lukas Goldschmidt b22882c580 feat: article_identity module, site_config DB table, debug_dedup tool 6 dias atrás
  Lukas Goldschmidt 28f4322c94 feat: seen_articles table + dual-signal clustering + lower title threshold 6 dias atrás
  Lukas Goldschmidt 8fe316db24 debug scripts 6 dias atrás
  Lukas Goldschmidt 2c049a1c7e fix wipe.sh: proper .env sourcing, inline Python, respect NEWS_MCP_DB_PATH 1 semana atrás
  Lukas Goldschmidt ea3bc2c757 fix wipe.sh: source .env properly, not via xargs export 1 semana atrás
  Lukas Goldschmidt 0d116bc74d add wipe.sh: source .venv + .env, clear all but feed_state 1 semana atrás
  Lukas Goldschmidt 14dbabaad7 add script: clear all news data but keep feed_state rows 1 semana atrás
  Lukas Goldschmidt f7f08dc990 seed feed_state from .env at startup — INSERT OR IGNORE, leave existing alone 1 semana atrás
  Lukas Goldschmidt 764aac573e simpler: don't auto-disable feeds removed from .env — leave DB config alone 1 semana atrás
  Lukas Goldschmidt 7f5176281a fix: track disabled feeds, reset their item count to 0 1 semana atrás
  Lukas Goldschmidt 2ae7bd92e4 fix: feed dedup hash must not include timestamp — only title+url 1 semana atrás
  Lukas Goldschmidt b3d915c73d fix: keyword arg for _cluster_is_within_age_window 1 semana atrás
  Lukas Goldschmidt 2670ed9d44 refactor: rewrite polling loop as ClusterPoller class with per-cycle stats 1 semana atrás
  Lukas Goldschmidt fab4b5ec31 fix: keep cluster_id stable across cycles — never recompute after creation 1 semana atrás
  Lukas Goldschmidt c3cc8103fe fix: skip LLM enrichment for clusters that already have entities and keywords 1 semana atrás
  Lukas Goldschmidt bb2b345be3 fix: prune on payload_ts (event time) + pre-filter articles older than retention 1 semana atrás
  Lukas Goldschmidt 3b3b8ced25 docs: add NEWS_LLM_CONCURRENCY and NEWS_LLM_RATE_LIMIT env vars to README 1 semana atrás
  Lukas Goldschmidt 2b778f45c7 feat: per-provider LLM rate limiter (token bucket) 1 semana atrás
  Lukas Goldschmidt 065421f71f fix: preserve enriched_at through sanitize_cluster_payload 1 semana atrás
  Lukas Goldschmidt 2db56b7dc0 poller llm calls reduced 1 semana atrás
  Lukas Goldschmidt 0e2119d549 prompt 1 semana atrás
  Lukas Goldschmidt 8813368e83 clamp future timestamps to now on ingest + remove stale prompt test 1 semana atrás
  Lukas Goldschmidt a907f7bb81 prompt 1 semana atrás
  Lukas Goldschmidt 45094bbb5b prompt 1 semana atrás
  Lukas Goldschmidt 535f0bef64 extraction prompt again 1 semana atrás
  Lukas Goldschmidt b73e04cd73 extraction prompt 1 semana atrás
  Lukas Goldschmidt e77a2e6e3e prompt, keywords filter, emerging topics - related entities 1 semana atrás
  Lukas Goldschmidt 7981f483f0 fix: cross-topic dedup pass to merge duplicate clusters from different feeds 1 semana atrás
  Lukas Goldschmidt 4ed086e30c fix: deduplicate related_keywords against related_entities in emerging topics 1 semana atrás
  Lukas Goldschmidt 1f8c568ffe fix: entity results in detect_emerging_topics now get related_keywords 1 semana atrás