Current version: v0.5.0
Raw news is useless to agents. Processed news is powerful.
Articles that update in-place at the same URL (e.g. FT's "More to come..." → real content) are now detected via content_hash comparison in seen_articles. Changed articles are re-clustered and re-enriched automatically.
/api/v1/configdebug_dedup(url, title?) — MCP tool to inspect dedup decisions, similarity signalsSee PROJECT.md for full schema and architecture details.
| Tool | Status | Notes |
|---|---|---|
get_latest_events |
✅ | Time-filtered via payload_ts SQL index |
get_events_for_entity |
✅ | SQL junction-table search |
get_event_summary |
✅ | LLM-written narrative |
detect_emerging_topics |
✅ | entity/keyword/phrase signal types |
get_news_sentiment |
✅ | SQL junction-table search |
get_related_recent_entities |
✅ | Co-occurrence + Google Trends blend |
get_feeds / toggle_feed |
✅ | Feed management |
debug_dedup |
✅ | Inspect dedup decisions (new in v0.5.0) |
Docker on thinkcenter-2 (192.168.0.200:8506):
cd ~/news-mcp && git pull && docker-compose up -d news-mcp
After schema changes, run backfill:
docker exec -it news-mcp python3 scripts/backfill_seen_articles.py