news-mcp release notes
v0.2.0 — embedding-aware clustering and richer agent tools
Highlights
- Optional Ollama embedding path for clustering (NEWS_EMBEDDINGS_ENABLED=true)
- Configurable Ollama base URL and embedding model
- Tunable embedding similarity threshold (NEWS_EMBEDDING_SIMILARITY_THRESHOLD)
- New agent tool: get_related_entities(subject, timeframe, limit)
- Optional article payloads for get_latest_events, get_events_for_entity, and get_event_summary
- Improved emerging-topic scoring with co-occurrence and importance weighting
- Blacklist enforcement back-clean script for stored clusters
- Embedding backfill script for older clusters
- Embedding similarity analysis script for threshold tuning
- Embedding-based merge script with dry-run and wet (apply changes) modes
- Article dedup cleanup for repeated article variants inside clusters
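
As a rough illustration of how the two embedding settings above might interact, the sketch below reads them from an environment mapping and compares an article vector against a cluster vector. The function names, the 0.8 default, and the use of cosine similarity are assumptions made for this example; they are not taken from the news-mcp source.

```python
import math
import os


def load_embedding_settings(env=None):
    """Read the two settings named in the highlights from an environment mapping."""
    env = os.environ if env is None else env
    enabled = env.get("NEWS_EMBEDDINGS_ENABLED", "false").strip().lower() == "true"
    # 0.8 is a placeholder default assumed for this sketch, not the project's value.
    threshold = float(env.get("NEWS_EMBEDDING_SIMILARITY_THRESHOLD", "0.8"))
    return enabled, threshold


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def should_join_cluster(article_vec, cluster_vec, threshold):
    """Hypothetical rule: join the cluster when similarity reaches the threshold."""
    return cosine_similarity(article_vec, cluster_vec) >= threshold
```

Raising the threshold makes clusters stricter (fewer, tighter merges); lowering it makes them looser. The similarity analysis script listed above is the intended way to pick a value for a real dataset.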
Notes
- Ollama embeddings are tried first when enabled; heuristic clustering remains the fallback.
- The merge script is intentionally destructive in wet mode; always do a dry run first.
- The article dedup cleanup script is safe to run after ingestion or on the historical dataset.
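
The notes above describe two behaviours that can be sketched in a few lines: trying the embedding path before falling back to heuristic clustering, and reporting merges in a dry run before applying them. Everything named here (assign_cluster, merge_clusters, the callables, and the dict-of-lists cluster shape) is hypothetical and chosen only for illustration; the actual scripts may work differently.

```python
def assign_cluster(article, embed_fn, heuristic_fn, embeddings_enabled):
    """Try the embedding path first; fall back to the heuristic otherwise."""
    if embeddings_enabled:
        try:
            result = embed_fn(article)
            if result is not None:
                return result, "embedding"
        except Exception:
            # Assumed behaviour: any embedding failure falls through to the heuristic.
            pass
    return heuristic_fn(article), "heuristic"


def merge_clusters(clusters, merge_pairs, dry_run=True):
    """Merge (target, source) cluster pairs; with dry_run=True only report the plan."""
    plan = [f"merge {src} into {dst}" for dst, src in merge_pairs]
    if dry_run:
        return plan  # nothing is modified
    for dst, src in merge_pairs:
        # Destructive step: the source cluster is removed after its articles move.
        clusters[dst].extend(clusters.pop(src))
    return plan
```

Defaulting dry_run to True in this sketch mirrors the advice in the notes: the destructive path has to be requested explicitly.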