News-Tweets NLP Linker
nlp
distributed-computing
python
data-engineering
Link tweets with news articles using NLP text similarity, with distributed computing pipeline and interactive dashboard
Overview
Links tweets with news articles based on text content similarity using NLP techniques. Uses distributed computing (Mesos + Docker) to run the similarity score pipeline in parallel for speedup.

Components:
- Similarity pipeline — NLP-based text similarity scoring between tweets and news articles
- Distributed computing — Mesos and Docker pipeline for parallel processing
- Interactive dashboard — Dash app to view and interact with similarity results
