EPL 25/26 Season Awards MW1 → MW34
Awards judged by the only metrics the discourse leaves behind: how often a player's name appears in fan posts, and the share of those mentions that carry positive vs negative language. Built from 707,550 player-mentions extracted from 339 match-day post bundles (Reddit, X/Twitter, YouTube), one per EPL fixture from MW1 to MW34. Net sentiment = pos% − neg%, i.e. the percentage of a player's mentions containing positive language minus the percentage containing negative language, where positive/negative is detected via a curated football-context lexicon (e.g. goat, class, masterclass, baller vs flop, bottled, passenger).
Methodology
Data: all merged.json files under output/raw/EPL/MW*/*/ — 339 match-day post bundles spanning MW1 → MW34 of the 2025/26 Premier League season. Each file is the union of Reddit, X/Twitter and YouTube fan posts for that fixture.
Player attribution: 2,503 unique aliases drawn from data/players.json (display_name, lastname, common_name, and a manually curated aka list). Tokens shared by more than one player are ambiguous and are dropped automatically. Posts are scanned with a word-boundary regex; a post that names several players credits every matched player.
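The attribution step can be sketched roughly as follows. This is a minimal illustration, not the pipeline's actual code: the field names come from the list above, but the record layout, the sample players (all fictional) and the helper names build_alias_index, compile_matcher and players_in_post are assumptions.

```python
import re
from collections import defaultdict

# Fictional sample records; the real layout of data/players.json is assumed.
PLAYERS = [
    {"display_name": "Alex Stone", "lastname": "Stone", "common_name": "Stone", "aka": ["Stoney"]},
    {"display_name": "Ben Stone", "lastname": "Stone", "common_name": "Ben Stone", "aka": []},
    {"display_name": "Carlos Vega", "lastname": "Vega", "common_name": "Vega", "aka": ["El Mago"]},
]


def build_alias_index(players):
    """Map each lowercase alias to one player, dropping aliases shared by several."""
    owners = defaultdict(set)
    for player in players:
        candidates = [
            player.get("display_name"),
            player.get("lastname"),
            player.get("common_name"),
            *player.get("aka", []),
        ]
        for alias in candidates:
            if alias:
                owners[alias.lower()].add(player["display_name"])
    # An alias claimed by more than one player is ambiguous, so it is discarded.
    return {alias: next(iter(names)) for alias, names in owners.items() if len(names) == 1}


def compile_matcher(alias_index):
    """Build one case-insensitive, word-boundary regex over all surviving aliases."""
    if not alias_index:
        return re.compile(r"(?!)")  # pattern that never matches
    # Longest aliases first, so a full name is tried before a shorter alias.
    ordered = sorted(alias_index, key=len, reverse=True)
    body = "|".join(re.escape(alias) for alias in ordered)
    return re.compile(rf"\b(?:{body})\b", re.IGNORECASE)


def players_in_post(text, alias_index, matcher):
    """Return every player credited by a post (multi-player posts credit all)."""
    return {alias_index[m.group(0).lower()] for m in matcher.finditer(text)}


ALIAS_INDEX = build_alias_index(PLAYERS)
MATCHER = compile_matcher(ALIAS_INDEX)
```

In this sketch the bare token "stone" belongs to two players and is dropped, so only the full names or the nickname can credit either of them, while the word boundaries keep "vegas" from matching "vega".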
Sentiment: a curated positive lexicon (goat, class, masterclass, brilliant, amazing, incredible, outstanding, unreal, sensational, superb, world class, elite, clutch, baller, magic) and a curated negative lexicon (flop, bottled, passenger, trash, terrible, awful, overrated, garbage, shocking, useless, disgrace, embarrassing, liability, mid) — football-discourse tuned. A post is positive if any positive term hits; same for negative. Mixed posts count toward both.
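A minimal sketch of the tagging rule and the resulting net sentiment, using the two lexicons listed above. Matching lexicon terms on word boundaries and case-insensitively is an assumption here (the source only states word-boundary matching for player names), and the function names classify and net_sentiment are illustrative.

```python
import re

POSITIVE = [
    "goat", "class", "masterclass", "brilliant", "amazing", "incredible",
    "outstanding", "unreal", "sensational", "superb", "world class",
    "elite", "clutch", "baller", "magic",
]
NEGATIVE = [
    "flop", "bottled", "passenger", "trash", "terrible", "awful",
    "overrated", "garbage", "shocking", "useless", "disgrace",
    "embarrassing", "liability", "mid",
]


def _compile(terms):
    # Assumption: whole-word, case-insensitive matching of lexicon terms.
    ordered = sorted(terms, key=len, reverse=True)
    body = "|".join(re.escape(term) for term in ordered)
    return re.compile(rf"\b(?:{body})\b", re.IGNORECASE)


POS_RE = _compile(POSITIVE)
NEG_RE = _compile(NEGATIVE)


def classify(text):
    """Return (is_positive, is_negative); a mixed post is True for both."""
    return bool(POS_RE.search(text)), bool(NEG_RE.search(text))


def net_sentiment(posts):
    """pos% minus neg% over the posts that mention one player."""
    if not posts:
        return 0.0
    pos = neg = 0
    for text in posts:
        is_pos, is_neg = classify(text)
        pos += is_pos
        neg += is_neg
    return 100.0 * (pos - neg) / len(posts)
```

Because a mixed post increments both counters, it cancels out in the net figure while still counting toward the mention total in the denominator.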
Award scoring: POTY / U23 / O32 / Best Signing all rank by net_sentiment × √mentions (volume-weighted net). Most Underrated keeps only players with 500–3,000 mentions, then ranks them by raw net sentiment. Most Improved combines the H2/H1 mention ratio (split at MW17) with net sentiment. Manager of the Year was previously decided in manager_discourse.html.
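The two fully specified ranking rules can be sketched as below. The player stats are made-up numbers for illustration, the function names are hypothetical, and treating the 500–3,000 bounds as inclusive is an assumption. Most Improved is left out because the source does not state how the mention ratio and net sentiment are combined.

```python
import math


def volume_weighted_score(net_sentiment, mentions):
    """net_sentiment × √mentions, the score used for the volume-weighted awards."""
    return net_sentiment * math.sqrt(mentions)


def rank_volume_weighted(stats):
    """Rank players (name -> (net_sentiment, mentions)) by volume-weighted net."""
    return sorted(stats, key=lambda name: volume_weighted_score(*stats[name]), reverse=True)


def rank_underrated(stats, lo=500, hi=3000):
    """Keep players inside the mention band, then rank by raw net sentiment."""
    # Assumption: both bounds are inclusive.
    eligible = {name: s for name, s in stats.items() if lo <= s[1] <= hi}
    return sorted(eligible, key=lambda name: eligible[name][0], reverse=True)


# Hypothetical stats: name -> (net sentiment in percentage points, mentions).
STATS = {
    "Player A": (20.0, 10000),  # 20 × 100 = 2000
    "Player B": (35.0, 900),    # 35 × 30  = 1050
    "Player C": (28.0, 2500),   # 28 × 50  = 1400
    "Player D": (40.0, 100),    # 40 × 10  = 400
}
```

With these numbers the volume-weighted order is A, C, B, D: the square root lets a heavily discussed player with a modest net outrank a lightly discussed one with a higher net, without letting raw volume dominate outright. The underrated ranking keeps only B and C, in that order.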
Caveats: the lexicon is keyword-based, not a trained classifier, so sarcasm and ironic praise will be scored at face value. Players whose surnames clash with everyday English (fire, hall, young) are only partially filtered, so some false matches remain. 25/26 signing detection combines historic_teams records with a curated list of summer-window arrivals from foreign clubs (whose previous teams aren't tracked in this dataset).