Discovery Sprint Overview

Done 80 (284 story points)
Backlog 7 (30 story points)
In progress 3 (9 story points)
Resolved 286 (9690 story points)
Stalled/Waiting 2 (0 story points)
total 378 (10013 story points)
Title Priority Story Points Assignee Status
#101277 Summarise our hypothesis for "User satisfaction" measurement Unbreak Now! 3 Ironholds Resolved
#109758 Rewrite mysql_read to use the updated DBI version Unbreak Now! Ironholds Resolved
#111880 Fix metrics dashboard Unbreak Now! 2 EBernhardson Resolved
#111749 Backfill dashboard data Unbreak Now! 2 Ironholds Resolved
#111714 Some golden retriever scripts not being run on a daily basis Unbreak Now! 1 Ironholds Resolved
#111549 Fix WDQS scripts and update dashboard accordingly Unbreak Now! 2 mpopov Resolved
#117510 Golden not retrieving Varnish data Unbreak Now! 3 Ironholds Resolved
#116919 "LD50" user satisfaction metric code broken Unbreak Now! mpopov Resolved
#124268 KPI broken on search dashboard Unbreak Now! 2 mpopov Resolved
#123816 Discovery search dashboard is showing an error message Unbreak Now! 2 mpopov Resolved
#134255 WDQS usage Dashboard print error for "Last 7 days" option Unbreak Now! mpopov Resolved
#129121 Backfill with correct satisfaction-related data Unbreak Now! 3 mpopov Resolved
#143447 Dashboard: Mobile App Events needs fixing Unbreak Now! 1 mpopov Done
#149735 Dashboards are down Unbreak Now! 2 mpopov Done
#163501 [Dashboards][WDQS] Include 'https://query.wikidata.org/sparql' endpoint Unbreak Now! 1 mpopov Done
#162178 Fix data retrieval scripts using raw user agent strings Unbreak Now! chelsyx Done
#117712 LDN script is still borken Unbreak Now! 1 mpopov Resolved
#114517 Data is not being retrieved/sync'd Unbreak Now! 1 Ironholds Resolved
#103596 Summarize what we know about the "zero results" queries Unbreak Now! - Resolved
#100674 Implement an isSearch UDF for Hadoop Needs Triage Ironholds Resolved
#100668 Rework searchdata.wmflabs.org so it can handle multiple dashboards (starting with the web portal) Needs Triage Deskana Resolved
#100449 Note the bug around clock times on the Apps page Needs Triage 1 Deskana Resolved
#100446 Fix dashboarding bug that's causing a claim of 6 events Needs Triage 2 Ironholds Resolved
#100056 Work out why mobile web dashboard entries abruptly terminate Needs Triage Ironholds Resolved
#100055 Make clear that the search dashboards are grabbed from sampled logs Needs Triage 1 Ironholds Resolved
#100050 Stop using scientific notation in dashboards Needs Triage 1 Ironholds Resolved
#98070 Document initial EventLogging needs Needs Triage 2 Ironholds Resolved
#101273 Include sampling rate in dashboards Needs Triage Ironholds Resolved
#102879 Fix immediate crash due to EL switchover and rebuilt basic dashboards to be resistant to such crashes Needs Triage 4 Ironholds Resolved
#102323 gzip: stdin: unexpected end of file in search failures script Needs Triage 2 Ironholds Resolved
#102249 Provide Dan with a list of "top n" search strings that returned 0 results Needs Triage Ironholds Resolved
#105739 Update dashboards to handle new adjustments to 'did you mean' feature Needs Triage EBernhardson Resolved
#106400 Create idealised schema for moving Cirrus logs into HDFS Needs Triage Ironholds Resolved
#109523 Create TestUserSatisfaction 2.0.0 schema Needs Triage 2 Ironholds Resolved
#109479 Measure actual effect size from the first A/B test Needs Triage 2 mpopov Resolved
#113299 Identify an appropriate outlier detection method for dwell times Needs Triage 1 mpopov Resolved
#112953 Create new web proxy for the search data Needs Triage 2 Ironholds Resolved
#112269 Perform survival analysis on user satisfaction data Needs Triage 4 mpopov Resolved
#111961 Patch dashboard setup scripts to set locale Needs Triage 1 Ironholds Resolved
#111856 Begin to quantify why people use Search Engines instead of Wikimedia search Needs Triage 3 Ironholds Resolved
#116295 Create data collection scripts for referer data Needs Triage 3 Ironholds Resolved
#116291 Switch hive queries over to Beeline Needs Triage 2 mpopov Resolved
#116194 Backfill satisfaction code Needs Triage 4 Ironholds Resolved
#115625 Move common dashboard JS to polloi Needs Triage 1 Ironholds Resolved
#115582 Fix weird indentation on discovery.wmflabs.org/index.html Needs Triage 1 mpopov Resolved
#115050 Do we have enough people in SearchSatisfaction to perform an A/B/C/D test and not screw up the dashboards? Needs Triage 1 mpopov Resolved
#114919 Refactor golden to be more robust and backfilling-friendly Needs Triage 5 Ironholds Resolved
#117617 Search Satisfaction metric dropped by 38.4% Needs Triage 3 Ironholds Resolved
#118483 Update satisfaction table references Needs Triage 1 Ironholds Resolved
#123347 Include geolocation data in portal dashboards Needs Triage 4 Ironholds Resolved
#122921 getting through on-boarding tasks Needs Triage debt Resolved
#124099 Note enwiki launch on maps dashboard Needs Triage Ironholds Resolved
#129137 http://discovery.wmflabs.org/external/ should filter, or at least break out, traffic from wmf sites Needs Triage 1 mpopov Resolved
#134301 Portal: investigate on how long a "typical" session is Needs Triage 4 mpopov Resolved
#137166 Data analysis support for Legal Needs Triage 6 mpopov Resolved
#136900 A/B Test (Chrysalis): Analyze the results from the new top 10 language links display test Needs Triage - Resolved
#136898 A/B Test (Chrysalis): Check that all is well with the new top 10 language links display test Needs Triage - Resolved
#136896 A/B Test (Chrysalis): Specifications on the gathering of data for the new top 10 language links display Needs Triage - Resolved
#145478 [Dashboards] Add/switch to bookmarking states Needs Triage 2 mpopov Done
#145149 discovery data access guidelines out of data Needs Triage 1 mpopov Done
#145124 Try combining cirrus search logs with engagement data Needs Triage 2 mpopov Done
#146422 Bug in current data retrieval script Needs Triage chelsyx Done
#161876 Search dashboard: remove regex in ZRR breakdown by type Needs Triage 1 mpopov Done
#123604 Include new fancy-dan stop function in dashboard code Needs Triage Ironholds Resolved
#123597 Fix dashboard collection scripts for the portal Needs Triage 2 Ironholds Resolved
#121034 Get statistics on how many people visit portal page vs how many people visit search results page Needs Triage Ironholds Resolved
#121026 [PAB2] (Analysis) Analyse results of Portal A/B test after it's finished Needs Triage Jdrewniak Resolved
#119395 Fix ZRR file change Needs Triage 2 Ironholds Resolved
#118914 Figure out how many people run a search query on wikipedia and land on an article Needs Triage mpopov Resolved
#117805 Hive claims it doesn't have any Maps data on Oct 31st Needs Triage Ironholds Resolved
#117299 Backfill desktop events (from 10/28 onward) Needs Triage Deskana Resolved
#117094 Figure out why the links on the external traffic dashboard don't work Needs Triage 1 Ironholds Resolved
#114261 Updating documentation & setup script for dashboard project Needs Triage 1 mpopov Resolved
#113637 Add augmented clickthrough & LDN metrics to dashboard Needs Triage 4 mpopov Resolved
#113513 Build retrieval scripts for dwell-time data Needs Triage 4 Ironholds Resolved
#107724 Contact entities responsible for zero-results queries Needs Triage 2 Ironholds Resolved
#102998 Distinguish iOS and Android app traffic Needs Triage 1 Ironholds Resolved
#102040 In the zero results stream it'd be nice to know if the user was given a search suggestion Needs Triage 3 - Resolved
#102038 In the zero results stream it'd be nice to be able to filter on whether the request was a prefix search or a full text search Needs Triage 2 - Resolved
#101902 Run a manual job to backfill more of the API data in the search dashboards Needs Triage 1 Ironholds Resolved
#101883 Create dashboards for the API statistics Needs Triage 4 Ironholds Resolved
#101774 Add counts of queries that are generating 0 results to searchdata.wmflabs.org Needs Triage 4 Deskana Resolved
#101384 Add submit-form action to the desktop search dashboard at searchdata.wmflabs.org Needs Triage Jdouglas Resolved
#106397 Write onboarding stuff for new Data Analyst High 3 Ironholds Resolved
#106395 Backfill Cirrus logs and notify users that the reporting system changed High 2 Ironholds Resolved
#105910 EPIC: expansion of data retrieved from the Cirrus search logs High 10 Ironholds Resolved
#109344 Perform a power analysis to figure out sample size for next A/B test High 2 mpopov Resolved
#108895 Get a fresh read on Google-referred traffic High 4 Ironholds Resolved
#109762 Perform final analysis for the second A/B test, write report, and publish it to Meta High 4 mpopov Resolved
#109744 Let user select date ranges (7/30/90 days) on the dashboard High 2 mpopov Resolved
#110618 Make sense of why the zero results rate is still going up in spite of us having tackled prominent zero results generators High 10 Ironholds Resolved
#111857 Verify that data from A/B test on suggester is coming through correctly (on or after 2015-09-09) High 2 mpopov Resolved
#112295 Design and agree on an Avro schema for cirrus search request logging to hadoop High 2 Ironholds Resolved
#113297 Create backend for speedily computing various ad-hoc user satisfaction KPIs High 10 Ironholds Resolved
#113292 Select an arbitrary dwell-time threshold High 5 Ironholds Resolved
#115878 [Spike 2 hours] Create document enumerating possibilities for search referrer metric High 1 Ironholds Resolved
#118926 Regenerate November data for maps High 1 mpopov Resolved
#118587 "Error: subscript out of bounds" when choosing 7 day date range on referrer dashboard High mpopov Resolved
#120285 Set up an experimental Discovery dashboard which people can push any graphs or features they want to High 3 mpopov Resolved
#120284 Try adding a "bot filter" checkbox to a single dashboard metric, in an experimental environment, to gauge how much work it is to do it more generically High 5 Ironholds Resolved
#120260 Analyse existing data on preferences to figure out if it can tell us about people opting in and out of the completion suggester beta feature High 2 Ironholds Resolved
#120135 Allow agent, spider, bot filtering on all dashboards High Ironholds_backup Resolved
#124685 Maps dashboard tile series is broken High mpopov Resolved
#130235 Investigate Portal post-deployment statistics High 3 mpopov Resolved
#132706 Validate click events in TestSearchSatisfaction2 High 2 mpopov Resolved
#133733 Portal Dashboard: Add de-duplication step to data collection High 2 mpopov Resolved
#134318 Verify data pipeline for TextCat A/B test on English Wikipedia High debt Resolved
#132509 A/B Test (Egg): Specifications on the gathering of data for the descriptive text on sister project links test High 1 mpopov Resolved
#132503 Update ZRR data collection to exclude irrelevant/invalid Cirrus requests High 2 mpopov Resolved
#132077 Re-analyse data from phrase rescore boost of 1 A/B test with single word queries excluded to see if/how it changes the result High 2 mpopov Resolved
#138397 Wikipedia.org: compare old data sets vs latest from A/B test (June 2016) High 10 mpopov Done
#139109 Wikipedia.org Portal Dashboard: Investigate spike in actions taken on the page High debt Done
#140816 Wikipedia.org Portal Dashboard: add clicks by language High 8 mpopov Done
#143045 Wikipedia.org Portal Dashboard: investigate increase in pageviews High 4 mpopov Done
#161806 Portal Dashboard: does not display US stats when grouping by region High 4 chelsyx Done
#146214 Wikipedia.org Portal Dashboard: investigate recent spike in pageviews High chelsyx Done
#131196 Research why the zero results rate for full text search is increasing High 4 mpopov Resolved
#124115 [PAB3] Analyze the results of the third portal A/B test High 4 mpopov Resolved
#124114 [PAB3] Check that all is well with the third portal A/B test High 1 mpopov Resolved
#124110 [PAB3] Specifications for lang detection on Portal High - Resolved
#123991 Explain why external referrers dashboard has a massive spike on 17th January 2016 (and permanently fix it!) High Ironholds Resolved
#123964 Investigate suggestions zero results rate High Deskana Resolved
#122072 Write postmortem for the portal A/B test problems High 1 Ironholds Resolved
#121434 Switch zero results rate dashboards over to the new data sources, including automata detection High 5 Ironholds Resolved
#121106 Write WDQS data retrieval script to breakdown stats by automata vs non-automats High 2 Ironholds Resolved
#117914 Analyse results of language switching test on or after 2015-11-11 High 5 Ironholds Resolved
#117789 Allow updating R packages installed from git repos High 2 mpopov Resolved
#116782 Time frame options for dashboard High 5 mpopov Resolved
#115652 Dashboard: Create performance indicator on referrer traffic High 5 Ironholds Resolved
#112945 Present results of analysis on TestSearchSatisfaction data to Discovery leads on Friday 18th Sep 2015 High 1 mpopov Resolved
#112813 Verify that data from A/B test on suggester is coming through correctly (on or after 2015-09-17) redux High 1 mpopov Resolved
#111269 Dashboard: conflicting data (results pages opened vs result sets per day) High 3 Ironholds Resolved
#111260 Validate TestSearchSatisfaction2 data after one week's data has streamed in (i.e. on or after 2015-09-10) to verify that we're happy with it High 2 Ironholds Resolved
#102098 Document why there was spike in the apps search data in June 2015 in the Discovery search dashboard High 2 mpopov Resolved
#101515 Write an Oozie job to gather API-based metrics High Ironholds Resolved
#99087 Technical task for the Data Analyst for R&D High Ironholds Resolved
#98568 Spin up Labs instances for data vis High 3 Ironholds Resolved
#94637 How many active editors are using VisualEditor at large Wikipedias? High 2 - Resolved
#98366 Build search log parser High 5 Ironholds Resolved
#98212 Set up data visualisation platform High 10 Ironholds Resolved
#98068 NSA data support High 40 Ironholds Resolved
#98078 What does the distribution of user agents/devices look like for the portal? Normal 2 Ironholds Resolved
#98076 Does portal traffic come from zero? Normal 1 Ironholds Resolved
#98071 Find search logs Normal 1 Ironholds Resolved
#98069 Data Analysis JDs Normal 5 Ironholds Resolved
#112170 Model user behavior and detect when reality heavily deviated from expectation Normal 20 mpopov Backlog
#124098 Make it easy to compute MoM and YoY changes in KPIs Normal 2 mpopov Done
#116828 Standardize on the same label for Non Search Traffic on referrer Dashboard Normal 1 Ironholds Resolved
#116822 Create data collection scripts for querying Google Webmaster Tools Normal 5 Deskana Resolved
#115895 Create central list of all Discovery A/B tests Normal 2 mpopov Resolved
#115919 Create UDFs for categorising referers Normal 5 mpopov Resolved
#115030 Backfill search data Normal 4 Ironholds Resolved
#111858 Analyze results of A/B test on suggester (on or after 2015-09-22) Normal 4 mpopov Resolved
#114776 Analyse results of AND operator relaxation test on or after ??? Normal Deskana Resolved
#114775 Verify data is transmitting correctly for AND operator relaxation test on 2015-10-14 Normal 4 Ironholds Resolved
#120406 [Spike] investigate prevalence of spiders within dashboard data Normal 3 Ironholds Resolved
#116325 Decide what actions we want to record on the www.wikipedia.org portal Normal 4 Ironholds Resolved
#121758 Include automata detection in Maps dashboard Normal 4 Ironholds Resolved
#123137 Write up version of portal report for Commons Normal Ironholds Resolved
#124827 Portal Dashboard - display browser/version used Normal 3 mpopov Resolved
#124824 Portal Dashboard - create algorithm to add browser/version used Normal 3 Ironholds Resolved
#132382 Investigate spike in page views on wikipedia.org at end of March 2016, and annotate dashboard Normal - Resolved
#132716 A/B Test (Egg): Analyze the results of the addition of descriptive text to sister project links Normal mpopov Resolved
#134303 Portal: Augment A/B test analysis for clickthroughs Normal 3 mpopov Resolved
#134011 A/B Test (Caterpiller): Analyze the results of the languages by article count test Normal 4 mpopov Resolved
#134199 [Portal dashboard] Add a "CTR on first visit" metric Normal 2 mpopov Resolved
#132519 A/B Test (Egg): Check that all is well with the addition of descriptive text to sister project links Normal 1 mpopov Resolved
#134009 A/B Test (Caterpiller): Check that all is well with the languages by article count test Normal 1 mpopov Resolved
#128118 Investigate how search query features affect result sets Normal 4 mpopov Resolved
#127868 Wikipedia.org Portal - determine percentage of traffic from search engines Normal 4 mpopov Resolved
#127867 Wikipedia.org Portal - determine mobile vs desktop visitors Normal 2 mpopov Resolved
#127846 EPIC: Prepare for Oliver's departure Normal 9021 Deskana Resolved
#128117 Transfer data collection ownership Normal 4 Ironholds Resolved
#128789 Talk to legal and make sure they're okay with our data handling documentation and everything else post-Oliver Normal Ironholds Resolved
#128888 Design a task for upcoming Data Analyst interview process Normal 3 mpopov Resolved
#134320 Analyse results of TextCat A/B test Normal 4 mpopov Resolved
#137170 Part Deux: TextCat A/B test for Language Identification - analysis of results Normal 4 mpopov Resolved
#137168 Part Deux: TextCat A/B test for Language Identification - ensure test is going well Normal 1 mpopov Resolved
#134302 Portal Dashboard: update for testing specifications Normal 0.5 mpopov Resolved
#137158 Compile and then resolve issues with TextCat A/B test data Normal 1 EBernhardson Resolved
#141135 "median" not working on WDQS dashboards Normal 1 mpopov Done
#144424 Add a PaulScore approximation to discovery.wmflabs.org Normal 3 mpopov Done
#142436 Dashboards: update for Product Owner change Normal 1 chelsyx Done
#143287 External Traffic Dashboard: add DuckDuckGo to external search engine referrers page Normal 4 mpopov Done
#143605 Wikipedia.org Portal Dashboard: add "other" to pageviews page Normal 8 mpopov Done
#143457 Search Metrics KPIs Dashboard: display errors need fixing Normal 2 mpopov Done
#143149 Wikipedia.org Portal: add event logging for selection of language in search box Normal 8 mpopov Done
#143137 Wikipedia.org Portal Visitors' Session Lengths (Redux) Normal 6 chelsyx Done
#143128 [EPIC] Learn about our databases and how to use them Normal chelsyx Done
#146216 Wikipedia.org Portal Dashboard: investigate spike in languages visited Normal 2 mpopov Done
#146215 Wikipedia.org dashboard: minor updates Normal 1 mpopov Done
#153856 Add lint/CI to all wikimedia/discovery analytics repositories Normal 10 mpopov Backlog
#146807 Wikipedia/Wikimedia apps availability test: analyze results Normal mpopov Done
#146806 Wikipedia/Wikimedia apps availability test: add event logging Normal - Done
#147513 Search results page: how many visitors are on mobile vs desktop Normal 2 mpopov Done
#147500 Analyze results of the second BM25 test Normal 10 chelsyx Done
#147496 Verify data pipeline for BM25 AB test Normal 1 mpopov Done
#147682 Can't install R package Boom (& bsts) on stat1002 (but can on stat1003) Normal 13 Ottomata Done
#149440 [EPIC] Detect bots from searches (and learn C++/Rcpp) Normal mpopov Backlog
#149963 Analyze WDQS traffic data to find parallel connection patterns Normal 6 mpopov Done
#149752 [Dashboard][Search] Make monthly metrics module work again Normal 2 mpopov Done
#147882 Access to Wikidata query logs that were used for recent research Normal chelsyx Done
#149355 Google-referred desktop traffic decline vs overall desktop traffic decline Normal 4 mpopov Done
#149838 Investigate if Interactive logging schema makes sense Normal 4 mpopov Done
#150539 Investigate intriguing behaviors in search metrics for desktop events Normal 4 mpopov Done
#150410 [Dashboard] Implement the wiki/language selector in Search Metrics Normal 8 chelsyx Done
#150370 [EPIC][Search][Dashboard] Add "well-behaved searchers" filter Normal mpopov Backlog
#150915 [Dashboard] Migrate golden to Reportupdater infrastructure Normal 50 mpopov Done
#150901 [Search][Dashboard] Well-behaved searchers filter: ZRR Normal 6 mpopov In progress
#153936 Create usage stats for WDQS LDF endpoint Normal 4 mpopov Done
#152728 Analyze access log to see whether we need to add filetype: aliases Normal chelsyx Done
#153716 Portal dashboard: Mexico shouldn't be listed as being in South America Normal chelsyx Done
#153715 Portal dashboard heading "No. Events by Country" could be clearer Normal chelsyx Done
#153887 Search Metrics Dashboard: Investigate recent spike in user engagement Normal mpopov Done
#154717 Maps Dashboard - add notation for increased tile usage Normal debt Done
#154634 Wikipedia.org: add app links to Portal dashboard Normal 4 mpopov Done
#156686 Search metrics: add sister projects to dashboard Normal - Done
#156512 Check that data coming from cross-wiki test is valid Normal 2 mpopov Done
#157796 Visits/searches from Safari 10 location bar search suggestions Normal mpopov Stalled/Waiting
#156300 analysis of results from A/B/C test for displaying sister projects in search results Normal 6 mpopov Done
#160624 Search dashboard, suspicious fulltext api usage Normal 3 dcausse Done
#161932 [Dashboards] Update external traffic dash to show non-bot traffic Normal 4 mpopov Done
#161771 [Dashboards] Add relative option to External By Search Engine Normal 4 mpopov Done
#164857 A/B Test: explore similar - analysis of results Normal - Backlog
#164856 A/B Test: explore similar - verify data coming in is good Normal - Backlog
#164854 Search Dashboard: update for engagement - sister projects Normal - Backlog
#165861 Use search log to find currently existing namespace combinations Normal 3 mpopov In progress
#161354 [Dashboards] Migrate from Vagrant to Puppet config Normal mpopov Stalled/Waiting
#160008 Analyze results of A/B test for displaying sister project search results (test #2) Normal 6 mpopov Done
#152617 Wikipedia portal: update eventlogging for sister project clickthroughs Normal 4 mpopov Done
#149143 Investigate what we'd need to do to ignore double quotes in search queries Normal debt Done
#149127 Maps Dashboard: fix and update Normal 4 mpopov Done
#147216 From Zero To Hero 2: Electric Boogaloo Normal 6 chelsyx Done
#143853 Wikipedia.org dashboard: Determine if new layout change caused any decreases Normal 4 mpopov Done
#143762 WDQS: Geographic breakdown of SPARQL queries Normal 6 chelsyx Done
#143589 Analyze results of BM25 AB test Normal 6 mpopov Done
#143587 Verify data pipeline for bm25 AB test Normal 1 mpopov Done
#143064 Wikipedia.org Portal Dashboard: update pageview counting Normal 1 mpopov Done
#141456 Maps EL outage Normal 1 mpopov Done
#141061 Wikipedia.org Portal Dashboard: add a 'most commonly clicked section per visit' metric Normal 1 mpopov Done
#140187 Decide on better wording for "users were 1.07 times more likely to do X" Normal mpopov Done
#136017 Analyse results of the swap2and3 search test Normal chelsyx In progress
#139548 Wikipedia.org Portal Dashboard: update geographical breakdown to filter out larger countries Normal debt Done
#134007 A/B Test (Caterpiller): Specifications on the gathering of data for the languages by article count test Normal mpopov Resolved
#139510 Dashboard: internal referrer weirdness on Feb 23, 2016 and search increase in June 2016 Normal - Resolved
#135759 Wikipedia.org Portal: investigate getting translations for 'the free encyclopedia' Normal mpopov Resolved
#137604 Data Visualization Literacy Lesson Normal 3 mpopov Resolved
#138107 [EPIC] Wikipedia Portal Dashboard: expand "other" countries data display Normal chelsyx Done
#138411 Wikipedia Portal Dashboard: filter out requests for 'search-redirect.php' Normal 2 mpopov Resolved
#136257 Wikipedia.org Portal Dashboard: investigate adding regions (or states) to geographical breakdown Normal 2 mpopov Resolved
#131875 Release BCDA package Normal 3 mpopov Resolved
#130754 Fix big drop in augmented clickthrough KPI that happened in March 2016 by backfilling data Normal mpopov Resolved
#129608 Perform analysis of results of A/B test with phrase rescore boost of 1 Normal 8 mpopov Resolved
#130083 Update data collections to use latest refinery (UDF) version Normal 2 mpopov Resolved
#129679 Analyze portal clickthrough by browser preferred langauge(s) (for wikipedia.org) Normal 2 mpopov Resolved
#129564 Switch Desktop data collection for dashboards to use TestSearchSatisfaction2 instead of Search schema Normal 2 mpopov Done
#129563 Find out more about the other 10%-ish of referrals on the Wikipedia portal Normal 2 mpopov Resolved
#129263 Discover where referral traffic comes from on the Wikipedia.org Portal (other than search engines) Normal 1 mpopov Resolved
#128929 Adjust dwell time / augmented clickthrough calculation Normal 2 mpopov Resolved
#128211 Write a first draft of a Employee Operations Manual for Discovery Analysts Normal 10 mpopov Resolved
#127900 Analyse the results of the "opening_text" for morelike suggestions A/B test Normal Dbrant Resolved
#128016 Refactor and update our internal R codebase Normal 6 Ironholds Resolved
#127743 Update Portal Traffic Dashboard - Browser Breakdown for Chrome OS version on Android Normal mpopov Resolved
#126244 Add data collection for getting zero results rate by language/project Normal mpopov Resolved
#127507 Add smoothing features to the portal dashboard Normal 1 mpopov Resolved
#127381 Fix bug with the by-engine external traffic dashboard Normal 2 Ironholds Resolved
#125739 Add absolute count of clickthrough graph to portal dashboard, to complement the relative clickthrough graph Normal Deskana Resolved
#125737 Add "page views per day" graph to the portal dashboard Normal 6 Ironholds Resolved
#125601 Define a way to measure user satisfaction of completion suggester Normal mpopov Resolved
#125216 Backfill now that replication is solved Normal 3 Ironholds Resolved
#124655 [SPIKE]: identify what proportion of portal requests lack JS support Normal 3 mpopov Resolved
#124093 Estimate zero results rate for 15-18 January (inclusive) Normal mpopov Resolved
#124072 Investigate huge spike in zero results rate on 16-17 January Normal mpopov Resolved
#123959 Make colors in Portal/Geo graph easier to see and differentiate Normal 1 mpopov Resolved
#123764 Backfill eventlogging-sourced data Normal mpopov Resolved
#122081 Add note to portal dashboards explaining why the baselines changed on 7th December 2105 Normal 1 Ironholds Resolved
#123673 Coordinate with Legal and Security, and do whatever is necessary to make Discovery data access guidelines public Normal Ironholds Resolved
#122039 Document relaunched Portal A/B test Normal 2 Ironholds Resolved
#122937 Experimental forecast dashboard Normal mpopov Resolved
#121757 Massively expand A/B test documentation Normal 3 Ironholds Resolved
#121701 Check data validity of the relaunched Portal A/B test on 2016-01-12/13 Normal 2 Ironholds Resolved
#121566 Analyse the results of the Portal search box A/B test on or after 2016-01-21 Normal 3 Ironholds Resolved
#121027 [PAB2] Describe upcoming portal A/B test Normal 1 Jdrewniak Resolved
#119639 Validate bucketing used for backend tests which report to CirrusSearchUserTesting log Normal 3 mpopov Resolved
#118995 Create portal dashboard Normal 5 Ironholds Resolved
#120432 Add dwell time to Discovery portals dashboard Normal 3 Ironholds Resolved
#117982 Field name changes messed with Maps scripts; fix! Normal 1 Ironholds Resolved
#117903 API data not displayed correctly (read: at all) Normal 3 Ironholds Resolved
#119531 Analyse results of accept-language header test on or after 2015-12-17 Normal mpopov Resolved
#119530 Verify data pipeline for accept-language test on or after 2015-12-10 Normal Deskana Resolved
#118994 Create portal data collection scripts Normal 3 Ironholds Resolved
#117147 When you click on a tab on a Discovery dashboard, the URL should be updated to the link at the bottom of the page Normal 3 mpopov Resolved
#118872 Missing data notifications on dashboards Normal 2 mpopov Resolved
#117096 Improve redandancy Normal 2 Ironholds Resolved
#118295 Analyse results of A/B test for the "fewer than 3 results" test on or after 2015-11-27 Normal 5 Ironholds Resolved
#118294 Verify data pipeline for A/B test for the "fewer than 3 results" test on or after 2015-11-20 Normal Deskana Resolved
#118218 Create UDFs for categorising types of queries based the nature of the query Normal 5 mpopov Resolved
#118214 Bring the smoothing and time frame selection features to the rest of Discovery's dashboards Normal 2 mpopov Resolved
#118027 Talk to Comms about Discovery blog posts Normal 1 Ironholds Resolved
#117915 Create UDFs to replicate the Python script used for parsing the Cirrus logs Normal 3 Ironholds Resolved
#116189 Fix survival estimates in Dashboard Normal 2 mpopov Resolved
#115274 Review and implement methods for Bayesian Categorical Data Analysis Normal 4 mpopov Resolved
#114696 Add link to Discovery landing page on data dash. Normal 1 Ironholds Resolved
#108233 Include generic shiny functionality in a package where it can be reused Normal 5 Ironholds Resolved
#113832 Write scripts for fetching server-side usage statistics Normal 2 mpopov Resolved
#113653 Post search data guidelines to officewiki Normal 2 Ironholds Resolved
#112700 Package up common functions and files across the dashboards Normal 5 Ironholds Resolved
#112337 Display maps server-side usage metrics on maps dashboard Normal 2 mpopov Resolved
#112311 Create data retrieval scripts for maps KPIs Normal 4 Ironholds Resolved
#112289 Update wmf::mysql_read() Normal 2 mpopov Done
#112605 Analyze queries performed on Wikidata Query Service to identify what users are using it for, and produce report Normal 10 mpopov Resolved
#112604 Create a document listing all the steps for a Discovery A/B test Normal 1 mpopov Resolved
#111979 Set up a staging/testing space for the dashboards (Discovery dashboards beta) Normal 4 mpopov Resolved
#111790 Improve Phabricator link on Wikidata Query Service dashboard Normal He7d3r Resolved
#103591 Migrate dashboards to gerrit Normal 2 mpopov Resolved
#108732 [Task] Train Wikidata people on how to add data/metrics to a Shiny dashboard for Wikidata Normal 2 mpopov Resolved
#107211 [Spike 1 day] Find sources of high-volume zero-results queries and determine the intent of the query originator Normal 4 Ironholds Resolved
#109361 Create a Wikidata query service usage dashboard Normal 8 mpopov Resolved
#109360 Create a script to extract request logs for query.wikidata.org for dashboards Normal 4 mpopov Resolved
#110942 Move data acquisition/aggregation scripts to their own repo Normal 1 mpopov Resolved
#108624 Add a new dashboard to searchdata.wmflabs.org to display maps KPIs Normal 6 Ironholds Resolved
#111242 Write EventLogging schemas on Meta-Wiki to capture data for Maps KPIs Normal Tfinc Resolved
#111256 WDQS Dashboard: reposition and relabel the plots Normal mpopov Resolved
#110703 Provide data dump for Tijmen van Dijl Normal 1 Ironholds Resolved
#109648 Create consultant JD Normal 2 Ironholds Resolved
#110590 Add breakdown of zero results rate by language/project pair to dashboard Normal 4 mpopov Resolved
#110482 Circulate X-Request-Purpose proposal internally Normal 1 Ironholds Resolved
#109761 Analyse the initial results of the second A/B test Normal 4 mpopov Resolved
#109760 Report on the first A/B test Normal 4 Ironholds Resolved
#107814 Coordinate with legal and security around data access guidelines for Search data Normal 6 Ironholds Resolved
#110080 Help requested from an R expert to help tweak phlogiston (burnup chart scripts) Normal 2 Ironholds Resolved
#109507 Coordinate data generation for maps KPIs Normal 2 Ironholds Resolved
#107815 Write up a Request for Comment for a x_analytics field that distinguishes sources of automated traffic Normal 2 Ironholds Resolved
#108389 Analyse the initial results of the A/B test data Normal 4 mpopov Resolved
#108239 Learn & doc how to access & work w/ queries and logs Normal 8 mpopov Resolved
#108230 Fix display bug with KPI dashboard Normal 1 Deskana Resolved
#108094 As a project lead, I'd like documentation on how to set up a Shiny dashboard so that I can visualise the project's key performance indicators Normal 4 mpopov Resolved
#107781 Display when the date range for the data backing the KPI summary Normal Deskana Resolved
#107780 Expand KPI dashboard to also include time series for each KPI Normal 2 mpopov Resolved
#107463 Make it possible to link to individual dashboards via a URL Normal 2 mpopov Resolved
#107112 Create dashboard for Discovery KPIs for Search Normal 4 mpopov Resolved
#107202 Dashboard feature: trend lines / smoothing Normal 2 mpopov Resolved
#107057 Include contact information into each markdown file Normal 1 mpopov Resolved
#105512 Analyse requests from Iran prior to and after the HTTPS switchover Normal 2 Ironholds Resolved
#105359 [Spike 4 hrs] Perform initial analysis of the TestSearchSatisfaction data so that we know whether we can trust it or not Normal 2 Ironholds Resolved
#105355 [Spike 1 day] Perform initial analysis of the TestSearchSatisfaction data to validate that the theory works Normal 5 mpopov Resolved
#105193 Modify dashboard's zero results rate tab to also show rate of change of zero results rate over time, to track progress we're making to reduce it Normal 2 Deskana Resolved
#68829 Prefer pages in the user's language in multilingual wikis Normal debt Done
#102999 EPIC: Architectural hardening of the Discovery dashboards Normal Ironholds Resolved
#102328 Remove mean from load time dashboard Normal - Resolved
#102247 Run a manual job to backfill more of the zero results rate data in the search dashboards Normal Ironholds Resolved
#101389 As a product manager, I'd like to know who is using our search API and how they're using it, so I can figure out how to prioritise tasks. (part 2) Normal 8 Ironholds Resolved
#101174 Estimate actual requirements for S&D analytics Normal Ironholds Resolved
#100715 Use the Cirrus server side logs to get some elasticsearch time took metrics Normal Deskana Resolved
#100672 Figure out some hypothetical formula for measuring the user perceived accuracy of full text search and create a plan to implement that including phabricator tasks Normal 4 Ironholds Resolved
#100669 Create initial stab at KPIs Normal Deskana Resolved
#100049 "Events" should read "Load times" Normal 1 Ironholds Resolved
#99578 Add legend to search dashboards Normal 1 - Resolved
#99013 As a product manager, I'd like to know who is using our search API and how they're using it, so I can figure out how to prioritise tasks. Normal Ironholds Resolved
#98569 Create EventLogging schema for initial data collection around the Wikipedia portal Normal 1 Ironholds Resolved
#98383 Create rsync connector to fluorine Normal 1 Ironholds Resolved
#102329 Figure out better approach for SI prefixes in dashboards Low 1 - Resolved
#118551 Add link to source code of each Discovery dashboard on the dashboard itself, so people know where to go to contribute Low 1 mpopov Resolved
#124066 Fix portal scripts to factor in webrequest partitions Low 1 Ironholds Resolved
#119448 Add per-country maps usage graphs to the maps dashboard Low 6 Ironholds Resolved
#126677 Determine what browser screen dimension are for Portal visitors Low debt Resolved
#128146 Analyze the varience of user-agent's, country, and other useful metrics of google refered traffic with and without a search query available in referer Low 4 mpopov Done
#130769 Determine if wikipedia.org portal is redirecting to itself Low 3 mpopov Resolved
#135248 Dashboarding and bots Low 4 mpopov Done
#132711 Spline smoothing (sorta bug) on dashboards Low 0.5 mpopov Resolved
#136377 Compare ZRR for query features across other search engines Low 10 mpopov Done
#154722 External traffic dashboard - update with notation for Safari internal referrers Low chelsyx Done
#151832 Add Maps tile usage counts as a Data Cube in Pivot Low 6 mpopov Done
#150215 [Dashboard][Search] Sparklines for KPIs Low 4 chelsyx Done
#143753 Add comment to search-related eventlogging in iOS and Android apps to inform Discovery of changes Low 1 mpopov Done
#143726 Search metrics dashboard: add in new data that has recently been added to event logging for mobile apps Low 6 mpopov Done
#130027 Portal Dashboard: group by supported browser versions Low 1 mpopov Resolved
#129750 Produce a one-off report, comparing page view and traffic levels on wikimedia.org to wikipedia.org Low 2 mpopov Resolved