Discovery Snapshot 2015-12-01 02:00:04

Done 17 (32 story points)
Backlog 5 (10 story points)
In progress 1 (3 story points)
Needs review 1 (5 story points)
Resolved 148 (394 story points)
Stalled/Waiting 3 (9 story points)
Total 175 (453 story points)
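
The per-status figures above should add up to the reported total. A minimal sketch of that check, assuming the counts are simply copied by hand from this snapshot (the names `summary`, `reported_total` and `column_sums` are illustrative and not part of any tracker tooling):

```python
# Figures copied from the snapshot summary: status -> (tasks, story points).
summary = {
    "Done": (17, 32),
    "Backlog": (5, 10),
    "In progress": (1, 3),
    "Needs review": (1, 5),
    "Resolved": (148, 394),
    "Stalled/Waiting": (3, 9),
}
reported_total = (175, 453)  # the total line of the snapshot


def column_sums(rows):
    """Return (total tasks, total story points) across all statuses."""
    tasks = sum(count for count, _ in rows.values())
    points = sum(pts for _, pts in rows.values())
    return tasks, points


computed_total = column_sums(summary)
print(computed_total == reported_total)  # prints True
```
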
ID Title Priority Story Points Assignee Status
#119639 Validate bucketing used for backend tests which report to CirrusSearchUserTesting log Needs Triage 3 mpopov In progress
#119531 Analyse results of accept-language header test on or after 2015-12-14 Normal 0 - Backlog
#119530 Verify data pipeline for accept-language test on or after 2015-12-08 Normal 0 - Backlog
#119395 Fix ZRR file change Needs Triage 2 Ironholds Resolved
#118995 Create portal dashboard Needs Triage 5 Ironholds Done
#118994 Create portal data collection scripts Normal 3 Ironholds Resolved
#118926 Regenerate November data for maps High 1 mpopov Done
#118914 Figure out how many people run a search query on wikipedia and land on an article Needs Triage 0 mpopov Resolved
#118872 Missing data notifications on dashboards Normal 2 mpopov Resolved
#118587 "Error: subscript out of bounds" when choosing 7 day date range on referrer dashboard High 0 mpopov Resolved
#118483 Update satisfaction table references Needs Triage 1 Ironholds Resolved
#118295 Analyse results of A/B test for the "fewer than 3 results" test on or after 2015-11-27 Normal 0 - Backlog
#118294 Verify data pipeline for A/B test for the "fewer than 3 results" test on or after 2015-11-20 Normal 0 Deskana Resolved
#118218 Create query categoriser UDFs Normal 5 mpopov Backlog
#118214 Bring the smoothing and time frame selection features to the rest of Discovery's dashboards Normal 2 mpopov Done
#118027 Talk to Comms about Discovery blog posts Normal 1 Ironholds Done
#117982 Field name changes messed with Maps scripts; fix! Needs Triage 1 Ironholds Done
#117915 Create UDFs to replicate the Python script used for parsing the Cirrus logs Normal 0 - Stalled/Waiting
#117914 Analyse results of language switching test on or after 2015-11-11 High 5 Ironholds Needs review
#117903 API data not displayed correctly (read: at all) Needs Triage 3 Ironholds Done
#117805 Hive claims it doesn't have any Maps data on Oct 31st Needs Triage 0 Ironholds Resolved
#117789 Allow updating R packages installed from git repos High 2 mpopov Done
#117712 LDN script is still borken Unbreak Now! 1 mpopov Resolved
#117617 Search Satisfaction metric dropped by 38.4% Needs Triage 3 Ironholds Resolved
#117510 Golden not retrieving Varnish data Unbreak Now! 3 Ironholds Done
#117299 Backfill desktop events (from 10/28 onward) Needs Triage 0 - Done
#117096 Improve redundancy Needs Triage 2 Ironholds Done
#117094 Figure out why the links on the external traffic dashboard don't work Needs Triage 1 Ironholds Resolved
#116919 "LD50" user satisfaction metric code broken Unbreak Now! 0 mpopov Done
#116828 Standardize on the same label for Non Search Traffic on referrer Dashboard Needs Triage 1 Ironholds Done
#116822 Create data collection scripts for querying Google Webmaster Tools Normal 5 Deskana Stalled/Waiting
#116782 Time frame options for dashboard High 5 mpopov Resolved
#116325 Decide what actions we want to record on the www.wikipedia.org portal Normal 4 Ironholds Done
#116295 Create data collection scripts for referer data Needs Triage 3 Ironholds Resolved
#116291 Switch hive queries over to Beeline Needs Triage 2 mpopov Resolved
#116194 Backfill satisfaction code Needs Triage 4 Ironholds Resolved
#116189 Fix survival estimates in Dashboard Normal 2 mpopov Resolved
#115919 Create UDFs for categorising referers Normal 5 Ironholds Backlog
#115895 Create central list of all Discovery A/B tests Normal 2 mpopov Done
#115878 [Spike 2 hours] Create document enumerating possibilities for search referrer metric High 1 Ironholds Resolved
#115652 Dashboard: Create performance indicator on referrer traffic High 5 Ironholds Resolved
#115625 Move common dashboard JS to polloi Needs Triage 1 Ironholds Resolved
#115582 Fix weird indentation on discovery.wmflabs.org/index.html Needs Triage 1 mpopov Resolved
#115274 Review and implement methods for Bayesian Categorical Data Analysis Normal 4 mpopov Resolved
#115050 Do we have enough people in SearchSatisfaction to perform an A/B/C/D test and not screw up the dashboards? Needs Triage 1 mpopov Resolved
#115030 Backfill search data Normal 4 Ironholds Resolved
#114919 Refactor golden to be more robust and backfilling-friendly Needs Triage 5 Ironholds Resolved
#114776 Analyse results of AND operator relaxation test on or after ??? Normal 0 - Done
#114775 Verify data is transmitting correctly for AND operator relaxation test on 2015-10-14 Normal 4 Ironholds Resolved
#114696 Add link to Discovery landing page on data dash. Normal 1 Ironholds Resolved
#114517 Data is not being retrieved/sync'd Unbreak Now! 1 Ironholds Resolved
#114261 Updating documentation & setup script for dashboard project Needs Triage 1 mpopov Resolved
#113832 Write scripts for fetching server-side usage statistics Normal 2 mpopov Resolved
#113653 Post search data guidelines to officewiki Normal 2 Ironholds Resolved
#113637 Add augmented clickthrough & LDN metrics to dashboard Needs Triage 4 mpopov Resolved
#113513 Build retrieval scripts for dwell-time data Needs Triage 4 Ironholds Resolved
#113299 Identify an appropriate outlier detection method for dwell times Needs Triage 1 mpopov Resolved
#113297 Create backend for speedily computing various ad-hoc user satisfaction KPIs High 10 Ironholds Resolved
#113292 Select an arbitrary dwell-time threshold High 5 Ironholds Resolved
#112953 Create new web proxy for the search data Needs Triage 2 Ironholds Resolved
#112945 Present results of analysis on TestSearchSatisfaction data to Discovery leads on Friday 18th Sep 2015 High 1 mpopov Resolved
#112813 Verify that data from A/B test on suggester is coming through correctly (on or after 2015-09-17) redux High 1 mpopov Resolved
#112700 Package up common functions and files across the dashboards Normal 5 Ironholds Resolved
#112605 Analyze queries performed on Wikidata Query Service to identify what users are using it for, and produce report Normal 10 mpopov Resolved
#112604 Create a document listing all the steps for a Discovery A/B test Normal 1 mpopov Done
#112337 Display maps server-side usage metrics on maps dashboard Normal 2 mpopov Resolved
#112311 Create data retrieval scripts for maps KPIs Normal 4 Ironholds Resolved
#112295 Design and agree on an Avro schema for cirrus search request logging to hadoop High 2 Ironholds Resolved
#112269 Perform survival analysis on user satisfaction data Needs Triage 4 mpopov Resolved
#111979 Set up a staging/testing space for the dashboards (Discovery dashboards beta) Normal 4 mpopov Done
#111961 Patch dashboard setup scripts to set locale Needs Triage 1 Ironholds Resolved
#111892 Perform analysis of how often the "Did you mean" suggestions are actually used Normal 4 Ironholds Stalled/Waiting
#111880 Fix metrics dashboard Unbreak Now! 2 EBernhardson Resolved
#111858 Analyze results of A/B test on suggester (on or after 2015-09-22) Normal 4 mpopov Resolved
#111857 Verify that data from A/B test on suggester is coming through correctly (on or after 2015-09-09) High 2 mpopov Resolved
#111856 Begin to quantify why people use Search Engines instead of Wikimedia search Needs Triage 3 Ironholds Resolved
#111790 Improve Phabricator link on Wikidata Query Service dashboard Normal 0 He7d3r Resolved
#111749 Backfill dashboard data Unbreak Now! 2 Ironholds Resolved
#111714 Some golden retriever scripts not being run on a daily basis Unbreak Now! 1 Ironholds Resolved
#111549 Fix WDQS scripts and update dashboard accordingly Unbreak Now! 2 mpopov Resolved
#111269 Dashboard: conflicting data (results pages opened vs result sets per day) High 3 Ironholds Resolved
#111260 Validate TestSearchSatisfaction2 data after one week's data has streamed in (i.e. on or after 2015-09-10) to verify that we're happy with it High 2 Ironholds Resolved
#111256 WDQS Dashboard: reposition and relabel the plots Normal 0 mpopov Resolved
#111242 Write EventLogging schemas on Meta-Wiki to capture data for Maps KPIs Normal 0 Tfinc Resolved
#110942 Move data acquisition/aggregation scripts to their own repo Normal 1 mpopov Resolved
#110703 Provide data dump for Tijmen van Dijl Normal 1 Ironholds Resolved
#110618 Make sense of why the zero results rate is still going up in spite of us having tackled prominent zero results generators High 10 Ironholds Resolved
#110482 Circulate X-Request-Purpose proposal internally Normal 1 Ironholds Resolved
#110080 Help requested from an R expert to help tweak phlogiston (burnup chart scripts) Normal 2 Ironholds Resolved
#109762 Perform final analysis for the second A/B test, write report, and publish it to Meta High 4 mpopov Resolved
#109761 Analyse the initial results of the second A/B test Normal 4 mpopov Resolved
#109760 Report on the first A/B test Normal 4 Ironholds Resolved
#109758 Rewrite mysql_read to use the updated DBI version Unbreak Now! 0 Ironholds Resolved
#109744 Let user select date ranges (7/30/90 days) on the dashboard High 2 mpopov Resolved
#109648 Create consultant JD Normal 2 Ironholds Resolved
#109523 Create TestUserSatisfaction 2.0.0 schema Needs Triage 2 Ironholds Resolved
#109507 Coordinate data generation for maps KPIs Normal 2 Ironholds Resolved
#109479 Measure actual effect size from the first A/B test Needs Triage 2 mpopov Resolved
#109361 Create a Wikidata query service usage dashboard Normal 8 mpopov Resolved
#109360 Create a script to extract request logs for query.wikidata.org for dashboards Normal 4 mpopov Resolved
#109344 Perform a power analysis to figure out sample size for next A/B test High 2 mpopov Resolved
#108895 Get a fresh read on Google-referred traffic High 4 Ironholds Resolved
#108732 [Task] Train Wikidata people on how to add data/metrics to a Shiny dashboard for Wikidata Normal 2 mpopov Resolved
#108624 Add a new dashboard to searchdata.wmflabs.org to display maps KPIs Normal 6 Ironholds Resolved
#108389 Analyse the initial results of the A/B test data Normal 4 mpopov Resolved
#108239 Learn & doc how to access & work w/ queries and logs Normal 8 mpopov Resolved
#108233 Include generic shiny functionality in a package where it can be reused Normal 5 Ironholds Resolved
#108230 Fix display bug with KPI dashboard Normal 1 Deskana Resolved
#108094 As a project lead, I'd like documentation on how to set up a Shiny dashboard so that I can visualise the project's key performance indicators Normal 4 mpopov Resolved
#107815 Write up a Request for Comment for a x_analytics field that distinguishes sources of automated traffic Normal 2 Ironholds Resolved
#107814 Coordinate with legal and security around data access guidelines for Search data Normal 6 Ironholds Resolved
#107781 Display when the date range for the data backing the KPI summary Normal 0 Deskana Resolved
#107780 Expand KPI dashboard to also include time series for each KPI Normal 2 mpopov Resolved
#107724 Contact entities responsible for zero-results queries Needs Triage 2 Ironholds Resolved
#107463 Make it possible to link to individual dashboards via a URL Normal 2 mpopov Resolved
#107211 [Spike 1 day] Find sources of high-volume zero-results queries and determine the intent of the query originator Normal 4 Ironholds Resolved
#107202 Dashboard feature: trend lines / smoothing Normal 2 mpopov Resolved
#107112 Create dashboard for Discovery KPIs for Search Normal 4 mpopov Resolved
#107057 Include contact information into each markdown file Normal 1 mpopov Resolved
#106400 Create idealised schema for moving Cirrus logs into HDFS Needs Triage 0 Ironholds Resolved
#106397 Write onboarding stuff for new Data Analyst High 3 Ironholds Resolved
#106395 Backfill Cirrus logs and notify users that the reporting system changed High 2 Ironholds Resolved
#105910 EPIC: expansion of data retrieved from the Cirrus search logs High 10 Ironholds Resolved
#105739 Update dashboards to handle new adjustments to 'did you mean' feature Needs Triage 0 EBernhardson Resolved
#105512 Analyse requests from Iran prior to and after the HTTPS switchover Normal 2 Ironholds Resolved
#105359 [Spike 4 hrs] Perform initial analysis of the TestSearchSatisfaction data so that we know whether we can trust it or not Normal 2 Ironholds Resolved
#105355 [Spike 1 day] Perform initial analysis of the TestSearchSatisfaction data to validate that the theory works Normal 5 mpopov Resolved
#105193 Modify dashboard's zero results rate tab to also show rate of change of zero results rate over time, to track progress we're making to reduce it Normal 2 Deskana Resolved
#103596 Summarize what we know about the "zero results" queries Unbreak Now! 0 - Resolved
#103591 Migrate dashboards to gerrit Normal 2 mpopov Resolved
#102999 EPIC: Architectural hardening of the Discovery dashboards Normal 0 Ironholds Resolved
#102998 Distinguish iOS and Android app traffic Needs Triage 1 Ironholds Resolved
#102879 Fix immediate crash due to EL switchover and rebuild basic dashboards to be resistant to such crashes Needs Triage 4 Ironholds Resolved
#102329 Figure out better approach for SI prefixes in dashboards Low 1 - Resolved
#102328 Remove mean from load time dashboard Normal 0 - Resolved
#102323 gzip: stdin: unexpected end of file in search failures script Needs Triage 2 Ironholds Resolved
#102249 Provide Dan with a list of "top n" search strings that returned 0 results Needs Triage 0 Ironholds Resolved
#102247 Run a manual job to backfill more of the zero results rate data in the search dashboards Normal 0 Ironholds Resolved
#102040 In the zero results stream it'd be nice to know if the user was given a search suggestion Needs Triage 3 - Resolved
#102038 In the zero results stream it'd be nice to be able to filter on whether the request was a prefix search or a full text search Needs Triage 2 - Resolved
#101902 Run a manual job to backfill more of the API data in the search dashboards Needs Triage 1 Ironholds Resolved
#101883 Create dashboards for the API statistics Needs Triage 4 Ironholds Resolved
#101774 Add counts of queries that are generating 0 results to searchdata.wmflabs.org Needs Triage 4 Deskana Resolved
#101515 Write an Oozie job to gather API-based metrics High 0 Ironholds Resolved
#101389 As a product manager, I'd like to know who is using our search API and how they're using it, so I can figure out how to prioritise tasks. (part 2) Normal 8 Ironholds Resolved
#101384 Add submit-form action to the desktop search dashboard at searchdata.wmflabs.org Needs Triage 0 Jdouglas Resolved
#101277 Summarise our hypothesis for "User satisfaction" measurement Unbreak Now! 3 Ironholds Resolved
#101273 Include sampling rate in dashboards Needs Triage 0 Ironholds Resolved
#101174 Estimate actual requirements for S&D analytics Normal 0 Ironholds Resolved
#100715 Use the Cirrus server side logs to get some elasticsearch time took metrics Normal 0 Deskana Resolved
#100674 Implement an isSearch UDF for Hadoop Needs Triage 0 Ironholds Resolved
#100672 Figure out some hypothetical formula for measuring the user perceived accuracy of full text search and create a plan to implement that including phabricator tasks Normal 4 Ironholds Resolved
#100669 Create initial stab at KPIs Normal 0 Deskana Resolved
#100668 Rework searchdata.wmflabs.org so it can handle multiple dashboards (starting with the web portal) Needs Triage 0 Deskana Resolved
#100449 Note the bug around clock times on the Apps page Needs Triage 1 Deskana Resolved
#100446 Fix dashboarding bug that's causing a claim of 6 events Needs Triage 2 Ironholds Resolved
#100056 Work out why mobile web dashboard entries abruptly terminate Needs Triage 0 Ironholds Resolved
#100055 Make clear that the search dashboards are grabbed from sampled logs Needs Triage 1 Ironholds Resolved
#100050 Stop using scientific notation in dashboards Needs Triage 1 Ironholds Resolved
#100049 "Events" should read "Load times" Normal 1 Ironholds Resolved
#99578 Add legend to search dashboards Normal 1 - Resolved
#99087 Technical task for the Data Analyst for R&D High 0 Ironholds Resolved
#99013 As a product manager, I'd like to know who is using our search API and how they're using it, so I can figure out how to prioritise tasks. Normal 0 Ironholds Resolved
#98569 Create EventLogging schema for initial data collection around the Wikipedia portal Normal 1 Ironholds Resolved
#98568 Spin up Labs instances for data vis High 3 Ironholds Resolved
#98383 Create rsync connector to fluorine Normal 1 Ironholds Resolved
#98366 Build search log parser High 5 Ironholds Resolved
#98212 Set up data visualisation platform High 10 Ironholds Resolved
#98078 What does the distribution of user agents/devices look like for the portal? Normal 2 Ironholds Resolved
#98076 Does portal traffic come from zero? Normal 1 Ironholds Resolved
#98071 Find search logs Normal 1 Ironholds Resolved
#98070 Document initial EventLogging needs Needs Triage 2 Ironholds Resolved
#98069 Data Analysis JDs Normal 5 Ironholds Resolved
#98068 NSA data support High 40 Ironholds Resolved
#94637 How many active editors are using VisualEditor at large Wikipedias? High 2 - Resolved
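
Because the rows above are space-separated and the titles themselves contain spaces, splitting a row back into columns is easiest when anchored on the known priority and status values. A rough sketch, assuming the only priorities and statuses are the ones that appear in this snapshot (the `Task` type and `parse_row` helper are hypothetical names introduced for illustration):

```python
import re
from typing import NamedTuple, Optional

# Values observed in this snapshot; an assumption, not an exhaustive tracker list.
PRIORITIES = ("Needs Triage", "Unbreak Now!", "High", "Normal", "Low")
STATUSES = ("Done", "Backlog", "In progress", "Needs review",
            "Resolved", "Stalled/Waiting")

# Row layout: "#<id> <title> <priority> <points> <assignee> <status>".
# The title is matched greedily; the end anchor forces the final four
# fields to be taken from the right-hand side of the line.
ROW = re.compile(
    r"^#(?P<id>\d+) (?P<title>.+) "
    r"(?P<priority>" + "|".join(re.escape(p) for p in PRIORITIES) + r") "
    r"(?P<points>\d+) (?P<assignee>\S+) "
    r"(?P<status>" + "|".join(re.escape(s) for s in STATUSES) + r")$"
)


class Task(NamedTuple):
    id: int
    title: str
    priority: str
    points: int
    assignee: str  # "-" means unassigned
    status: str


def parse_row(line: str) -> Optional[Task]:
    """Parse one task row; return None for lines that are not task rows."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return Task(int(m["id"]), m["title"], m["priority"],
                int(m["points"]), m["assignee"], m["status"])


task = parse_row("#118926 Regenerate November data for maps High 1 mpopov Done")
print(task.title, "|", task.priority, "|", task.points, "|", task.status)
```

Summing `points` over the parsed rows, grouped by `status`, gives a way to cross-check the per-status story point figures at the top of the snapshot.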