Library for building and maintaining draft quality prediction models for Wikipedia.

draftquality

Although the code in this repository may still work, it is unmaintained and may break at any time. Also keep in mind that the prediction quality of machine learning models can drift over time.

The replacement for ORES and associated infrastructure is Lift Wing: https://wikitech.wikimedia.org/wiki/Machine_Learning/LiftWing

Some Revscoring models from ORES run on the Lift Wing infrastructure, but they are otherwise unsupported (no new training or code updates).

They can be downloaded from the links documented at: https://wikitech.wikimedia.org/wiki/Machine_Learning/LiftWing#Revscoring_models_(migrated_from_ORES)

In the long term, some or all of these models may be replaced by newer models specifically tailored to run on modern ML infrastructure like Lift Wing.

If you have any questions, contact the WMF Machine Learning team: https://wikitech.wikimedia.org/wiki/Machine_Learning

This library provides a set of utilities for building and maintaining draft (new article creation) quality prediction models for Wikipedia.

Requirements

Depending on usage, the following NLTK corpora are optionally required:

  • NLTK SentiWordNet - python -m nltk.downloader sentiwordnet
  • NLTK WordNet - python -m nltk.downloader wordnet
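
The two downloader commands above can also be scripted, for example as part of an environment setup step. The following is a minimal sketch, not part of this library: the helper names `downloader_command` and `download_optional_corpora` and the `dry_run` flag are hypothetical and introduced only for illustration. It assumes NLTK is already installed for the interpreter that runs the script.

```python
import subprocess
import sys

# Corpus identifiers taken from the two commands listed above.
OPTIONAL_CORPORA = ("sentiwordnet", "wordnet")


def downloader_command(corpus):
    """Build the `python -m nltk.downloader <corpus>` command.

    sys.executable is used so the download targets the same interpreter
    (and virtual environment) that runs this script.
    """
    return [sys.executable, "-m", "nltk.downloader", corpus]


def download_optional_corpora(corpora=OPTIONAL_CORPORA, dry_run=False):
    """Run the downloader once per corpus and return the commands built.

    With dry_run=True (a hypothetical convenience flag) nothing is
    executed; the commands are only returned for inspection.
    """
    commands = [downloader_command(name) for name in corpora]
    if not dry_run:
        for command in commands:
            # check=True raises CalledProcessError if a download fails.
            subprocess.run(command, check=True)
    return commands
```

Calling `download_optional_corpora()` runs both downloads in order; passing `dry_run=True` only returns the commands, which can be useful for checking what would be executed.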

Author
