Benutzer:Stefan Kühn/Check Wikipedia

aus Wikipedia, der freien Enzyklopädie
Wechseln zu: Navigation, Suche

The WikiProject Check Wikipedia will help to clean the syntax of Wikipedia and to find some other errors. Here you can discuss new features.

Webinterface

News[Bearbeiten]

  • 2013-12-01 - New page at tools.wmflabs.org
  • 2013-01-27 – Some bugfixes, add alswiki, svwiktionary, enwiktionary
  • 2012-11-10 – Many changes an checkwiki.cgi (faster, faster, …), new startpage
  • 2012-09-21 – Start with elwiki
  • 2012-03-28 – Code at GitHub
  • 2010-05-01 – New page for "next dumpscan" in the interface
  • 2010-04-29 – Start scan of svwikisource
  • 2010-04-29 – rewrite the script for updating the interface, not so much deadlocks
  • 2010-04-28 – Fix the setting of priorities in every project
  • 2010-02-19 – From now the new article and the last change articles will be insert into database.
  • 2010-02-07 – Fix CGI-Script
  • 2010-01-29 – Write a new script for output of wikipage. I need this for a rebuild of the mainscript.
  • 2010-01-29 – Now the statistic data will collect in a database. Later I build a nice output.
  • 2010-01-22 – Interface now with translations. Please update your Translation-Page into XHTML. In the future I will not support the Wikipage-Output. The only way is the new interface. In the next days I will insert the statistic feature in the new interface.
  • 2010-01-20 – Change the startpage. Now faster, but update only every 15 minutes.
  • 2009-10-12 – Please vote!
  • 2009-09-01 – New interface – 0.1 Alpha
  • 2009-06-20 – I have start the reprogramming of the script. After some hours I had a new version of the script. It is really faster. The pdcwiki was scanned live with the old script 1 minutes. The new script need only 0.18 seconds. In second test with dewiki I have a reducing of 45 minutes. At Sunday I scan all languages of the project an this need only 13 hours. This is really fast. With the old script I need more then 30 hours. But there is a new problem. In some chase the title of the article will don't fit to the error. (For examlpe here). I will fix this tonight.
  • 2009-06-14 – I have found a way to reduce the CPU-usage at the Toolserver (Thanks to Kalan). The script will be faster check the live Wikipedia with a better request to the API. This need a big change in the script. I will redesign the script in the next weeks. I have no idea, how much time I need. But after this relaunch the script will really faster. - At this weekend I have not the time to do this, so I reduce the CPU-usage now only with 3 points. First: The every language will start in a alphabetical row at 0:00 UTM (af, ar, cs, cy, de, en, …). So the start time is not fix. Second: All language together need more then 24 hours to scan. So the script start only Monday, Wednesday, Friday. Third: No dump scan in the next weeks. – Sorry for this interruption, but it need time.
  • 2009-06-11 – I have stopped all scripts for all languages. The CPU-usage at the Toolserver was too high. I will try to solve the problem at the weekend.
  • 2009-05-28 – Start the international project-page in yi
  • 2009-05-09 – Big problems at Toolserver with my script. Reason: The new very fast creation of dumps. Every 4-5 days we get so a new dump for every language. In the past it was every 30 days. I stop my cronjobs and will work at this problem. Sorry of the interruption.
  • 2009-04-26 – Start the international project-page in eo, id and sk
  • 2009-04-22 – Start the international project-page in uk
  • 2009-04-12 – Start the international project-page in hu and zh
  • 2009-03-16 – Start the international project-page in ar and tr
  • 2008-11-16 – Start the international project-page in pdc
  • 2008-10-17 – There are big problems at the Toolserver after a power failure.
  • 2008-10-04 – Start the international project-page in gd
  • 2008-09-24 – Problem with scan. Limitation were set on 20 errors. I start a new scan at 05:20 UTC.
  • 2008-09-17 – Start the international project-page in is, fy, ro
  • 2008-09-16 – Start the international project-page in cy, no
  • 2008-09-14 – Start the international project-page in af, ca, fi, he, la
  • 2008-09-13 – Start the international project-page in commons, ja
  • 2008-09-07 – Start the international project-page in cs, da, es, it, nds-nl, pl, pt, sv
  • 2008-09-06 – Start the international project-page in nl, en, de, nds, fr, nl, ru

Project pages[Bearbeiten]

All languages[Bearbeiten]

No. Language Scan Project page
1 Afrikaans 23:00 af:Wikipedia:WikiProject Check Wikipedia
2 Arabic 19:20 ar:ويكيبيديا:فحص ويكيبيديا
3 Català 01:00 ca:Viquipèdia:WikiProject Check Wikipedia
4 Commons 14:00 commons:Commons:WikiProject Check Wikipedia
5 Česky 07:00 cs:Wikipedie:WikiProjekt Check Wikipedia
6 Cymraegc 13:00 cy:Wicipedia:WikiProject Check Wikipedia
7 Dansk 12:00 da:Wikipedia:WikiProjekt Check Wikipedia
8 Deutsch 04:00 de:Wikipedia:WikiProject Check Wikipedia
9 English 15:00 en:Wikipedia:WikiProject Check Wikipedia
10 Español 08:00 es:Wikiproyecto:Check Wikipedia
11 Suomi 15:30 fi:Wikipedia:Wikiprojekti Check Wikipedia
12 Français 05:00 fr:Projet:Correction syntaxique
13 Frysk 17:00 fy:Wikipedy:WikiProject Check Wikipedia
14 Gàidhlig 18:00 gd:Wikipedia:WikiProject Check Wikipedia
15 עברית 19:00 he:ויקיפדיה:שגיאות תחביר
16 Íslenska 20:00 is:Wikipedia:WikiProject Check Wikipedia
17 Italiano 10:00 it:Wikipedia:WikiProjekt Check Wikipedia
18 日本語 11:00 ja:プロジェクト:ウィキ文法のチェック
19 Latina 21:00 la:Vicipaedia:WikiProject Check Wikipedia
20 Plattdüütsch 03:00 nds:Wikipedia:WikiProject Check Wikipedia
21 Magyar 01:30 hu:Wikipédia:WikiProject Check Wikipedia
22 Nedersaksisch 02:00 nds-nl:Wikipedie:WikiProject Check Wikipedia
23 Nederlands 23:30 nl:Wikipedia:Wikiproject/Check Wikipedia
24 ‪Norsk (bokmål)‬ 07:20 no:Wikipedia:WikiProject Check Wikipedia
25 Deitsch 22:00 pdc:Wikipedia:WikiProject Check Wikipedia
26 Polski 10:20 pl:Wikiprojekt:Check Wikipedia
27 Português 09:00 pt:Wikipedia:Projetos/Check Wikipedia
28 Română 10:40 ro:Wikipedia:WikiProject Check Wikipedia
29 Русский 06:00 ru:Википедия:Страницы с ошибками в викитексте
30 Svenska 07:40 sv:Wikipedia:Projekt wikifiering/Syntaxfel
31 Türkçe 19:40 tr:Vikipedi:Vikipedi proje kontrolü
32 Українська 06:30 uk:Вікіпедія:WikiProject Check Вікіпедія
33 Yiddish 18:20 yi:װיקיפּעדיע:קאנטראלירן בלעטער
34 中文 13:30 zh:维基百科:错误检查专题
  • (and more languages to follow)

Translation of a project page[Bearbeiten]

Under ~sk/checkwiki on the Toolserver, you will find for every project a translation text (dewiki/dewiki_translation.txt). Please copy this file at your page of translation (for example in de: Wikipedia:WikiProjekt Syntaxkorrektur/Übersetzung). Then you can translate the text in that file. With the next run of the script, that page will be used for the automatic translation of the project site. If I insert a new error and the script doesn't find the translation at your page of translation, then the new error appears in English.

 # Example for frwiki
 error_005_prio_script=1 END
 error_005_head_script=Comment not currect end END
 error_005_desc_script=Found a comment "<!--" with no "-->" end. END
 error_005_prio_frwiki=-1 END
 error_005_head_frwiki=Commentaire non fermé END
 error_005_desc_frwiki=Un commentaire "<!--" sans la balise "-->" a été trouvé. END

Thanks, to all translators.

The script[Bearbeiten]

Download[Bearbeiten]

Data[Bearbeiten]

  • The data (Toolserver): ~sk/checkwiki, for example in folder dewiki:
    • dewiki_translation.txt = text for translation
    • dewiki_output_for_wikipedia.txt = text for the project page
    • dewiki_error_list.txt = full list of articles with errors

Last changes at the script[Bearbeiten]

  • 2009-05-25 – Split error 7 in error 7 and 83
  • 2009-05-23 – New error 82
  • 2009-05-21 – New error 81 and table sortable
  • 2009-05-07 – New error 79, 80
  • 2009-04-12 – Split error 69 (ISBN-Check) in error 69-73
  • 2009-04-12 – New error 74, 75, 76
  • 2009-04-12 – New error 77, 78
  • 2009-04-09 – New error 69 (ISBN-Check).
  • 2009-04-05 – New error 68.
  • 2009-03-16 – Big change inside the script: Scan is now very fast.
  • 2009-03-16 – 9 new errors in the last days, many little changes
  • 2009-02-22 – New error (047): Template not correct begin
  • 2009-02-22 – New error (046): Square brackets not correct begin
  • 2009-02-22 – Fix namespaces, fix code of error 010 and 030
  • 2009-02-18 – New error (045): Interwiki double
  • 2009-02-18 – New error (044): Headlines with bold
  • 2009-02-15 – Merge Templatetiger with Check Wikipedia
  • 2009-02-07 – Many changes (better detection of image, categories; now with namespacealias, …)
  • 2009-01-29 – Use "Special:RecentChanges" as resource (at the moment: only 4000 Recent Changes)
  • 2008-12-28 – Fixing problem with error 037 – now "/" possible
  • 2008-12-22 – Insert statistic
  • 2008-12-04 – Splitting error 026 (html-elements) in 028, 038, 039, 040, 041, 042, 043
  • 2008-12-04 – Fixing problem with error 034 – template programming elements – ifeq:
  • 2008-11-29 – Fixing problem with error 028 – image description
  • 2008-11-29 – More errors in error 030 – image description
  • 2008-11-29 – Fixing problem with error 036 – redirect with ":"
  • 2008-11-24 – Fixing problem with error 003 ref without references
  • 2008-11-20 – Error 030: update for more images without description ( " ...|thumb]]" )
  • 2008-11-20 – New error (036): Redirect not correct ("#REDIRECT = [[Target page]]")
  • 2008-11-20 – New error (035): Gallery without description
  • 2008-11-18 – New error (034): Template programming element
  • 2008-11-16 – Fix "deactivating of error with the translation page"
  • 2008-11-15 – Deactivate Error 033 "HTML text style underline"
  • 2008-11-01 – Error 003 only in article namespace and include "references group"
  • 2008-10-23 – Error 002 activated, only for <br\> or <br.> or <\br>; Please update your translation !
  • 2008-10-13 – Error 006 (defaultsort), 030 (image description) and 028 (table end) now only in namespace 0 (article)
  • 2008-10-13 – Error 003 now without "listaref" in eswiki
  • 2008-10-06 – New error (032) double pipe in link ([[text|text2|text3]])
  • 2008-10-04 – Fixing error error 020 (†)
  • 2008-10-03 – New articles will be included in the scan
  • 2008-10-02 – Exclude all pages .js and .css from scan
  • 2008-10-02 – Fixing error 012 (list elements) and 026 (text style elements) – only one error per article
  • 2008-10-01 – Fixing error 031 (table elements) – only one error per article
  • 2008-10-01 – Defaultsort in ca with ORDENA
  • 2008-10-01 – Fixing problem with error 003 ref without references
  • 2008-10-01 – Fixing translation of category (category_001=)
  • 2008-09-30 – Update error limit from 10000 to 40000
  • 2008-09-27 – Fixing this *[[]] in en and commons
  • 2008-09-27 – Fixing he interwiki
  • 2008-09-25 – Update error 26 with <font> <u>
  • 2008-09-25 – Insert error 31: HTML table elements (<table> ...)
  • 2008-09-25 – Fixing interwiki af, he ,is and ja
  • 2008-09-24 – Better English
  • 2008-09-24 – Fixing problem with priority (unkown, deactivated, top, middle, lowest)! Reason for long run time.
  • 2008-09-23 – Translation finish!
  • 2008-09-20 – Deactivation of "br"-error and update the number of error from 5000 to 10000, in dewiki without limit (30893 errors)
  • 2008-09-19 – Fixing the DEFAULTSORT with special letters for pl and ro
  • 2008-09-17 – Fixing the DEFAULTSORT with special letters for nn, no, da and cs
  • 2008-09-16 – New error: Image without a description.
  • 2008-09-14 – New error: HTML elements <b>, <i> and <p> will be detected
  • 2008-09-14 – Fixing the problem with DEFAULTSORT without the {{ }}
  • 2008-09-14 – Fixing the problem with very long DEFAULTSORT
  • 2008-09-13 – New error: check hierarchy of headlines (nr. 25)
  • 2008-09-13 – Fixing the HTML-problem with <ol start=19>
  • 2008-09-13 – Fixing the DEFAULTSORT with special letters for sv and fr
  • 2008-09-13 – Fixing the problem with double :, like sv:Kategori:USA:s presidenter in sv
  • 2008-09-13 – Fixing of the pre problem.
  • 2008-09-12 – Fixing of the nowiki problem. Now it will correct work and will be ignored by that check.
  • 2008-09-12 – Fixing of the source problem. cs:ALTER will now not have a error.
  • 2008-09-12 – Create page for Check Wikipedia with last changes, new and discussion.
  • 2008-09-10 – Fixing of the source problem.

Tools[Bearbeiten]

Some tools can be used to help fixing errors detected by Check Wikipedia script. Also see the list of gadgets and user scripts (German).

Tool Wikis Detected errors Bot capabilities Description
WikiSyntaxTextMod All Description No Polishes and corrects wiki syntax automatically while editing.
Auto-Formatter All List No Adds an Auto-Format button to the wiki editor toolbar.
WPCleaner All List Yes WPCleaner is a tool designed to help with various maintenance tasks. It's written in Java.
AutoWikiBrowser All List Yes AutoWikiBrowser is a semi-automated editor designed to make tedious repetitive tasks quicker and easier. It's written in C#.
AutoEd Description No

FAQ[Bearbeiten]

HTML into XHTML or Wikisyntax[Bearbeiten]

See Wikipedia:WikiProjekt HTML5.

Not correct Correct
<b>testo</b> '''testo'''
<i>testo</i> ''testo''
<u>testo</u> <span style="text-decoration:underline;">testo</span>
<strike>testo</strike> <s>testo</s>
<tt>testo</tt> <code>testo</code>
<big>testo</big> <span style="font-size:larger;">testo</span>
<b><big>testo</big></b> <span style="font-size:larger;">'''testo'''</span>
<center>testo</center> <div class="center">testo</div>
<p align="center">testo</p> <div class="center">testo</div>
<font color="#224466">testo</font> <span style="color:#224466;">testo</span>
<font style="text-decoration:overline">testo</font> <span style="text-decoration:overline;">testo</span>

Next features[Bearbeiten]

To-do-list[Bearbeiten]

  • change errors
    • error 3 – allowed <references></references>
    • error 37 exclude chinese and japanese characters (here) like en:囝
    • error 37 template:lifetime (here) also it:Template:Bio
    • errors 10, 46, 43 and 47 with section where this error is
    • for error 60 output with linenumber
    • error 36 with tabulator and line break
    • error 54 – after line break math, code …
    • error 63 – not in user signatuers
    • error 69 – no detect "ISBN-10:", "ISBN-13:", "(ISBN-10)", "(ISBN-13)" most before or after a ISBN
    • eowiki error 6 and 37: "DEFAŬLTORDIGO" ĉ, ĝ, ĥ, ĵ, ŝ, ŭ and also Ĉ, Ĝ, Ĥ, Ĵ, Ŝ, Ŭ
    • error 84 – only pre-Block de:Dekker-Algorithmus
    • error 30 – only 60px or bigger images
  • new error
    • endless tag like <poem> or <ref>
    • Plainlinks in article namespace <span class="plainlinks">(here)
    • thumbs with forced size: [[File:Foo.jpg|thumb|250px|Foo]]
    • Pipe in external link [http:/www.wikipedia.org|Wikipedia]
    • Definition ; '''name''' : definition no bold!
    • detection for page titles that contains characters outside Basic Multilingual Plane of Unicode; "𩺊" (U+29E8A), which was once redirected to ja:アラ, is prohibited for use in page titles in Japanese Wikipedia because it is outside BMP (from U+0000 to U+FFFF).
    • detect excessive boldface text, more then 10 pairs of '''
    • DEFAULTSORTs which have a sortkey identical to the page's name (these can be removed).
    • Pages where every category has an identical sortkey but no DEFAULTSORT (the individual sortkeys can be removed and a DEFAULTSORT should be added).
    • ºC and °C, Wikipedia:Bots/Anfragen/Archiv/2009-1#Nummernzeichen statt Gradzeichen
    • double defaultsort in one article
    • references with name and not double; Example here
    • category with space in front or behinde "category: test "
  • new interface
    • 10 or 25 errors set as done from one page.
  • other
    • problem with "+" for example [[GTK+|GIMP-Toolkit]]
    • translation phrase "This error was found *** times".
    • translation phrase "This output was limited to *** article."
    • translation phrase for statistic

Your wish-list[Bearbeiten]

Please write your wish in English or German at the site: Benutzer Diskussion:Stefan Kühn/Check Wikipedia. Thanks! -- sk 22:11, 12. Sep. 2008 (CEST)