„Wikipedia:WikiProjekt Georeferenzierung/Hauptseite/Wikipedia-World/en“ – Versionsunterschied

aus Wikipedia, der freien Enzyklopädie
Zur Navigation springen Zur Suche springen
Inhalt gelöscht Inhalt hinzugefügt
→‎Generation of geotags: As of early April 2007, around 45000...
→‎Maybe-Checker: very crude machine translation added...
Zeile 109: Zeile 109:


==Maybe-Checker==
==Maybe-Checker==
*'''[http://tools.wikimedia.de/~kolossos/wp-world/maybechecker.php?lang=en Maybe-Checker]''' - translation ...


The '''[http://tools.wikimedia.de/~kolossos/wp-world/maybechecker.php?lang=en Maybe-Checker]''' is a database-based suggestion system with article announcement and containment on certain categories, in the languages German, English and Czech. All articles listed here had no coordinate when last scanning the data base still, but a category (e.g. place in Germany, bridge, hotel), which suggests that it concerns an article, which one can georeference.

"Maybe" because each filtered article also absolutely a geo coordinate does not need (e.g. list of German churches in Germany).

One calls the Maybe Checker and looks at themselves the article.
:Button 1) One does not know the coordinates.
:2) The article should not get a coordinate.
:3) The article has meanwhile already a coordinate
:4) a coordinate straight inserted.

In particular the Button 2 is interesting, because these info. by the Script with the next update equal for pre-sorting one uses, because some articles (e.g. drop lattices, head station or Zugbrücke) emerge otherwise again and again.


==Usage of the datas==
==Usage of the datas==

Version vom 1. April 2007, 20:46 Uhr

NEWS: At the moment (2015-11-30) we have (Source): 4,15 million entries as input:

  • en 1145421
  • de 772004
  • sv 372494
  • fr 293196
  • nl 232555
  • ru 226063
  • pl 149819
  • ja 106954
  • ca 94353
  • es 91030
  • it 86936
  • sr 65041
  • cs 53493
  • zh 52971
  • uk 50251
  • da 48126
  • no 40229
  • fa 39004
  • eu 25054
  • lt 24879


With interwikilinks we get an output for 273 languages.

Database-Dumps (2015-11)

All people with WMFLAbs-account can read and use the database u_kolossos on osmdb database server. (Helppage Connecting to OSM) Diese Seite auf Deutsch: Wikipedia:WikiProjekt Georeferenzierung/Wikipedia-World

Thanks to the free availability of the collected data and thanks to innumerous helpers attaching geo information to the articles, it is possible to describe the position of locations with the degrees of longitude and latitude in the most common languages by using maps. This shows in a very good way the international aspects of Wikipedia.

This page will be translated in all corresponding languages in order to allow an easy start and to make this project well established. Therefore, we are looking for users that are active on the German Wikipedia and in Wikipedia in other languages in order to promote this project.

project description

This is the international co-ordination page for the multilingual usage and analysis of the geographical data collected in the projects Georeferenzierung, en:Wikipedia:WikiProject Geographical coordinates and others. We are looking for categories in the languages pl, ja, it, zh, sv, etc. that include all geotags (see interwiki links in the category Kategorie:Vorlage mit Koordinate) in order to be able to include more languages.

At a later point the largest amount of language variants shall be analyzed and recorded in a common database. In order to achieve this, a co-ordination on the used geographical coordinates templates is necessary.

At the time being, there are the following applications that are all based on the central database on the Toolserver. The maps can also be accessed via geohack:

search engines

http://tools.wikimedia.de/~kolossos/wp-world/place-search.php?la=en

simple maps

With the help of simple HTML and CSS code simple maps without background can be produced using the following languages:

English, German, Spanish, French, Italian, Japanese, Dutch, Polish, Portuguese, Russian, Swedish.

The map view is also available via a world map, in which you have to click only on the desired region.

Google Earth Wikipedia masking

Static Layer

The static layer for Google Earth is a KMZ-file with many different folders (Castles, Parks, Buildings, ...) sorted by type, continent and countrys. Download at webkuehn.de

Dynamic Layer

The data used for this are in a database and are filtered according to the articles length and the focus of the user. The corresponding articles are then put over the map using a layer. The database access occures after a one second stand still of the user's viewpoint.

A click on one of the symbols and a further click on the Wikipedia link enables the user to comfortably access and read the wikipedia articles.

Following languages can be used:

English, German, Spanish, French, Italian, Japanese, Dutch, Polish, Portuguese, Russian, Swedish.

Expert mode

This view can be configured in multiple ways. If you click on the networklink in Google Earth with the right mouse button in order to edit the link, the URL (http://tools.wikimedia.de/~kolossos/geoworld/marks.php?LANG=de) is displayed, which can be appended with additional parameters:

  • pop
    • URL&pop=1000 shows only cities with more than 1000 inhabitants
    • URL&pop=-1 suppresses all cities, which alows it to see more easily the landmarks
  • style
    • URL&style=world_cultur shows only the woruld cultural heritage sites. A full list of all styles can be found in the file info.php listed above. Because objects are only assigned to one object type, this is only reliable in a limited way.
  • photo
    • URL&photo=no shows only articles, that do not feature an image. Ideal to plan a photosafari in a certain region.
    • URL&photo=yes shows only articles, that have already an image. However, this also includes images like COAs or location maps, etc..
  • source
    • URL&source=de shows only sites, that come from the German Wikipedia. All input languages are supported.
  • notsource
    • URL&notregion=de does the contrary of the parameter "source".
  • region
    • URL&region=DE shows only sites in Germany.
    • URL&region=DE-SN shows only sites in the German Land Saxony. However, this works only if an article was additionally tagged with the Land code information.
  • notregion
    • URL&notregion=DE does the contrary of the parameter "region" (e.g. shows only places outside of Germany)
    • URL&notregion=DE-SN this would be the same for Saxony. Ideal to look for errors in a certain region.

Generation of geotags

Another usage for the interwikilinks is the automatic generation of geotags using the information from wikipedias in a different language. As a first example English geographical coordnates were used to produce coordinates using templates for German wikipedia articles without such coordinates: Wikipedia:WikiProjekt_Georeferenzierung/Artikel_ohne_Koordinate/da_in_engl_WP.

The coordinates included in this list were already put into the corresponding articles by hand. An automated process for such an inclusion could also be done by a bot in the future.

As of early April 2007, around 45000 en: articles have now been automatically geotagged based on automatic reconciliation of the article category tree with the NIMA GEOnet Names Server dataset, augmented with data from the CSV files listed above. This data is now available for cataloging by Kolossus in its next dump analysis.

world map with a representation of the concentration of wikipoints

Vorlage:Link-Bild Image: concentration of entries in logarithmic scale. Click on it.

The image schown above can be generated online from the database. There are three parameters, which can be passed with the URL:

    • Parameter "so" for "source" limits the display of articles orginating from a certain Wikipedia language source. Only these coordinates are shown that have not already been read from another language.
    • Parameter "la" uses all available coordinates of a language.
    • Parameter "fa" is a zoom factor, which can be 0.5, 1, 2, 4 or 8. Zoom fa=1 equals one pixel per degree. The time need for the generation of the image increases with the increase of the zoom factor.

Example: http://tools.wikimedia.de/~kolossos/wp-world/imageworld-art.php?so=pt&fa=2 shows the articles from the portugues Wikipedia.


Maybe-Checker

The Maybe-Checker is a database-based suggestion system with article announcement and containment on certain categories, in the languages German, English and Czech. All articles listed here had no coordinate when last scanning the data base still, but a category (e.g. place in Germany, bridge, hotel), which suggests that it concerns an article, which one can georeference.

"Maybe" because each filtered article also absolutely a geo coordinate does not need (e.g. list of German churches in Germany).

One calls the Maybe Checker and looks at themselves the article.

Button 1) One does not know the coordinates.
2) The article should not get a coordinate.
3) The article has meanwhile already a coordinate
4) a coordinate straight inserted.

In particular the Button 2 is interesting, because these info. by the Script with the next update equal for pre-sorting one uses, because some articles (e.g. drop lattices, head station or Zugbrücke) emerge otherwise again and again.

Usage of the datas

WikiMiniAtlas

WikiMiniAtlas

To-Do-Liste

  1. Link with Google Earth using style; speeding up database queries done
  2. generate static KMLss done
  3. further development of the CSV service
  4. internationalisation
    1. gathering of all geotag templates in one category, which can be linked via the interwiki-links with Kategorie:Vorlage mit Koordinate
    2. translation of the special type list (this is need for the data enhancement via categories)
    3. translation of the continent list, en:ISO 3166-1 for countries, en:ISO 3166-2 for regions within countries
    4. translation of the KML folder names
  5. extension of the database for more than 11 languages. Which ones?
  6. user interface for the error list control comparable to the interwiki-link-checker
  7. list of articles which use the exact same coordinates
  8. maybe suggestion list (without the articles which can receive a coordinate via an article in a different language)
  9. extract also en:Template:Mapit-US-cityscale done
  10. last step: en:GeoTagging

contacts