Wikipedia:WikiProjekt Georeferenzierung/Wikipedia-World/en

aus Wikipedia, der freien Enzyklopädie

Wechseln zu: Navigation, Suche
NEWS: At the moment (08-05-10) we have (Source: info.php):

264.288 entries without duplicates (input):

  • en 115426
  • de 95761
  • nl 29699
  • cs 5625
  • pt 5321
  • es 4917
  • ca 2645
  • fr 2408
  • ru 1310
  • fi 1028
  • eo 148

922.005 entries with duplicates (result):

  • en 180086
  • de 103419
  • es 40563
  • fr 76890
  • it 74600
  • ja 21947
  • nl 121041
  • pl 65960
  • pt 71155
  • ru 27627
  • sv 18877
  • fi 11254
  • no 16741
  • eo 28073
  • sk 11734
  • da 6050
  • cs 14951
  • tr 7225
  • zh 10971
  • is 1583
  • ca 11258


+ over 1 Mio. in all languages (1.163.797)

Database-Dumps (08-05-10)

All people with toolserver-account can read and use the database.

Diese Seite auf Deutsch: Wikipedia:WikiProjekt Georeferenzierung/Wikipedia-World

Thanks to the free availability of the collected data and thanks to the innumerable helpers who attach geo information to the articles, it is possible to describe the position of locations with the degrees of longitude and latitude in the most common languages by using maps. This shows in a very good way the international aspects of Wikipedia.

This page will be translated in all corresponding languages in order to allow an easy start and to make this project well established. Therefore, we are looking for users that are active on the German Wikipedia and in Wikipedia in other languages in order to promote this project.

Inhaltsverzeichnis

[Bearbeiten] Project description

This is the international co-ordination page for the multilingual usage and analysis of the geographical data collected in the projects Georeferenzierung, en:Wikipedia:WikiProject Geographical coordinates and others. We are looking for categories in the languages pl, ja, it, zh, sv, etc. that include all geotags (see interwiki links in the category Kategorie:Vorlage mit Koordinate) in order to be able to include more languages.

Via the interwiki links all language variants are analyzed and recorded in a common database. In order to achieve this, a co-ordination on the used geographical coordinates templates is necessary.

At the time being, there are the following applications that are all based on the central database on the Toolserver. The maps can also be accessed via geohack:

[Bearbeiten] Search engines

http://tools.wikimedia.de/~kolossos/wp-world/place-search.php?la=en

[Bearbeiten] Simple maps

With the help of simple HTML and CSS code simple maps without background can be comfortably produced using the following links for a limited number of languages or by modifying the URL: For example use http://tools.wikimedia.de/~kolossos/wp-world/umkreis.php?la=af&submit=v&lon=13&lat=52&rang=1462.5&map=2&limit=80 for Afrikaans.

English, German, Spanish, French, Italian, Japanese, Dutch, Polish, Portuguese, Russian, Swedish. Turkish, Chinese, Icelandic, Catalan.


The map view is also available via a world map, in which you have to click only on the desired region.

[Bearbeiten] Source code

[Bearbeiten] Google Maps Wikipedia masking

  • Google Maps Link with Wikipedia layer all parameters from the Google Earth Layer (below) and all (over 200) languages of the project are usable.

[Bearbeiten] Google Earth Wikipedia masking

[Bearbeiten] Static layer

The static layer for Google Earth is a KMZ-file with many different folders (Castles, Parks, Buildings, ...) sorted by type, continent and countries. Download at webkuehn.de

[Bearbeiten] Dynamic layer

The data used for this are in a database and are filtered according to the articles length and the focus of the user. The corresponding articles are then put over the map using a layer. The database access occurs after a one second stand still of the user's viewpoint.

A click on one of the symbols and a further click on the Wikipedia link enables the user to comfortably access and read the wikipedia articles.

You can select the desired language using the links below or via the URL http://tools.wikimedia.de/~kolossos/world-link.php?long=13.3783&lat=52.5163&lang=LANGUAGE replacing LANGUAGE with the ISO code of the requested language.

English,

Czech, Danish, Dutch, Esperanto, Finnish, French, German, Italian, Japanese, Norwegian, Polish, Portuguese, Russian, Slovak, Spanish, Swedish. Turkish, Chinese, Icelandic, Catalan.

[Bearbeiten] Expert mode

This view can be configured in multiple ways. If you click on the networklink in Google Earth with the right mouse button in order to edit the link, the URL (http://tools.wikimedia.de/~kolossos/geoworld/marks.php?LANG=de) is displayed, which can be appended with additional parameters:

  • thumbs
    • URL&thumbs=yes So you can see little thumbnail pictures. We are using the first jpeg-image of each article, see Screenshot, so we use over 37.000 images. Direct Link
  • pop
    • URL&pop=1000 shows only cities with more than 1000 inhabitants
    • URL&pop=-1 suppresses all cities, which makes it easier to see landmarks
  • style
    • URL&style=world_cultur shows only the world cultural heritage sites. A full list of all styles can be found in the file info.php listed above. Because objects are only assigned to one object type, this is only reliable in a limited way.
  • photo
    • URL&photo=no shows only articles, that do not feature an image. Ideal to plan a photosafari in a certain region. Direct Link
    • URL&photo=yes shows only articles, that have already an image. However, this also includes images like COAs or location maps, etc..
  • source
    • URL&source=de shows only sites, that come from the German Wikipedia. All input languages are supported.
  • notsource
    • URL&notregion=de does the contrary of the parameter "source".
  • region
    • URL&region=DE shows only sites in Germany.
    • URL&region=DE-SN shows only sites in the German Land Saxony. However, this works only if an article was additionally tagged with the Land code information.
  • notregion
    • URL&notregion=DE does the contrary of the parameter "region" (e.g. shows only places outside of Germany)
    • URL&notregion=DE-SN this would be the same for Saxony. Ideal to look for errors in a certain region.

[Bearbeiten] Source code

[Bearbeiten] Generation of geotags

Another usage for the interwikilinks is the automatic generation of geotags using the information from wikipedias in a different language. As a first example English geographical coordinates were used to produce coordinates using templates for German wikipedia articles without such coordinates: Wikipedia:WikiProjekt_Georeferenzierung/Artikel_ohne_Koordinate/da_in_engl_WP.

The coordinates included in this list were already put into the corresponding articles by hand. An automated process for such an inclusion could also be done by a bot in the future.

As of May 2008, around 100,000 en: articles have now been automatically geotagged based on automatic reconciliation of the article category tree with the NIMA GEOnet Names Server dataset, augmented with data from the CSV files listed above. These data are now available for cataloging by Kolossos in its next dump analysis.

[Bearbeiten] World map with a representation of the concentration of wikipoints

:Bild:Imageworld-artphp3.png

Image: concentration of entries in logarithmic scale. Click on it.

The image shown above can be generated online from the database. There are three parameters, which can be passed with the URL:

    • Parameter "so" for "source" limits the display of articles orginating from a certain Wikipedia language source. Only these coordinates are shown that have not already been read from another language.
    • Parameter "la" uses all available coordinates of a language.
    • Parameter "fa" is a zoom factor, which can be 0.5, 1, 2, 4 or 8. Zoom fa=1 equals one pixel per degree. The time need for the generation of the image increases with the increase of the zoom factor.

Example: http://tools.wikimedia.de/~kolossos/wp-world/imageworld-art-new.php?so=pt&fa=2 shows the articles from the portugues Wikipedia.

[Bearbeiten] Maybe-Checker

The Maybe-Checker is a database-based suggestion system with article announcement and containment on certain categories, in the languages German, English and Czech. All articles listed here had no coordinate when last scanning the database, but a category (e.g. place in Germany, bridge, hotel), which suggests that it concerns an article, which one can georeference.

"Maybe" because some articles can't have a georeference (e.g. list of German churches in Germany).

You open the Maybe-Checker and look at the article:

If the article needs coordinates, you can add them by clicking edit and/or mark the page as follows after you have finished:

Button 1) You don't know the coordinates.
2) The article should not get coordinates.
3) The article already has coordinates.
4) You have added the coordinates.

In particular Button 2 is interesting, because this information will be used by the Script with the next update for pre-sorting, because some articles (e.g. drop lattices, head station or Zugbrücke) otherwise would show up again and again.

[Bearbeiten] Usage of the data

WikiMiniAtlas
WikiMiniAtlas

[Bearbeiten] WikiMiniAtlas

[Bearbeiten] Panorama-Geocoding

[Bearbeiten] KDE Marble

[Bearbeiten] To-do list

  1. Link with Google Earth using style; speeding up database queries done
  2. generate static KMLss done
  3. further development of the CSV service
  4. internationalisation
    1. gathering of all geotag templates in one category, which can be linked via the interwiki-links with Kategorie:Vorlage mit Koordinate
    2. translation of the special type list (this is need for the data enhancement via categories)
    3. translation of the continent list, en:ISO 3166-1 for countries, en:ISO 3166-2 for regions within countries
    4. translation of the KML folder names
  5. extension of the database for more than 11 languages. Which ones?
  6. user interface for the error list control comparable to the interwiki-link-checker
  7. list of articles which use the exact same coordinates
  8. maybe suggestion list (without the articles which can receive a coordinate via an article in a different language)
  9. extract also en:Template:Mapit-US-cityscale done
  10. last step: en:GeoTagging

[Bearbeiten] Contacts

Persönliche Werkzeuge