„Wikipedia:WikiProjekt Georeferenzierung/Hauptseite/Wikipedia-World/en“ – Versionsunterschied
→Generation of geotags: As of early April 2007, around 45000... |
→Maybe-Checker: very crude machine translation added... |
||
Zeile 109: | Zeile 109: | ||
==Maybe-Checker== |
==Maybe-Checker== |
||
*'''[http://tools.wikimedia.de/~kolossos/wp-world/maybechecker.php?lang=en Maybe-Checker]''' - translation ... |
|||
The '''[http://tools.wikimedia.de/~kolossos/wp-world/maybechecker.php?lang=en Maybe-Checker]''' is a database-based suggestion system with article announcement and containment on certain categories, in the languages German, English and Czech. All articles listed here had no coordinate when last scanning the data base still, but a category (e.g. place in Germany, bridge, hotel), which suggests that it concerns an article, which one can georeference. |
|||
"Maybe" because each filtered article also absolutely a geo coordinate does not need (e.g. list of German churches in Germany). |
|||
One calls the Maybe Checker and looks at themselves the article. |
|||
:Button 1) One does not know the coordinates. |
|||
:2) The article should not get a coordinate. |
|||
:3) The article has meanwhile already a coordinate |
|||
:4) a coordinate straight inserted. |
|||
In particular the Button 2 is interesting, because these info. by the Script with the next update equal for pre-sorting one uses, because some articles (e.g. drop lattices, head station or Zugbrücke) emerge otherwise again and again. |
|||
==Usage of the datas== |
==Usage of the datas== |
Version vom 1. April 2007, 20:46 Uhr
NEWS: At the moment (2015-11-30) we have (Source): 4,15 million entries as input:
- en 1145421
- de 772004
- sv 372494
- fr 293196
- nl 232555
- ru 226063
- pl 149819
- ja 106954
- ca 94353
- es 91030
- it 86936
- sr 65041
- cs 53493
- zh 52971
- uk 50251
- da 48126
- no 40229
- fa 39004
- eu 25054
- lt 24879
With interwikilinks we get an output for 273 languages.
Database-Dumps (2015-11)
- toollabs:wp-world/dumps/new_red0.gz Wikipedia-World as PostGIS-dumps
- Database structure (PostGIS)
All people with WMFLAbs-account can read and use the database u_kolossos on osmdb database server. (Helppage Connecting to OSM) Diese Seite auf Deutsch: Wikipedia:WikiProjekt Georeferenzierung/Wikipedia-World
Thanks to the free availability of the collected data and thanks to innumerous helpers attaching geo information to the articles, it is possible to describe the position of locations with the degrees of longitude and latitude in the most common languages by using maps. This shows in a very good way the international aspects of Wikipedia.
This page will be translated in all corresponding languages in order to allow an easy start and to make this project well established. Therefore, we are looking for users that are active on the German Wikipedia and in Wikipedia in other languages in order to promote this project.
project description
This is the international co-ordination page for the multilingual usage and analysis of the geographical data collected in the projects Georeferenzierung, en:Wikipedia:WikiProject Geographical coordinates and others. We are looking for categories in the languages pl, ja, it, zh, sv, etc. that include all geotags (see interwiki links in the category Kategorie:Vorlage mit Koordinate) in order to be able to include more languages.
At a later point the largest amount of language variants shall be analyzed and recorded in a common database. In order to achieve this, a co-ordination on the used geographical coordinates templates is necessary.
At the time being, there are the following applications that are all based on the central database on the Toolserver. The maps can also be accessed via geohack:
search engines
http://tools.wikimedia.de/~kolossos/wp-world/place-search.php?la=en
simple maps
With the help of simple HTML and CSS code simple maps without background can be produced using the following languages:
English, German, Spanish, French, Italian, Japanese, Dutch, Polish, Portuguese, Russian, Swedish.
The map view is also available via a world map, in which you have to click only on the desired region.
Google Earth Wikipedia masking
Static Layer
The static layer for Google Earth is a KMZ-file with many different folders (Castles, Parks, Buildings, ...) sorted by type, continent and countrys. Download at webkuehn.de
Dynamic Layer
The data used for this are in a database and are filtered according to the articles length and the focus of the user. The corresponding articles are then put over the map using a layer. The database access occures after a one second stand still of the user's viewpoint.
A click on one of the symbols and a further click on the Wikipedia link enables the user to comfortably access and read the wikipedia articles.
Following languages can be used:
English, German, Spanish, French, Italian, Japanese, Dutch, Polish, Portuguese, Russian, Swedish.
Expert mode
This view can be configured in multiple ways.
If you click on the networklink in Google Earth with the right mouse button in order to edit the link, the URL (http://tools.wikimedia.de/~kolossos/geoworld/marks.php?LANG=de)
is displayed, which can be appended with additional parameters:
- pop
URL&pop=1000
shows only cities with more than 1000 inhabitantsURL&pop=-1
suppresses all cities, which alows it to see more easily the landmarks
- style
URL&style=world_cultur
shows only the woruld cultural heritage sites. A full list of all styles can be found in the file info.php listed above. Because objects are only assigned to one object type, this is only reliable in a limited way.
- photo
URL&photo=no
shows only articles, that do not feature an image. Ideal to plan a photosafari in a certain region.URL&photo=yes
shows only articles, that have already an image. However, this also includes images like COAs or location maps, etc..
- source
URL&source=de
shows only sites, that come from the German Wikipedia. All input languages are supported.
- notsource
URL¬region=de
does the contrary of the parameter "source".
- region
URL®ion=DE
shows only sites in Germany.URL®ion=DE-SN
shows only sites in the German Land Saxony. However, this works only if an article was additionally tagged with the Land code information.
- notregion
URL¬region=DE
does the contrary of the parameter "region" (e.g. shows only places outside of Germany)URL¬region=DE-SN
this would be the same for Saxony. Ideal to look for errors in a certain region.
Generation of geotags
Another usage for the interwikilinks is the automatic generation of geotags using the information from wikipedias in a different language. As a first example English geographical coordnates were used to produce coordinates using templates for German wikipedia articles without such coordinates: Wikipedia:WikiProjekt_Georeferenzierung/Artikel_ohne_Koordinate/da_in_engl_WP.
The coordinates included in this list were already put into the corresponding articles by hand. An automated process for such an inclusion could also be done by a bot in the future.
As of early April 2007, around 45000 en: articles have now been automatically geotagged based on automatic reconciliation of the article category tree with the NIMA GEOnet Names Server dataset, augmented with data from the CSV files listed above. This data is now available for cataloging by Kolossus in its next dump analysis.
world map with a representation of the concentration of wikipoints
Vorlage:Link-Bild Image: concentration of entries in logarithmic scale. Click on it. |
The image schown above can be generated online from the database. There are three parameters, which can be passed with the URL:
- Parameter "so" for "source" limits the display of articles orginating from a certain Wikipedia language source. Only these coordinates are shown that have not already been read from another language.
- Parameter "la" uses all available coordinates of a language.
- Parameter "fa" is a zoom factor, which can be 0.5, 1, 2, 4 or 8. Zoom fa=1 equals one pixel per degree. The time need for the generation of the image increases with the increase of the zoom factor.
Example: http://tools.wikimedia.de/~kolossos/wp-world/imageworld-art.php?so=pt&fa=2 shows the articles from the portugues Wikipedia.
Maybe-Checker
The Maybe-Checker is a database-based suggestion system with article announcement and containment on certain categories, in the languages German, English and Czech. All articles listed here had no coordinate when last scanning the data base still, but a category (e.g. place in Germany, bridge, hotel), which suggests that it concerns an article, which one can georeference.
"Maybe" because each filtered article also absolutely a geo coordinate does not need (e.g. list of German churches in Germany).
One calls the Maybe Checker and looks at themselves the article.
- Button 1) One does not know the coordinates.
- 2) The article should not get a coordinate.
- 3) The article has meanwhile already a coordinate
- 4) a coordinate straight inserted.
In particular the Button 2 is interesting, because these info. by the Script with the next update equal for pre-sorting one uses, because some articles (e.g. drop lattices, head station or Zugbrücke) emerge otherwise again and again.
Usage of the datas
![](http://upload.wikimedia.org/wikipedia/commons/thumb/0/00/MiniWikiAtlas_screenshot.png/220px-MiniWikiAtlas_screenshot.png)
WikiMiniAtlas
To-Do-Liste
Link with Google Earth using style; speeding up database queriesdonegenerate static KMLssdone- further development of the CSV service
- internationalisation
- gathering of all geotag templates in one category, which can be linked via the interwiki-links with Kategorie:Vorlage mit Koordinate
- translation of the special type list (this is need for the data enhancement via categories)
- translation of the continent list, en:ISO 3166-1 for countries, en:ISO 3166-2 for regions within countries
- translation of the KML folder names
- extension of the database for more than 11 languages. Which ones?
- user interface for the error list control comparable to the interwiki-link-checker
- list of articles which use the exact same coordinates
- maybe suggestion list (without the articles which can receive a coordinate via an article in a different language)
extract also en:Template:Mapit-US-cityscaledone- last step: en:GeoTagging
contacts
- Benutzer:Stefan Kühn: data extraction from the dumps and static layer
- Benutzer:Kolossos: transfer of the data into the database, programming of the application
- ALE! ¿…?: Translation of this page
- en:User:The Anome: automatic reconciliation of GNS data with category-tagged Wikipedia articles, and auto-tagging of articles on en: by en:User:The Anomebot2