hig.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Characterizing the Heterogeneity of the OpenStreetMap Data and Community
University of Gävle, Faculty of Engineering and Sustainable Development, Department of Industrial Development, IT and Land Management, Land management, GIS.ORCID iD: 0000-0001-9328-9584
University of Gävle, Faculty of Engineering and Sustainable Development, Department of Building, Energy and Environmental Engineering, Energy system.ORCID iD: 0000-0003-1121-2394
University of Gävle, Faculty of Engineering and Sustainable Development, Department of Industrial Development, IT and Land Management, Land management, GIS.ORCID iD: 0000-0002-2337-2486
2015 (English)In: ISPRS International Journal of Geo-Information, ISSN 2220-9964, Vol. 4, no 2, p. 535-550Article in journal (Refereed) Published
Abstract [en]

OpenStreetMap (OSM) constitutes an unprecedented, free, geographical information source contributed by millions of individuals, resulting in a database of great volume and heterogeneity. In this study, we characterize the heterogeneity of the entire OSM database and historical archive in the context of big data. We consider all users, geographic elements and user contributions from an eight-year data archive, at a size of 692 GB. We rely on some nonlinear methods such as power law statistics and head/tail breaks to uncover and illustrate the underlying scaling properties. All three aspects (users, elements, and contributions) demonstrate striking power laws or heavy-tailed distributions. The heavy-tailed distributions imply that there are far more small elements than large ones, far more inactive users than active ones, and far more lightly edited elements than heavy-edited ones. Furthermore, about 500 users in the core group of the OSM are highly networked in terms of collaboration.

Place, publisher, year, edition, pages
MDPI , 2015. Vol. 4, no 2, p. 535-550
Keywords [en]
OpenStreetMap, big data, power laws, head/tail breaks, ht-index
National Category
Other Computer and Information Science Physical Geography
Identifiers
URN: urn:nbn:se:hig:diva-20223DOI: 10.3390/ijgi4020535ISI: 000358987600006Scopus ID: 2-s2.0-84948967039OAI: oai:DiVA.org:hig-20223DiVA, id: diva2:852398
Available from: 2015-09-09 Created: 2015-09-09 Last updated: 2025-10-02Bibliographically approved
In thesis
1. Topological and Scaling Analysis of Geospatial Big Data
Open this publication in new window or tab >>Topological and Scaling Analysis of Geospatial Big Data
2018 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Geographic information science and systems face challenges related to understanding the instinctive heterogeneity of geographic space, since conventional geospatial analysis is mainly founded on Euclidean geometry and Gaussian statistics. This thesis adopts a new paradigm, based on fractal geometry and Paretian statistics for geospatial analysis. The thesis relies on the third definition of fractal geometry: A set or pattern is fractal if the scaling of far more small things than large ones recurs multiple times. Therefore, the terms fractal and scaling are used interchangeably in this thesis. The new definition of fractal is well-described by Paretian statistics, which is mathematically defined as heavy-tailed distributions. The topology of geographic features is the key prerequisite that enables us to see the fractal or scaling structure of the geographic space. In this thesis, topology refers to the relationship among meaningful geographic features (such as natural streets and natural cities).

The thesis conducts topological and scaling analyses of geographic space and its involved human activities in the context of geospatial big data. The thesis utilizes the massive, volunteered, geographic information coming from LBSM platforms, which are the global OpenStreetMap database and countrywide, geo-referenced tweets and check-in locations. The thesis develops geospatial big-data processing and modeling techniques, and employs complexity science methods, including heavy-tailed distribution detection and head/tail breaks, along with some complex network analysis. Head/tail breaks and the induced ht-index are a powerful tool for geospatial big-data analytics and visualization. The derived scaling hierarchies, power-law metrics, and network measures provide quantitative insights into the heterogeneity of geographic space and help us understand how it shapes human activities at city, country, and world scales. 

Place, publisher, year, edition, pages
Gävle: Gävle University Press, 2018. p. 73
Series
Studies in the Research Profile Built Environment. Doctoral thesis ; 7
Keywords
Third definition of fractal, scaling, topology, power law, head/tail breaks, ht-index, complex network, geospatial big data, natural cities, natural streets
National Category
Computer and Information Sciences Earth and Related Environmental Sciences
Research subject
Sustainable Urban Development
Identifiers
urn:nbn:se:hig:diva-26197 (URN)978-91-88145-24-6 (ISBN)978-91-88145-25-3 (ISBN)
Public defence
2018-05-16, Lilla Jadwiga-salen, Kungsbäcksvägen 47, Gävle, 10:00 (English)
Opponent
Supervisors
Available from: 2018-04-24 Created: 2018-03-04 Last updated: 2025-10-02

Open Access in DiVA

fulltext(1542 kB)15 downloads
File information
File name FULLTEXT01.pdfFile size 1542 kBChecksum SHA-512
33d650ef0d42373243f191d432dfa12d8faf70344c3efaa5b652325c798f966a9b8d08d6e3ab1ea720e8b4b05eb33319029d2e270a94e4a1cc6a446a66ef30f1
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Ma, DingSandberg, MatsJiang, Bin

Search in DiVA

By author/editor
Ma, DingSandberg, MatsJiang, Bin
By organisation
Land management, GISEnergy system
In the same journal
ISPRS International Journal of Geo-Information
Other Computer and Information SciencePhysical Geography

Search outside of DiVA

GoogleGoogle Scholar
Total: 15 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 853 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf