Zipf's Law for Natural Cities Extracted from Location-Based Social Media Data
University of Gävle, Faculty of Engineering and Sustainable Development, Department of Industrial Development, IT and Land Management.
2015 (English). Independent thesis, Advanced level (degree of Master (One Year)), 10 credits / 15 HE credits. Student thesis.
Abstract [en]

Zipf’s law is one of the empirical statistical regularities found in many natural systems, ranging from the protein sequences of immune receptors in cells to the intensity of solar flares. Verifying the universality of Zipf’s law offers opportunities to identify commonalities among phenomena that exhibit power-law behavior. Since power-law-like phenomena, as many previous studies have indicated, are often interpreted as evidence of complex systems, exploring the universality of Zipf’s law may also help explain underlying generative mechanisms and endogenous processes, such as those described by self-organization and chaos theory.

The main purpose of this study was to verify whether Zipf’s law holds for city sizes, city numbers and populations extracted from natural cities. Traditional city boundaries are derived from census data and imposed from the top down, which makes them arbitrary and subjective. This study instead established a new kind of city boundary, namely natural cities, by applying the head/tail breaks rule to four location-based social media data sets from Twitter, Brightkite, Gowalla and Freebase. To capture and quantify the hierarchical levels of cities across heterogeneous scales, the ht-index derived from the head/tail breaks rule was employed. The validity of Zipf’s law was then examined.
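
The head/tail breaks rule and the ht-index mentioned above can be illustrated with a minimal sketch. It assumes the commonly described form of the rule: split the values at their mean, keep the values above the mean (the head) as long as they remain a minority, and repeat on the head. The function names and the 40% head limit are assumptions made for this illustration and are not taken from the thesis.

```python
from statistics import mean


def head_tail_breaks(values, head_limit=0.4):
    """Return the mean values used as break points by head/tail breaks.

    head_limit is an assumed cut-off: the recursion continues only while
    the head (values above the mean) is at most this share of the data.
    """
    breaks = []
    current = list(values)
    while len(current) > 1:
        m = mean(current)
        head = [v for v in current if v > m]
        # Stop when nothing lies above the mean or the head is not a minority.
        if not head or len(head) / len(current) > head_limit:
            break
        breaks.append(m)
        current = head
    return breaks


def ht_index(values, head_limit=0.4):
    """ht-index taken as the number of break points plus one."""
    return len(head_tail_breaks(values, head_limit)) + 1


# Hypothetical city sizes: many small, a few medium, one large.
sizes = [1] * 16 + [10] * 3 + [100]
print(head_tail_breaks(sizes))
print(ht_index(sizes))
```

Under this sketch, data without a clear minority of large values (for example, all values equal) gets an ht-index of 1, while strongly heavy-tailed data yields a higher index.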

The results revealed subtle differences in the patterns of natural cities depending on which social media data set was examined. The ht-index calculated with the head/tail breaks method showed that the hierarchical levels were influenced less by spatio-temporal changes than by the data itself. The study also found that Zipf’s law is not universal when location-based social media data are used. A comparison with city numbers extracted from nightlight imagery indicated why Zipf’s law does not hold for location-based social media data: bias in customer behavior. Because of this bias, natural cities emerged much more frequently in certain regions and countries than in others, so the emergence of natural cities was not represented objectively. Furthermore, the study showed that whether Zipf’s law can be clearly observed depends not only on the data itself and man-made limitations but also on calculation methods, data precision and scale, and how idealized the observed data are.
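
As a rough illustration of how the choice of calculation method enters, Zipf’s law in its rank-size form states that size is proportional to rank raised to the power -alpha, with alpha close to 1. The sketch below estimates alpha with an ordinary least-squares fit of log(size) against log(rank). This is only one simple estimator, chosen here as an assumption; the abstract does not state which method the thesis used, and other estimators can give different values for the same data. The function name is hypothetical.

```python
import math


def zipf_exponent(sizes):
    """Estimate alpha in size ~ rank**(-alpha) by least squares on log-log values."""
    ranked = sorted((s for s in sizes if s > 0), reverse=True)
    if len(ranked) < 2:
        raise ValueError("need at least two positive sizes")
    xs = [math.log(rank) for rank in range(1, len(ranked) + 1)]
    ys = [math.log(size) for size in ranked]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    # The fitted slope is negative for decreasing sizes; alpha is its magnitude.
    return -slope


# Synthetic sizes that follow the rank-size rule exactly (not thesis data).
ideal = [1000 / rank for rank in range(1, 51)]
print(round(zipf_exponent(ideal), 3))
```

An estimate near 1 is consistent with Zipf’s law under this estimator; a value far from 1 suggests a deviation, although the fit says nothing by itself about statistical significance.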

Place, publisher, year, edition, pages
2015, p. 44 + appendixes
Keywords [en]
big data, location-based social media data, Zipf's law, power law, natural cities, ht-index
National Category
Earth and Related Environmental Sciences
Identifiers
URN: urn:nbn:se:hig:diva-19121
OAI: oai:DiVA.org:hig-19121
DiVA, id: diva2:796324
Subject / course
Geomatics
Educational program
Geomatics – master’s programme (one year) (swe or eng)
Presentation
2015-01-28, 13103, UNIVERSITY GÄVLE, GÄVLE, 14:00 (English)
Supervisors
Examiners
Available from: 2015-04-22. Created: 2015-03-18. Last updated: 2025-02-07. Bibliographically approved.

Open Access in DiVA

fulltext (1738 kB), 1120 downloads
File information
File name: FULLTEXT01.pdf
File size: 1738 kB
Checksum (SHA-512): f226775856352d15b019f3d3a6a982c599478e221d06ef1d38ae7ac9773ef4c0ec0eaa52e593ac9744810f2f2dce1c23e16806b48baff0de9385315b6197de62
Type: fulltext
Mimetype: application/pdf

Total: 1120 downloads
The number of downloads is the sum of all downloads of full texts. It may include, e.g., previous versions that are no longer available.

Total: 513 hits