hig.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Exploration of relationships from texts using self-organizing maps
University of Gävle, Department of Technology and Built Environment.
2007 (English)Independent thesis Advanced level (degree of Magister), 20 points / 30 hpStudent thesis
Abstract [en]

This thesis explored and visualized the relationships of documents data, based on the technique of self-organizing maps (SOM), a subtype of artificial neural network for visualizing high-dimensional data in low-dimensional views. The source data for this thesis are the full Extensible Markup Language (XML) texts of A Standard Corpus of Present Day Edited American English. The first step is transforming these XML files to produce a term-document matrix, including stop word removal, stemming, tf-idf (term frequency–inverse document frequency) weighting, global filtering; here rows of this matrix represent documents as n-dimensional vectors. Secondly, these vectors are clustered and visualized by SOM consisting of neurons, each neuron relatives to a set of documents with a certain number of same terms. Then a network has been constructed from SOM, with vertices set of neurons and documents, lines set of linkages between neurons and documents. Finally this network exports to the Pajek for analysis and final visualization.

Place, publisher, year, edition, pages
2007. , p. v+37
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:hig:diva-129OAI: oai:DiVA.org:hig-129DiVA, id: diva2:119682
Uppsok
teknik
Supervisors
Examiners
Available from: 2007-05-29 Created: 2007-05-29

Open Access in DiVA

fulltext(1632 kB)984 downloads
File information
File name FULLTEXT01.pdfFile size 1632 kBChecksum MD5
c25e8258ddd9466345094aee38cd36af152c372e690d3caae2eda9c1a0b7638dfad10707
Type fulltextMimetype application/pdf

By organisation
Department of Technology and Built Environment
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 984 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 486 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf