Scalable framework for AIS data exploration through effective density visualizations

With tens of thousands of vessels around the globe transmitting their positions daily, interpreting such large volumes of data is more than a challenging task. Through the Automatic Identification System (AIS), introduced in 2002, the coordinates and status of the vessels are continuously reported,...

Full description

Saved in:
Bibliographic Details
Published inOCEANS 2023 - Limerick pp. 1 - 6
Main Authors Troupiotis-Kapeliaris, Alexandros, Tsili, Eleni, Kaliorakis, Manolis, Spiliopoulos, Giannis, Zissis, Dimitris
Format Conference Proceeding
LanguageEnglish
Published IEEE 05.06.2023
Subjects
Online AccessGet full text

Cover

Loading…
Abstract With tens of thousands of vessels around the globe transmitting their positions daily, interpreting such large volumes of data is more than a challenging task. Through the Automatic Identification System (AIS), introduced in 2002, the coordinates and status of the vessels are continuously reported, with a transmission frequency ranging from 3 minutes down to a few seconds depending on their speed. Today, these millions of AIS messages and dozens of gigabytes of new data produced daily allow monitoring the movement of passenger or commercial vessels, as well as more complex activities like fishing and search-and-rescue operations. Studying the properties of AIS data and modeling vessel behavior has been the subject of numerous works the past few years. These attempts to describe vessel activity aim at a better understanding of their movement, often through the use of advanced mechanisms for capturing specific types of events. Although such approaches have been proven effective for a variety of scenarios, the resulting models are not easily comprehensible by the user, with notable examples being the trained neural networks or many of the classification models. Moreover, although recently there have been a few proposed works for extracting common vessel routes through historic data analysis, the end results by design do not provide the full picture regarding all movement in the area, solely including representative pathways. In order to overcome these issues, easily interpretable visualizations of movement at sea would provide a clear understanding of vessel behavior and the occurring trends. An experimental analysis that highlights the utility of vessel density maps in marine activities was presented in 2015 by Shelmerdine [1], with indicative experiments performed on a limited area around Shetland. A more scenario-specific analysis by Vespe et al. [2] focuses on visualizing the impact of piracy events over transport, while Chen et al. [3] attempted to also include the reported speed and course information from the AIS data in their maps. Furthermore, a framework for creating heat maps through a parallel Kernel Density Estimation (KDE) was proposed recently [4]. In their approach, Huang et al. present an efficient pipeline for trajectory compression and visualization in the context of Internet of Things (IoT) applications, with their solution relying on GPU-related accelerations. In this work, we extend our own MT-AIS-Toolbox [5], and present a scalable and effective tool for handling AIS datasets and visualizing vessel activity. For the purpose of creating an efficient and easily configurable solution, a state-of-the-art framework for scalable data processing, namely PySpark, is utilized. The proposed tool is able to manage large volumes of raw AIS messages and produce effective density maps for vessel movement according to the user configurations and needs. Our approach is split into two separate steps: first a dedicated mechanism is responsible for removing unnecessary or erroneous records and limiting the dataset within the spatio-temporal constraints of each use case. Then, the density of the area of interest is extracted, according to the selected metric, and ready-for-display density maps, that depict the vessel traffic, are generated. A few options for density metrics (such as number of different vessels that passed, the time spent at each area, the number of times vessels passed over an area etc.) are provided, with the user also being able to easily define a function that is best suited for their desired results. Additionally, options for comparing and combining different density maps are also included for a more complete analysis. Indicative experiments on a large real-world trajectory dataset were conducted, highlighting the performance capabilities of the proposed framework, in terms of execution time. Finally, as an application, the proposed extended tool has been utilized for data exploration and preparation during the training of machine learning models, as part of an EU-funded project for the digitalization of vessel behavior (i.e. VesselAI).
AbstractList With tens of thousands of vessels around the globe transmitting their positions daily, interpreting such large volumes of data is more than a challenging task. Through the Automatic Identification System (AIS), introduced in 2002, the coordinates and status of the vessels are continuously reported, with a transmission frequency ranging from 3 minutes down to a few seconds depending on their speed. Today, these millions of AIS messages and dozens of gigabytes of new data produced daily allow monitoring the movement of passenger or commercial vessels, as well as more complex activities like fishing and search-and-rescue operations. Studying the properties of AIS data and modeling vessel behavior has been the subject of numerous works the past few years. These attempts to describe vessel activity aim at a better understanding of their movement, often through the use of advanced mechanisms for capturing specific types of events. Although such approaches have been proven effective for a variety of scenarios, the resulting models are not easily comprehensible by the user, with notable examples being the trained neural networks or many of the classification models. Moreover, although recently there have been a few proposed works for extracting common vessel routes through historic data analysis, the end results by design do not provide the full picture regarding all movement in the area, solely including representative pathways. In order to overcome these issues, easily interpretable visualizations of movement at sea would provide a clear understanding of vessel behavior and the occurring trends. An experimental analysis that highlights the utility of vessel density maps in marine activities was presented in 2015 by Shelmerdine [1], with indicative experiments performed on a limited area around Shetland. A more scenario-specific analysis by Vespe et al. [2] focuses on visualizing the impact of piracy events over transport, while Chen et al. [3] attempted to also include the reported speed and course information from the AIS data in their maps. Furthermore, a framework for creating heat maps through a parallel Kernel Density Estimation (KDE) was proposed recently [4]. In their approach, Huang et al. present an efficient pipeline for trajectory compression and visualization in the context of Internet of Things (IoT) applications, with their solution relying on GPU-related accelerations. In this work, we extend our own MT-AIS-Toolbox [5], and present a scalable and effective tool for handling AIS datasets and visualizing vessel activity. For the purpose of creating an efficient and easily configurable solution, a state-of-the-art framework for scalable data processing, namely PySpark, is utilized. The proposed tool is able to manage large volumes of raw AIS messages and produce effective density maps for vessel movement according to the user configurations and needs. Our approach is split into two separate steps: first a dedicated mechanism is responsible for removing unnecessary or erroneous records and limiting the dataset within the spatio-temporal constraints of each use case. Then, the density of the area of interest is extracted, according to the selected metric, and ready-for-display density maps, that depict the vessel traffic, are generated. A few options for density metrics (such as number of different vessels that passed, the time spent at each area, the number of times vessels passed over an area etc.) are provided, with the user also being able to easily define a function that is best suited for their desired results. Additionally, options for comparing and combining different density maps are also included for a more complete analysis. Indicative experiments on a large real-world trajectory dataset were conducted, highlighting the performance capabilities of the proposed framework, in terms of execution time. Finally, as an application, the proposed extended tool has been utilized for data exploration and preparation during the training of machine learning models, as part of an EU-funded project for the digitalization of vessel behavior (i.e. VesselAI).
Author Zissis, Dimitris
Spiliopoulos, Giannis
Tsili, Eleni
Troupiotis-Kapeliaris, Alexandros
Kaliorakis, Manolis
Author_xml – sequence: 1
  givenname: Alexandros
  surname: Troupiotis-Kapeliaris
  fullname: Troupiotis-Kapeliaris, Alexandros
  email: alexandros.troupiotis@marinetraffic.com
  organization: University of the Aegean,Research Labs,Dep. of Product & Systems Design Eng. MarineTraffic,Athens / Ermoupoli,Greece
– sequence: 2
  givenname: Eleni
  surname: Tsili
  fullname: Tsili, Eleni
  email: eleni.tsili@marinetraffic.com
  organization: MarineTraffic,Research Labs,Athens,Greece
– sequence: 3
  givenname: Manolis
  surname: Kaliorakis
  fullname: Kaliorakis, Manolis
  email: manolis.kaliorakis@marinetraffic.com
  organization: MarineTraffic,Research Labs,Athens,Greece
– sequence: 4
  givenname: Giannis
  surname: Spiliopoulos
  fullname: Spiliopoulos, Giannis
  email: giannis.spiliopoulos@marinetraffic.com
  organization: MarineTraffic,Research Labs,Athens,Greece
– sequence: 5
  givenname: Dimitris
  surname: Zissis
  fullname: Zissis, Dimitris
  email: dzissis@aegean.gr
  organization: University of the Aegean,Dep. of Product & Systems Design Eng.,Ermoupoli,Greece
BookMark eNo1j01PAjEUAGuiB0X-gYdePbD2td3SHglBJSFyAM_kbfsqDcsu6S4o_nqNH6e5TCaZG3bZtA0xdg-iABDuYTmdTV5Wi7SnnPyulNqMCymkKkBIrY2zF2zoxs6qUiglpYFrtl55rLGqiceMe3pv847HNvPJfMUD9sjp41C3GfvUNrzf5vb4tuUUI_k-nYgHarrUn_kpdUes0-eP192yq4h1R8M_Dtjr42w9fR4tlk_z6WQxStKofuR0gFJZqMCWBoM3vvruOoCog5bSB00RsAyVqzBaMApFsBqCDMJiZb0asLvfbiKizSGnPebz5n9WfQEaaVTs
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/OCEANSLimerick52467.2023.10244698
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350332261
EndPage 6
ExternalDocumentID 10244698
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i263t-94d15381b1856adc6cbfec911f4d422cd4ef1a5db9baf8163a0d841d2d08ab8c3
IEDL.DBID RIE
IngestDate Wed Dec 20 05:18:59 EST 2023
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i263t-94d15381b1856adc6cbfec911f4d422cd4ef1a5db9baf8163a0d841d2d08ab8c3
OpenAccessLink https://doi.org/10.1109/OCEANSLimerick52467.2023.10244698
PageCount 6
ParticipantIDs ieee_primary_10244698
PublicationCentury 2000
PublicationDate 2023-June-5
PublicationDateYYYYMMDD 2023-06-05
PublicationDate_xml – month: 06
  year: 2023
  text: 2023-June-5
  day: 05
PublicationDecade 2020
PublicationTitle OCEANS 2023 - Limerick
PublicationTitleAbbrev OCEANS Limerick
PublicationYear 2023
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8440578
Snippet With tens of thousands of vessels around the globe transmitting their positions daily, interpreting such large volumes of data is more than a challenging task....
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Automatic Identification System
Behavioral sciences
big data
Cleaning
Data models
Data visualization
density maps
Distance measurement
Training
Transforms
vessel traffic
vessel trajectories
Title Scalable framework for AIS data exploration through effective density visualizations
URI https://ieeexplore.ieee.org/document/10244698
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH64HcSTihN_k4MXD6ltlqbpcYyNKTqFbbDbaJoExmCT2Qr61_uStoqC4K0UkpT30ve9Nt_3HsB1aqxMmWGUSxVSnklFVRpxanHD4HZhmJI7NfLjWIxm_H4ez2uxutfCGGM8-cwE7tKf5etNXrpfZfiGIxiJVLaghV9ulVhrF27qupm3T_1Bbzx5WPqjjlXMMAYErjt40Iz70UHFA8hwH8bN0hVvZBWUhQryj19VGf_9bAfQ-dbqkecvFDqEHbM-gukETe9EUcQ25CuC2Snp3U2I44SSagXvFVK36iEVtQOjH9GO1V68k7flq9Nc1krNDsyGg2l_ROv-CXTJRLegKdcunkUKMVlkOhe5wnkwulmuOWO55sZGWaxVqjIrMTHLQi15pJkOZaZk3j2G9nqzNidA4sQmkW9OnQiEs1DhOEwslbWJFZp3T6HjTLJ4qUpkLBprnP1x_xz2nGc85yq-gHaxLc0lonuhrrxXPwFh3qdN
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwED50gvqk4sTf5sEXH1rbLE3TxzE2Nt2qsA18G02TwBhsop2gf72XtFUUBN9KIE24S-87mu-7A7hOtBEJ1dRjQgYey4T0ZBIyz-CBweNCMSW3auRRyvtTdvcUPVVidaeF0Vo78pn27aO7y1erfG1_leEXjmDEE7EJWwj8ES3lWttwU1XOvH3odNvpeDh3lx2LiGIU8G1_cL-e-aOHioOQ3h6k9eIlc2Thrwvp5x-_6jL-e3f70PxW65HHLxw6gA29PITJGI1vZVHE1PQrgvkpaQ_GxLJCSbmC8wupmvWQktyB8Y8oy2sv3snb_NWqLiutZhOmve6k0_eqDgrenPJW4SVM2YgWSkRlnqmc5xLfg_HNMMUozRXTJswiJROZGYGpWRYowUJFVSAyKfLWETSWq6U-BhLFJg5de-qYI6AFEudhaimNiQ1XrHUCTWuS2XNZJGNWW-P0j_Er2OlPRsPZcJDen8Gu9ZJjYEXn0Che1voCsb6Ql87Dn0rcqpc
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=OCEANS+2023+-+Limerick&rft.atitle=Scalable+framework+for+AIS+data+exploration+through+effective+density+visualizations&rft.au=Troupiotis-Kapeliaris%2C+Alexandros&rft.au=Tsili%2C+Eleni&rft.au=Kaliorakis%2C+Manolis&rft.au=Spiliopoulos%2C+Giannis&rft.date=2023-06-05&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FOCEANSLimerick52467.2023.10244698&rft.externalDocID=10244698