SlideShare a Scribd company logo
1 of 33
Download to read offline
VARIABLE NEIGHBORHOOD
PREDICTION OF TEMPORAL
COLLECTIVE PROFILES
Presentation for EuroIoTA ’16
Speaker: Keun-Woo Lim
Telecom Paristech
24-11-2016
Brief Overview
 What do we do in this work?
 Analysis of temporal collective profiles (time-series)
 Use of mobile datasets (cellular, Wi-Fi)
 Real–time & Lightweight prediction (online prediction)
 What do we try to achieve?
 Better prediction accuracy
 Lower computational complexity
 Better application & use case
Contents
Contents  Introduction
 Methodology
 Prediction
 Outlier Detection
 Future Work
Introduction
Temporal collective profiles
 Representation of data that aggregate behavior of
group of individuals – over time
 Can be categorized in various ways
 “Daily Profiles”
What are collected?
 Basic telephone calls and SMS?
 However, we want to focus on more specific matters
 Specific application data
 Usage of Internet service
Why do we analyze these data?
 For “online network analysis”
 Real-time prediction of the near-future
 Recognition of sudden changes/outliers
 Timely adaptation
 Use cases
 Resource allocation
 Traffic handling
 Social behavior
Requirements
 Low computational complexity
 Lightweight prediction methods
 Accuracy
 Still have to be accurate
Dataset
 Cellular mobile dataset
 1-week data from 90 lacs in Paris
 More than 500 daily profiles
 Wi-Fi cloud dataset
 122 days (March 1st to June 30th, 2014)
 60 million URL connection logs
(Top 20 mobile applications)
Methodology
What should we do with daily profiles?
 Daily profiles can be:
 Very similar to each other (same day, location, etc.)
 Very different too (outlier, events)
 We use methods to calculate similarity
 Cluster similar profiles
 Distinguish different profiles
Previous work (Offline analysis)1
 Utilization of clustering methods (UPGMA)
 With similarity comparison techniques (DTW, quantiles)
 Not ideal in online data analysis
 Clustering may take long time (𝑂(𝑀2 𝑁3)with DTW)
1K. Lim, S. Secci, L. Tabourier, B. Tebbani, “Characterizing and predicting mobile application usage,”
https://hal.archives-ouvertes.fr/hal-01345824/document
Profile similarity
 We use two examples of similarity measures
(M values in a time-series)
 Euclidean distance (ED) = Θ(M)
 Dynamic time warping (DTW) = Θ(M2)
 For specific dataset containing N profiles,
 ED = Θ(N2M)
 DTW = Θ(N2M2)
to compare all with each other
Weighted graph representation
 Using similarity measures, we acquire a graph
structure of neighbors
 E.g., if ED is used, lower value = more similar
Filtering paths
 Filter neighbors with high distance
 Depending on the value of α, the number of neighbors
change for all profiles
Visualization of graph structure
 Example graph structure for ED – cellular dataset
Variable Neighborhood Prediction
(VNP)
Principle of VNP
 For a new day 𝑥 𝑛(𝑡), we configure
 𝑡0 = 0, 𝑡1 = 0~24, 𝑡2 = 24 (hour)
 Objective
 Observation period = 𝑥 𝑛 𝑡0, 𝑡1
 Create a temporal profile to predict 𝑥 𝑛 𝑡1, 𝑡2
 Find 𝑥 from the observation period
 The closest profile 𝑥, in 𝑥 𝑡0, 𝑡1 and 𝑥 𝑛 𝑡0, 𝑡1
Find the neighbors
 Using closest neighbor 𝑥, we find the group of
neighbors 𝑁 𝑛 to be used for prediction
 For any other profile y of the training set,
 𝑦 ∈ 𝑁𝑛 𝑖𝑖𝑓
𝑠 𝑥 𝑛 𝑡0, 𝑡1 , 𝑦 𝑡0, 𝑡1 ≤ 𝑎 ∙ 𝑠 𝑥 𝑛 𝑡0, 𝑡1 , 𝑥 𝑡0, 𝑡1
Creating the prediction profile
 Using 𝑁 𝑛, formulate the prediction
 𝑥 𝑛 𝑡 =
σ 𝑦∈𝑁 𝑛
𝑠(𝑥 𝑛,𝑦)∙𝑦(𝑡)
σ 𝑦∈𝑁 𝑛
𝑠(𝑥 𝑛,𝑦)
 Simply put, it is the weighted average over the profiles
of its neighborhood
Training Parameter 𝑎
 𝑎 can be tuned to select the optimal number of
neighbors
 Variable neighborhood search to find the 𝑎 that yields
the highest accuracy over time
 E.g. 1.0 < 𝑎 < 10.0
 Drawbacks
 Increase in complexity (recalculate for each 𝑎)
Calculating multiple 𝑡1
 For a more fine-grained prediction, multiple 𝑡1 can
be used in one day
 Repetition of the VNP (e.g. per-hour analysis)
Handling Complexity - VNP
 Computation of calculating neighborhood of target
day per 𝑎 :
 ED = Θ(NM)
 DTW = Θ(NM2)
 Depending on N, this can be large in practice
 Also, in case of multiple 𝑡1 analysis, large M can
also impact
Handling Complexity - Graph
 Can be heavy
 ED = Θ(N2M)
 DTW = Θ(N2M2)
 Luckily, graph representation is only updated once per day
 Although, needed for various M in case of multiple 𝑡1 analysis
 Also, space partitioning can be used to reduce time
 Via Kd-tree
 This can reduce complexity of ED to Θ(log(N)M)
Prediction Analysis
Prediction accuracy analysis
 Prediction through relative error, defined as
 𝜀 =
σ 𝑡=𝑡1
𝑡2 𝑥 𝑛 𝑡 − ෣𝑥 𝑛 𝑡
2
σ 𝑡=𝑡1
𝑡2 𝑥 𝑛 𝑡
2
 Comparison with closest neighbor ( 𝑎 =1), UPGMA
 𝑡1 = 12
cellular data - ED cellular data - DTW
Effect of changing 𝑡1
 Per-hour analysis
 The length of observation period may also effect the performance
of prediction
cellular data - ED cellular data - DTW
Time consumption
 The required time can be acceptable for both methods in a
per-hour analysis.
 However, need caution for DTW when many profiles are used
cellular data - ED cellular data - DTW
Distribution of α
 The distribution of optimal α is focused in range [1,2], allowing
us to easily limit the range of α
 Distribution of neighbors is heterogeneous, but most are < 20
Conclusion & Future work
Conclusion & Future work
 We have proposed a methodology for online
prediction of mobile time-series datasets
 Acceptable time for our current dataset
 Can be used for other time-series datasets in various
IoT environment
 Further studies include
 Testing in a bigger scale dataset
Any Questions?
Appendix – Wi-Fi data prediction
Wifi data - ED Wifi data - DTW

More Related Content

What's hot

K means Clustering
K means ClusteringK means Clustering
K means Clustering
Edureka!
 
★Mean shift a_robust_approach_to_feature_space_analysis
★Mean shift a_robust_approach_to_feature_space_analysis★Mean shift a_robust_approach_to_feature_space_analysis
★Mean shift a_robust_approach_to_feature_space_analysis
irisshicat
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
butest
 
Modelling Accessibility Performance in LTE networks, An Analytics Methodology
Modelling Accessibility Performance in LTE networks, An Analytics MethodologyModelling Accessibility Performance in LTE networks, An Analytics Methodology
Modelling Accessibility Performance in LTE networks, An Analytics Methodology
alien_gmx
 

What's hot (20)

K-means Clustering
K-means ClusteringK-means Clustering
K-means Clustering
 
Customer Segmentation using Clustering
Customer Segmentation using ClusteringCustomer Segmentation using Clustering
Customer Segmentation using Clustering
 
Proximity Detection in Distributed Simulation of Wireless Mobile Systems
Proximity Detection in Distributed Simulation of Wireless Mobile SystemsProximity Detection in Distributed Simulation of Wireless Mobile Systems
Proximity Detection in Distributed Simulation of Wireless Mobile Systems
 
K-Means clustring @jax
K-Means clustring @jaxK-Means clustring @jax
K-Means clustring @jax
 
Grid based method & model based clustering method
Grid based method & model based clustering methodGrid based method & model based clustering method
Grid based method & model based clustering method
 
K means Clustering
K means ClusteringK means Clustering
K means Clustering
 
★Mean shift a_robust_approach_to_feature_space_analysis
★Mean shift a_robust_approach_to_feature_space_analysis★Mean shift a_robust_approach_to_feature_space_analysis
★Mean shift a_robust_approach_to_feature_space_analysis
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
 
Enhance The K Means Algorithm On Spatial Dataset
Enhance The K Means Algorithm On Spatial DatasetEnhance The K Means Algorithm On Spatial Dataset
Enhance The K Means Algorithm On Spatial Dataset
 
3.6 constraint based cluster analysis
3.6 constraint based cluster analysis3.6 constraint based cluster analysis
3.6 constraint based cluster analysis
 
"k-means-clustering" presentation @ Papers We Love Bucharest
"k-means-clustering" presentation @ Papers We Love Bucharest"k-means-clustering" presentation @ Papers We Love Bucharest
"k-means-clustering" presentation @ Papers We Love Bucharest
 
K means and dbscan
K means and dbscanK means and dbscan
K means and dbscan
 
Types of clustering and different types of clustering algorithms
Types of clustering and different types of clustering algorithmsTypes of clustering and different types of clustering algorithms
Types of clustering and different types of clustering algorithms
 
New Approach for K-mean and K-medoids Algorithm
New Approach for K-mean and K-medoids AlgorithmNew Approach for K-mean and K-medoids Algorithm
New Approach for K-mean and K-medoids Algorithm
 
Modelling Accessibility Performance in LTE networks, An Analytics Methodology
Modelling Accessibility Performance in LTE networks, An Analytics MethodologyModelling Accessibility Performance in LTE networks, An Analytics Methodology
Modelling Accessibility Performance in LTE networks, An Analytics Methodology
 
Clique
Clique Clique
Clique
 
K means clustering algorithm
K means clustering algorithmK means clustering algorithm
K means clustering algorithm
 
Unsupervised Learning
Unsupervised LearningUnsupervised Learning
Unsupervised Learning
 
3.2 partitioning methods
3.2 partitioning methods3.2 partitioning methods
3.2 partitioning methods
 
Kmeans
KmeansKmeans
Kmeans
 

Viewers also liked

Hybrid neural networks for time series learning by Tian Guo, EPFL, Switzerland
Hybrid neural networks for time series learning by Tian Guo,  EPFL, SwitzerlandHybrid neural networks for time series learning by Tian Guo,  EPFL, Switzerland
Hybrid neural networks for time series learning by Tian Guo, EPFL, Switzerland
EuroIoTa
 

Viewers also liked (9)

Hybrid neural networks for time series learning by Tian Guo, EPFL, Switzerland
Hybrid neural networks for time series learning by Tian Guo,  EPFL, SwitzerlandHybrid neural networks for time series learning by Tian Guo,  EPFL, Switzerland
Hybrid neural networks for time series learning by Tian Guo, EPFL, Switzerland
 
Data analytics for monitoring IoT infrastructures by G.Madhusudan, Orange Labs
Data analytics for monitoring IoT infrastructures by G.Madhusudan, Orange LabsData analytics for monitoring IoT infrastructures by G.Madhusudan, Orange Labs
Data analytics for monitoring IoT infrastructures by G.Madhusudan, Orange Labs
 
OpenML.org: Networked Science and IoT Data Streams by Jan van Rijn, Universit...
OpenML.org: Networked Science and IoT Data Streams by Jan van Rijn, Universit...OpenML.org: Networked Science and IoT Data Streams by Jan van Rijn, Universit...
OpenML.org: Networked Science and IoT Data Streams by Jan van Rijn, Universit...
 
5G: a revolution or an evolution for IoT by Merouane DEBBAH, Huawei
5G: a revolution or an evolution for IoT by Merouane DEBBAH, Huawei5G: a revolution or an evolution for IoT by Merouane DEBBAH, Huawei
5G: a revolution or an evolution for IoT by Merouane DEBBAH, Huawei
 
Real-Time Big Data Stream Analytics
Real-Time Big Data Stream AnalyticsReal-Time Big Data Stream Analytics
Real-Time Big Data Stream Analytics
 
Grid Analytics Europe 2016: "Open for Business", April 2016
Grid Analytics Europe 2016: "Open for Business", April 2016Grid Analytics Europe 2016: "Open for Business", April 2016
Grid Analytics Europe 2016: "Open for Business", April 2016
 
Big data/Data Mining/IoT/Smart City
Big data/Data Mining/IoT/Smart CityBig data/Data Mining/IoT/Smart City
Big data/Data Mining/IoT/Smart City
 
Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry
 
Predictive Analytics in Telecommunication
Predictive Analytics in TelecommunicationPredictive Analytics in Telecommunication
Predictive Analytics in Telecommunication
 

Similar to Variable neighborhood Prediction of temporal collective profiles by Keun-Woo Lim, Telecom ParisTech

Data Accuracy Models under Spatio - Temporal Correlation with Adaptive Strate...
Data Accuracy Models under Spatio - Temporal Correlation with Adaptive Strate...Data Accuracy Models under Spatio - Temporal Correlation with Adaptive Strate...
Data Accuracy Models under Spatio - Temporal Correlation with Adaptive Strate...
IDES Editor
 
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
csandit
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013
Pedro Lopes
 

Similar to Variable neighborhood Prediction of temporal collective profiles by Keun-Woo Lim, Telecom ParisTech (20)

Data Accuracy Models under Spatio - Temporal Correlation with Adaptive Strate...
Data Accuracy Models under Spatio - Temporal Correlation with Adaptive Strate...Data Accuracy Models under Spatio - Temporal Correlation with Adaptive Strate...
Data Accuracy Models under Spatio - Temporal Correlation with Adaptive Strate...
 
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
 
Combination of Similarity Measures for Time Series Classification using Genet...
Combination of Similarity Measures for Time Series Classification using Genet...Combination of Similarity Measures for Time Series Classification using Genet...
Combination of Similarity Measures for Time Series Classification using Genet...
 
Estimation of global solar radiation by using machine learning methods
Estimation of global solar radiation by using machine learning methodsEstimation of global solar radiation by using machine learning methods
Estimation of global solar radiation by using machine learning methods
 
DCE: A NOVEL DELAY CORRELATION MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE REAL...
DCE: A NOVEL DELAY CORRELATION  MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE  REAL...DCE: A NOVEL DELAY CORRELATION  MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE  REAL...
DCE: A NOVEL DELAY CORRELATION MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE REAL...
 
USING LEARNING AUTOMATA AND GENETIC ALGORITHMS TO IMPROVE THE QUALITY OF SERV...
USING LEARNING AUTOMATA AND GENETIC ALGORITHMS TO IMPROVE THE QUALITY OF SERV...USING LEARNING AUTOMATA AND GENETIC ALGORITHMS TO IMPROVE THE QUALITY OF SERV...
USING LEARNING AUTOMATA AND GENETIC ALGORITHMS TO IMPROVE THE QUALITY OF SERV...
 
Data mining projects topics for java and dot net
Data mining projects topics for java and dot netData mining projects topics for java and dot net
Data mining projects topics for java and dot net
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
 
08039246
0803924608039246
08039246
 
M017327378
M017327378M017327378
M017327378
 
Scheduling Using Multi Objective Genetic Algorithm
Scheduling Using Multi Objective Genetic AlgorithmScheduling Using Multi Objective Genetic Algorithm
Scheduling Using Multi Objective Genetic Algorithm
 
L010628894
L010628894L010628894
L010628894
 
LIDAR POINT CLOUD CLASSIFICATION USING EXPECTATION MAXIMIZATION ALGORITHM
LIDAR POINT CLOUD CLASSIFICATION USING EXPECTATION MAXIMIZATION ALGORITHMLIDAR POINT CLOUD CLASSIFICATION USING EXPECTATION MAXIMIZATION ALGORITHM
LIDAR POINT CLOUD CLASSIFICATION USING EXPECTATION MAXIMIZATION ALGORITHM
 
Lidar Point Cloud Classification Using Expectation Maximization Algorithm
Lidar Point Cloud Classification Using Expectation Maximization AlgorithmLidar Point Cloud Classification Using Expectation Maximization Algorithm
Lidar Point Cloud Classification Using Expectation Maximization Algorithm
 
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMSIMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
 
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMSIMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
 
Accurate indoor positioning system based on modify nearest point technique
Accurate indoor positioning system based on modify nearest point techniqueAccurate indoor positioning system based on modify nearest point technique
Accurate indoor positioning system based on modify nearest point technique
 
Data collection in multi application sharing wireless sensor networks
Data collection in multi application sharing wireless sensor networksData collection in multi application sharing wireless sensor networks
Data collection in multi application sharing wireless sensor networks
 
Performance of Matching Algorithmsfor Signal Approximation
Performance of Matching Algorithmsfor Signal ApproximationPerformance of Matching Algorithmsfor Signal Approximation
Performance of Matching Algorithmsfor Signal Approximation
 

Recently uploaded

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 

Recently uploaded (20)

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 

Variable neighborhood Prediction of temporal collective profiles by Keun-Woo Lim, Telecom ParisTech

  • 1. VARIABLE NEIGHBORHOOD PREDICTION OF TEMPORAL COLLECTIVE PROFILES Presentation for EuroIoTA ’16 Speaker: Keun-Woo Lim Telecom Paristech 24-11-2016
  • 2. Brief Overview  What do we do in this work?  Analysis of temporal collective profiles (time-series)  Use of mobile datasets (cellular, Wi-Fi)  Real–time & Lightweight prediction (online prediction)  What do we try to achieve?  Better prediction accuracy  Lower computational complexity  Better application & use case
  • 3. Contents Contents  Introduction  Methodology  Prediction  Outlier Detection  Future Work
  • 5. Temporal collective profiles  Representation of data that aggregate behavior of group of individuals – over time  Can be categorized in various ways  “Daily Profiles”
  • 6. What are collected?  Basic telephone calls and SMS?  However, we want to focus on more specific matters  Specific application data  Usage of Internet service
  • 7. Why do we analyze these data?  For “online network analysis”  Real-time prediction of the near-future  Recognition of sudden changes/outliers  Timely adaptation  Use cases  Resource allocation  Traffic handling  Social behavior
  • 8. Requirements  Low computational complexity  Lightweight prediction methods  Accuracy  Still have to be accurate
  • 9. Dataset  Cellular mobile dataset  1-week data from 90 lacs in Paris  More than 500 daily profiles  Wi-Fi cloud dataset  122 days (March 1st to June 30th, 2014)  60 million URL connection logs (Top 20 mobile applications)
  • 11. What should we do with daily profiles?  Daily profiles can be:  Very similar to each other (same day, location, etc.)  Very different too (outlier, events)  We use methods to calculate similarity  Cluster similar profiles  Distinguish different profiles
  • 12. Previous work (Offline analysis)1  Utilization of clustering methods (UPGMA)  With similarity comparison techniques (DTW, quantiles)  Not ideal in online data analysis  Clustering may take long time (𝑂(𝑀2 𝑁3)with DTW) 1K. Lim, S. Secci, L. Tabourier, B. Tebbani, “Characterizing and predicting mobile application usage,” https://hal.archives-ouvertes.fr/hal-01345824/document
  • 13. Profile similarity  We use two examples of similarity measures (M values in a time-series)  Euclidean distance (ED) = Θ(M)  Dynamic time warping (DTW) = Θ(M2)  For specific dataset containing N profiles,  ED = Θ(N2M)  DTW = Θ(N2M2) to compare all with each other
  • 14. Weighted graph representation  Using similarity measures, we acquire a graph structure of neighbors  E.g., if ED is used, lower value = more similar
  • 15. Filtering paths  Filter neighbors with high distance  Depending on the value of α, the number of neighbors change for all profiles
  • 16. Visualization of graph structure  Example graph structure for ED – cellular dataset
  • 18. Principle of VNP  For a new day 𝑥 𝑛(𝑡), we configure  𝑡0 = 0, 𝑡1 = 0~24, 𝑡2 = 24 (hour)  Objective  Observation period = 𝑥 𝑛 𝑡0, 𝑡1  Create a temporal profile to predict 𝑥 𝑛 𝑡1, 𝑡2  Find 𝑥 from the observation period  The closest profile 𝑥, in 𝑥 𝑡0, 𝑡1 and 𝑥 𝑛 𝑡0, 𝑡1
  • 19. Find the neighbors  Using closest neighbor 𝑥, we find the group of neighbors 𝑁 𝑛 to be used for prediction  For any other profile y of the training set,  𝑦 ∈ 𝑁𝑛 𝑖𝑖𝑓 𝑠 𝑥 𝑛 𝑡0, 𝑡1 , 𝑦 𝑡0, 𝑡1 ≤ 𝑎 ∙ 𝑠 𝑥 𝑛 𝑡0, 𝑡1 , 𝑥 𝑡0, 𝑡1
  • 20. Creating the prediction profile  Using 𝑁 𝑛, formulate the prediction  𝑥 𝑛 𝑡 = σ 𝑦∈𝑁 𝑛 𝑠(𝑥 𝑛,𝑦)∙𝑦(𝑡) σ 𝑦∈𝑁 𝑛 𝑠(𝑥 𝑛,𝑦)  Simply put, it is the weighted average over the profiles of its neighborhood
  • 21. Training Parameter 𝑎  𝑎 can be tuned to select the optimal number of neighbors  Variable neighborhood search to find the 𝑎 that yields the highest accuracy over time  E.g. 1.0 < 𝑎 < 10.0  Drawbacks  Increase in complexity (recalculate for each 𝑎)
  • 22. Calculating multiple 𝑡1  For a more fine-grained prediction, multiple 𝑡1 can be used in one day  Repetition of the VNP (e.g. per-hour analysis)
  • 23. Handling Complexity - VNP  Computation of calculating neighborhood of target day per 𝑎 :  ED = Θ(NM)  DTW = Θ(NM2)  Depending on N, this can be large in practice  Also, in case of multiple 𝑡1 analysis, large M can also impact
  • 24. Handling Complexity - Graph  Can be heavy  ED = Θ(N2M)  DTW = Θ(N2M2)  Luckily, graph representation is only updated once per day  Although, needed for various M in case of multiple 𝑡1 analysis  Also, space partitioning can be used to reduce time  Via Kd-tree  This can reduce complexity of ED to Θ(log(N)M)
  • 26. Prediction accuracy analysis  Prediction through relative error, defined as  𝜀 = σ 𝑡=𝑡1 𝑡2 𝑥 𝑛 𝑡 − ෣𝑥 𝑛 𝑡 2 σ 𝑡=𝑡1 𝑡2 𝑥 𝑛 𝑡 2  Comparison with closest neighbor ( 𝑎 =1), UPGMA  𝑡1 = 12 cellular data - ED cellular data - DTW
  • 27. Effect of changing 𝑡1  Per-hour analysis  The length of observation period may also effect the performance of prediction cellular data - ED cellular data - DTW
  • 28. Time consumption  The required time can be acceptable for both methods in a per-hour analysis.  However, need caution for DTW when many profiles are used cellular data - ED cellular data - DTW
  • 29. Distribution of α  The distribution of optimal α is focused in range [1,2], allowing us to easily limit the range of α  Distribution of neighbors is heterogeneous, but most are < 20
  • 31. Conclusion & Future work  We have proposed a methodology for online prediction of mobile time-series datasets  Acceptable time for our current dataset  Can be used for other time-series datasets in various IoT environment  Further studies include  Testing in a bigger scale dataset
  • 33. Appendix – Wi-Fi data prediction Wifi data - ED Wifi data - DTW