Free the data

Photo by Tom Raftery http://www.flickr.com/photos/traftery/4773457853/sizes/l

Why?

Because we will get new insights

Issues and Considerations regarding Sharable Data Sets for Recommender Systems in TEL
29.09.2010 RecSysTEL workshop at RecSys and ECTEL conference, Barcelona, Spain

Hendrik Drachsler, #dataTEL
Centre for Learning Sciences and Technology
@ Open University of the Netherlands

What is dataTEL?
• dataTEL is a Theme Team funded by the STELLAR network of excellence.
• It addresses two STELLAR Grand Challenges:
  1. Connecting Learners
  2. Contextualisation

Recommenders in TEL

The TEL recommenders are a bit like this...
We need to select for each application an appropriate recsys that fits its needs.

But...
“The performance results of different research efforts in recommender systems are hardly comparable.”
(Manouselis et al., 2010)

Photo by Kaptain Kobold http://www.flickr.com/photos/kaptainkobold/3203311346/

The TEL recommender experiments lack transparency. They need to be repeatable to test:
• Validity
• Verification
• Comparison of results

How others compare their recommenders

What kind of data set can we expect in TEL?

Graphic by J. Cross, Informal Learning, Pfeiffer (2006)

The Long Tail

The Long Tail of Learning
• Formal Data
• Informal Data

Graphic: Wilkins, D. (2009); Long tail concept: Anderson, C. (2004)

Guidelines for Data Set Aggregation
1. Create a data set that realistically reflects the variables of the learning setting.
2. Use a sufficiently large set of user profiles.
3. Create data sets that are comparable to others.

Image: Justin Marshall, Coded Ornament by rootoftwo http://www.flickr.com/photos/rootoftwo/267285816

dataTEL::Objectives
Standardize research on recommender systems in TEL

Five core questions:
 1. How can data sets be shared according to privacy and legal protection rights?
 2. How to develop a policy to use and share data sets?
 3. How to pre-process data sets to make them suitable for other researchers?
 4. How to define common evaluation criteria for TEL recommender systems?
 5. How to develop overview methods to monitor the performance of TEL recommender systems on data sets?

1. Protection Rights
OVERSHARING
Were the founders of PleaseRobMe.com actually allowed to grab the data from the web and present it in that way?

Are we allowed to use data from social handles and reuse it for research purposes?

And there is more: company protection rights!

2. Policies to use Data




3. Pre-Process Data Sets
For informal data sets:

1. Collect data
2. Process data
3. Share data

For formal data sets from LMS:

1. Data storing scripts
2. Anonymisation scripts

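The anonymisation step for LMS data could look roughly like the Python sketch below. It is an illustration only, not the dataTEL scripts: the field names (`user_id`, `name`, `email`) and the keyed-hash approach are assumptions. Strictly speaking, a keyed hash gives pseudonymisation rather than full anonymisation, so whether it is sufficient for sharing depends on the legal setting.

```python
import hashlib
import hmac

# Hypothetical field names; a real LMS export will use its own schema.
DIRECT_IDENTIFIERS = {"name", "email"}


def pseudonymise(user_id: str, secret_key: bytes) -> str:
    """Return a stable pseudonym for user_id using a keyed hash (HMAC-SHA256)."""
    digest = hmac.new(secret_key, user_id.encode("utf-8"), hashlib.sha256)
    # Shorten the hex digest to keep the shared data set readable.
    return digest.hexdigest()[:16]


def anonymise_record(record: dict, secret_key: bytes) -> dict:
    """Drop direct identifiers and replace the user id with a pseudonym."""
    cleaned = {k: v for k, v in record.items() if k not in DIRECT_IDENTIFIERS}
    cleaned["user_id"] = pseudonymise(str(record["user_id"]), secret_key)
    return cleaned


if __name__ == "__main__":
    # The key must be kept outside the shared data set.
    key = b"replace-with-a-secret-key"
    row = {
        "user_id": "s1234567",
        "name": "Jane Doe",
        "email": "jane@example.org",
        "resource_id": "course-42",
        "action": "view",
    }
    print(anonymise_record(row, key))
```

Because the same key always maps a learner to the same pseudonym, interaction histories stay linkable within the data set, which recommender experiments typically need.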
4. Evaluation Criteria

Combined approach by Drachsler et al. 2008:
 1. Accuracy
 2. Coverage
 3. Precision

 1. Effectiveness of learning
 2. Efficiency of learning
 3. Drop out rate
 4. Satisfaction

Kirkpatrick model by Manouselis et al. 2010:
 1. Reaction of learner
 2. Learning improved
 3. Behaviour
 4. Results

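The three technical criteria (accuracy, coverage, precision) can be made concrete with a small Python sketch. The slides do not define the measures, so the choices here are assumptions: accuracy is taken as mean absolute error on predicted ratings, coverage as the share of catalogue items that are ever recommended, and precision as precision at k. All function names are illustrative.

```python
from typing import Dict, List, Set, Tuple

Pair = Tuple[str, str]  # (user, item)


def mean_absolute_error(predicted: Dict[Pair, float], actual: Dict[Pair, float]) -> float:
    """Accuracy as MAE over the (user, item) pairs present in both dicts."""
    shared = predicted.keys() & actual.keys()
    if not shared:
        raise ValueError("no overlapping (user, item) pairs")
    return sum(abs(predicted[p] - actual[p]) for p in shared) / len(shared)


def catalogue_coverage(recommendations: Dict[str, List[str]], catalogue: Set[str]) -> float:
    """Share of catalogue items that appear in at least one recommendation list."""
    if not catalogue:
        raise ValueError("catalogue must not be empty")
    recommended: Set[str] = set()
    for items in recommendations.values():
        recommended.update(items)
    return len(recommended & catalogue) / len(catalogue)


def precision_at_k(recommended: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of the top-k slots filled with relevant items (short lists count as misses)."""
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for item in recommended[:k] if item in relevant)
    return hits / k


if __name__ == "__main__":
    predicted = {("u1", "a"): 4.0, ("u1", "b"): 2.0}
    actual = {("u1", "a"): 5.0, ("u1", "b"): 2.0, ("u2", "a"): 1.0}
    print(mean_absolute_error(predicted, actual))
    print(catalogue_coverage({"u1": ["a", "b"], "u2": ["b", "c"]}, {"a", "b", "c", "d"}))
    print(precision_at_k(["a", "x", "b", "y"], {"a", "b"}, 2))
```

The educational criteria (effectiveness, efficiency, drop out rate, satisfaction) depend on study design and learner outcomes, so they are not sketched here.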
5. Data Set Framework to Monitor Performance

Datasets
Formal                                                Informal

   Data A                   Data B                   Data C

Algorithms:            Algorithms:            Algorithms:
Algorithm A            Algorithm D            Algorithm B
Algorithm B            Algorithm E            Algorithm D
Algorithm C

Models:                Models:                Models:
Learner Model A        Learner Model C        Learner Model A
Learner Model B        Learner Model E        Learner Model C

Measured attributes:   Measured attributes:   Measured attributes:
Attribute A            Attribute A            Attribute A
Attribute B            Attribute B            Attribute B
Attribute C            Attribute C            Attribute C

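One way to picture such a framework is as a small registry that records, per data set, which algorithms and learner models were run and which attributes were measured. The Python sketch below is an assumption about how this could be represented, not a dataTEL implementation. The class and function names are hypothetical, and the entries reuse the placeholder names from the slide.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class DatasetEntry:
    """What has been evaluated on one data set (hypothetical structure)."""
    name: str
    algorithms: Set[str] = field(default_factory=set)
    learner_models: Set[str] = field(default_factory=set)
    measured_attributes: Set[str] = field(default_factory=set)


def datasets_by_algorithm(entries: List[DatasetEntry]) -> Dict[str, List[str]]:
    """Map each algorithm to the data sets it has been evaluated on."""
    index: Dict[str, List[str]] = {}
    for entry in entries:
        for algorithm in sorted(entry.algorithms):
            index.setdefault(algorithm, []).append(entry.name)
    return index


def comparable_algorithms(entries: List[DatasetEntry]) -> Dict[str, List[str]]:
    """Algorithms evaluated on more than one data set, so results can be compared."""
    return {a: ds for a, ds in datasets_by_algorithm(entries).items() if len(ds) > 1}


ATTRIBUTES = {"Attribute A", "Attribute B", "Attribute C"}

# Placeholder entries taken from the slide, ordered from formal to informal.
FRAMEWORK = [
    DatasetEntry("Data A", {"Algorithm A", "Algorithm B", "Algorithm C"},
                 {"Learner Model A", "Learner Model B"}, set(ATTRIBUTES)),
    DatasetEntry("Data B", {"Algorithm D", "Algorithm E"},
                 {"Learner Model C", "Learner Model E"}, set(ATTRIBUTES)),
    DatasetEntry("Data C", {"Algorithm B", "Algorithm D"},
                 {"Learner Model A", "Learner Model C"}, set(ATTRIBUTES)),
]

if __name__ == "__main__":
    print(comparable_algorithms(FRAMEWORK))
```

With the slide's placeholder values, only Algorithm B and Algorithm D appear on more than one data set, which is the kind of overlap an overview method would need in order to compare performance across formal and informal data.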
Join us for a Coffee ...
http://www.teleurope.eu/pg/groups/9405/datatel/




Upcoming event ...
Workshop: dataTEL - Data Sets for Technology Enhanced Learning
Date: March 30th to March 31st, 2011
Location: Ski resort La Clusaz in the French Alps, Massif des Aravis
Funding: Food and lodging for 3 nights for 10 selected participants
Submissions: To be considered for funding, please send extended abstracts to
http://www.easychair.org/conferences/?conf=datatel2011
Deadline for submissions: October 25th, 2010
CfP: http://www.teleurope.eu/pg/pages/view/46082/

Many thanks for your interest
   These slides are available at:
   http://www.slideshare.com/Drachsler

   Email:       hendrik.drachsler@ou.nl
   Skype:       celstec-hendrik.drachsler
   Blogging at: http://www.drachsler.de
   Twittering at: http://twitter.com/HDrachsler

Issues and Considerations regarding Sharable Data Sets for Recommender Systems in TEL

  • 1. 1
  • 2. Free the data by Tom Raftery http://www.flickr.com/photos/traftery/4773457853/sizes/l 2
  • 3. Why ? by Tom Raftery http://www.flickr.com/photos/traftery/4773457853/sizes/l 2
  • 4. Because we will get new insights by Tom Raftery http://www.flickr.com/photos/traftery/4773457853/sizes/l 2
  • 5. Issues and Considerations regarding Sharable Data Sets for Recommender Systems in TEL 29.09.2010 RecSysTEL workshop at RecSys and ECTEL conference, Barcelona, Spain Hendrik Drachsler #dataTEL Centre for Learning Sciences and Technology @ Open University of the Netherlands 3
  • 6. What is dataTEL • dataTEL is a Theme Team funded by the STELLAR network of excellence. • It addresses 2 STELLAR Grand Challenges 1. Connecting Learner 2. Contextualisation 4
  • 8. The TEL recommenders are a bit like this... 6
  • 9. The TEL recommenders are a bit like this... We need to select for each application an appropriate recsys that fits its needs. 6
  • 10. But... “The performance results of different research efforts in recommender systems are hardly comparable.” (Manouselis et al., 2010) Kaptain Kobold http://www.flickr.com/photos/kaptainkobold/3203311346/ 7
  • 11. But... “The performance results of different research efforts in recommender systems are hardly comparable.” (Manouselis et al., 2010) The TEL recommender experiments lack transparency. They need to be repeatable to test: • Validity • Verification • Compare results Kaptain Kobold http://www.flickr.com/photos/kaptainkobold/3203311346/ 7
  • 12. How others compare their recommenders 8
  • 13. What kind of data set can we expect in TEL? Graphic by J. Cross, Informal Learning, Pfeiffer (2006) 9
  • 14. The Long Tail Graphic Wilkins, D. (2009); Long tail concept Anderson, C. (2004) 10
  • 15. The Long Tail of Learning Graphic Wilkins, D. (2009); Long tail concept Anderson, C. (2004) 10
  • 16. The Long Tail of Learning Formal Data Informal Data Graphic Wilkins, D. (2009); Long tail concept Anderson, C. (2004) 10
  • 17. Guidelines for Data Set Aggregation Justin Marshall, Coded Ornament by rootoftwo http://www.flickr.com/photos/rootoftwo/267285816 11
  • 18. Guidelines for Data Set Aggregation 1. Create a data set that realistically reflects the variables of the learning setting. Justin Marshall, Coded Ornament by rootoftwo http://www.flickr.com/photos/rootoftwo/267285816 11
  • 19. Guidelines for Data Set Aggregation 1. Create a data set that realistically reflects the variables of the learning setting. 2. Use a sufficiently large set of user profiles Justin Marshall, Coded Ornament by rootoftwo http://www.flickr.com/photos/rootoftwo/267285816 11
  • 20. Guidelines for Data Set Aggregation 1. Create a data set that realistically reflects the variables of the learning setting. 2. Use a sufficiently large set of user profiles 3. Create data sets that are comparable to others Justin Marshall, Coded Ornament by rootoftwo http://www.flickr.com/photos/rootoftwo/267285816 11
  • 22. dataTEL::Objectives Standardize research on recommender systems in TEL Five core questions: 1. How can data sets be shared according to privacy and legal protection rights? 2. How to develop a policy to use and share data sets? 3. How to pre-process data sets to make them suitable for other researchers? 4. How to define common evaluation criteria for TEL recommender systems? 5. How to develop overview methods to monitor the performance of TEL recommender systems on data sets? 12
  • 25. 1. Protection Rights OVERSHARING 13
  • 26. 1. Protection Rights OVERSHARING Were the founders of PleaseRobMe.com actually allowed to grab the data from the web and present it in that way? 13
  • 27. 1. Protection Rights OVERSHARING Were the founders of PleaseRobMe.com actually allowed to grab the data from the web and present it in that way? Are we allowed to use data from social handles and reuse it for research purposes? 13
  • 28. 1. Protection Rights OVERSHARING Were the founders of PleaseRobMe.com actually allowed to grab the data from the web and present it in that way? Are we allowed to use data from social handles and reuse it for research purposes? And there is more: company protection rights! 13
  • 29. 2. Policies to use Data 14
  • 36. 3. Pre-Process Data Sets For informal data sets: 1. Collect data 2. Process data 3. Share data For formal data sets from LMS: 1. Data storing scripts 2. Anonymisation scripts 15
  • 37. 4. Evaluation Criteria Combined approach by Drachsler et al. 2008; Kirkpatrick model by Manouselis et al. 2010 16
  • 38. 4. Evaluation Criteria 1. Accuracy 2. Coverage 3. Precision Combined approach by Drachsler et al. 2008; Kirkpatrick model by Manouselis et al. 2010 16
  • 39. 4. Evaluation Criteria 1. Accuracy 2. Coverage 3. Precision 1. Effectiveness of learning 2. Efficiency of learning 3. Drop out rate 4. Satisfaction Combined approach by Drachsler et al. 2008; Kirkpatrick model by Manouselis et al. 2010 16
  • 40. 4. Evaluation Criteria Combined approach by Drachsler et al. 2008: 1. Accuracy 2. Coverage 3. Precision; 1. Effectiveness of learning 2. Efficiency of learning 3. Drop out rate 4. Satisfaction. Kirkpatrick model by Manouselis et al. 2010: 1. Reaction of learner 2. Learning improved 3. Behaviour 4. Results 16
  • 43. 5. Data Set Framework to Monitor Performance Datasets: Formal, Informal. Data A: Algorithms: Algorithm A, Algorithm B, Algorithm C; Models: Learner Model A, Learner Model B; Measured attributes: Attribute A, Attribute B, Attribute C. Data B: Algorithms: Algorithm D, Algorithm E; Models: Learner Model C, Learner Model E; Measured attributes: Attribute A, Attribute B, Attribute C. Data C: Algorithms: Algorithm B, Algorithm D; Models: Learner Model A, Learner Model C; Measured attributes: Attribute A, Attribute B, Attribute C. 17
  • 44. Join us for a Coffee ... http://www.teleurope.eu/pg/groups/9405/datatel/ 18
  • 45. Upcoming event ... Workshop: dataTEL - Data Sets for Technology Enhanced Learning Date: March 30th to March 31st, 2011 Location: Ski resort La Clusaz in the French Alps, Massif des Aravis Funding: Food and lodging for 3 nights for 10 selected participants Submissions: To be considered for funding, please send extended abstracts to http://www.easychair.org/conferences/?conf=datatel2011 Deadline for submissions: October 25th, 2010 CfP: http://www.teleurope.eu/pg/pages/view/46082/ 19
  • 46. Many thanks for your interest. These slides are available at: http://www.slideshare.com/Drachsler Email: hendrik.drachsler@ou.nl Skype: celstec-hendrik.drachsler Blogging at: http://www.drachsler.de Twittering at: http://twitter.com/HDrachsler 20
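
The evaluation-criteria slides list accuracy, coverage and precision as technical measures but do not define them. The following is a minimal sketch of how two of them might be computed for top-N recommendations on a shared data set. It is not taken from the slides: the function names, the learner and resource identifiers, and the choice of precision-at-N and catalogue coverage as concrete definitions are all assumptions made for illustration.

```python
from typing import Dict, List, Set


def precision_at_n(recommended: List[str], relevant: Set[str], n: int) -> float:
    """Fraction of the top-n recommended items that the learner found relevant."""
    top_n = recommended[:n]
    if not top_n:
        return 0.0
    hits = sum(1 for item in top_n if item in relevant)
    return hits / len(top_n)


def mean_precision_at_n(
    recommendations: Dict[str, List[str]],
    relevant_by_user: Dict[str, Set[str]],
    n: int,
) -> float:
    """Average precision-at-n over all learners that received recommendations."""
    scores = [
        precision_at_n(recs, relevant_by_user.get(user, set()), n)
        for user, recs in recommendations.items()
    ]
    return sum(scores) / len(scores) if scores else 0.0


def catalogue_coverage(
    recommendations: Dict[str, List[str]], catalogue: Set[str], n: int
) -> float:
    """Share of catalogue items that appear in at least one learner's top-n list."""
    if not catalogue:
        return 0.0
    shown = {item for recs in recommendations.values() for item in recs[:n]}
    return len(shown & catalogue) / len(catalogue)


# Hypothetical toy data: five learning resources and two learners.
catalogue = {"r1", "r2", "r3", "r4", "r5"}
recommendations = {
    "learner_a": ["r1", "r2", "r3"],
    "learner_b": ["r2", "r4", "r1"],
}
relevant_by_user = {
    "learner_a": {"r1", "r3"},
    "learner_b": {"r4"},
}

print("mean precision@2:", mean_precision_at_n(recommendations, relevant_by_user, 2))
print("coverage@2:", catalogue_coverage(recommendations, catalogue, 2))
```

Reporting such measures with the same definitions and the same N on the same shared data set is one way to make results from different recommender experiments comparable.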