SlideShare a Scribd company logo
1 of 24
Analysis on Steam Platform
Team 3
Jianjun Luo
Sushma Tayanna
Wen Liu
Yousef Fadila
Zhanfeng Huang
Which of these games is more affordable?
59.99$ 9.99$ 4.99$
?
Background: Global Video Games Market is HUGE!
Why Analyzing Steam’s Data?
● Steam is the dominant platform serving at the
de facto PC game distribution hub.
● Total games owned: 3 Billions!
● Real time statistics of Steam’s users and games
shed lights on a market of billions.
Guiding Questions
1)How many valid users and active users there are on Steam?
2)How much time do Steam’s users spend on Steam?
3)How much money do Steam’s users spend on Steam?
4)What is the Price–performance ratio (Avg. Cost Per Hour) of Steam's games?
Sampling Method
● Steam ID is assigned a 64-bit unique ID in a sequential manner
● Steam ID space length is 462,322,592
● Query Steam web API by a generated steam ID can get this user’s summary and game list
● Question: how to sample a certain amount of users for our analysis? Simple Random Sampling?
Stratified Sampling?
Stratified Sampling with
Random Sampled Strata?
Estimating Total Valid Users
● Let Yi = 1 denote the i th ID is a valid user (or active users, private users...) and Yi = 0 otherwise
● N is the total number of IDs in ID space, so the variable of interest is: Y1 ,...,YN
● Let n denote the number of samples, so the variable of interest is: y1 ,..., yn
● For Stratified Sampling, let L denote the number of strata and in our case and h is a specific stratum
● For Stratified Sampling with Random Sampled Strata, let B denote total number of strata we randomly choosed in L
Est. of Simple Random Sampling
Est. of Stratified Sampling
Est. of Stratified Sampling with
Random Sampled Strata
where
Actually in our case, the strata are equal length and number of
samples in each stratum or chosen bucket is also equal...
Variance!
The variance of estimator of population total is not same
Simple Random Sampling
Stratified Sampling
Stratified Sampling with Random Sampled Strata
Number of Invalid Users Samples in Each Stratum
Split 462,322,592 (N) into 1000 Strata (L) and 150 (nh) Samples in each stratum
Stratified Sampling and Estimation of Valid users
● The insight of Stratified Sampling
○ may produce a smaller error of estimation than a simple random sample with same size
○ estimates of population parameters may be desired for subgroups of the population
● For total valid users
○ ID samples: 150,000 from 150 samples (nh) * 1000 strata (L)
○ ID space length (Nh) in each stratum: 462322
○ Number of invalid users in samples: 17541
○ Total invalid users: 54,064,004
○ Total valid users: 408,258,588
● When the stratum sample sizes are at least 30, we can use z to approximate t in t distribution
○ the 90% confidence interval is: 408,258,588 ± 559,047
○ If samples are from uniformed sampling, it is: 408,258,588 ± 799,945
Same analysis can be performed on estimating active users, private users ...
Computational Experiments
● Distribution of Game Price
● Distribution of Number of Games Purchased by Users
● Gamer Behavior
● Price Performance Ratio (Average Cost/Hour)
Distribution of Game Price
48,213 Games in Steam
23.14% of them are free
85.23% of them are less than $10
Distribution of Game Price
99.73% less
than $80
Number of Games Purchased by User
most users purchase less than 25 games
45.61% of users have not purchased any games
Game Behavior - Account Value
25,260 sample users 46% of them never paid for games → $0
99.45% of them are less than $3,000
11.09% of all games bought on
Steam are never played
90.07% of them are played less
than 10 hours
Game Behavior - Average Play Time
Cost/Hour for user: Is playing game expensive?
No, for most users(98%), they spent 2.72$/hour on game playing.
7.09$/h
Ticket only
2.72$/h 11.57$/h
Ticket only
Economical and NOT one time consumption.
Hour/User for Game: “White Elephant” VS “Worth Buying”
The average playing time per game owner:
<1: Unsatisfying Games (36%)
85% <10$, Average price is 7.47$
>=1: Satisfying Games (64%)
96%<10$, Average price is 11.48$
Higher Price cannot guarantee quality, but
satisfying games on average sells more
expensive.
Cost/Hour,User for Game
Case Study: overall gameplay experience
16.489$/h
49.99$
4 hours to finish
Definitely not worth
$50
0.836$/h
59.99$
Urge to play again
Not a waste, even at
full price.
Cost/Hour,User for Game
Case Study: overall gameplay experience
Back To The Question:
Which game is more affordable?
Price could be misleading!
Cost VS Cost-Benefit - Cost per unit of time
59.99$ 9.99$ 4.99$
0.485$/h 1.463$/h 1.921$/h
Conclusion
● Apply the stratified sampling on Steam ID space, and compare it with simple
random sampling and stratified sampling with random sampled strata
● Get the unbiased estimation of total valid user number and the its confidence
interval
● 46% users only play free games or discard the account and play no games.
● Playing computer games is economical comparing to other leisure activities.
● Satisfying game --->Higher Price
● Higher price-performance ratio ---> Better overall gameplay experience
Thank You

More Related Content

What's hot

Shihtzu Exchange, NFT Minting & Metaverse Platform
Shihtzu Exchange, NFT Minting & Metaverse PlatformShihtzu Exchange, NFT Minting & Metaverse Platform
Shihtzu Exchange, NFT Minting & Metaverse PlatformJitinThomas6
 
20 Game Ideas You Should Steal
20 Game Ideas You Should Steal20 Game Ideas You Should Steal
20 Game Ideas You Should StealStuart Dredge
 
Powering-Up Your Game with Twitch
Powering-Up Your Game with TwitchPowering-Up Your Game with Twitch
Powering-Up Your Game with TwitchTwitch Developers
 
Esports Playbook: A Guide for Brands and Marketers
Esports Playbook: A Guide for Brands and MarketersEsports Playbook: A Guide for Brands and Marketers
Esports Playbook: A Guide for Brands and MarketersEsports Group
 
Gamification in 2012: Trends in Consumer and Enterprise Markets with Metrics
Gamification in 2012: Trends in Consumer and Enterprise Markets with MetricsGamification in 2012: Trends in Consumer and Enterprise Markets with Metrics
Gamification in 2012: Trends in Consumer and Enterprise Markets with MetricsWanda Meloni
 
랜덤다이스_라이브 게임 분석_강인중.pdf
랜덤다이스_라이브 게임 분석_강인중.pdf랜덤다이스_라이브 게임 분석_강인중.pdf
랜덤다이스_라이브 게임 분석_강인중.pdfingin1
 
Metaverse White Paper by TENTUPLAY
Metaverse White Paper by TENTUPLAYMetaverse White Paper by TENTUPLAY
Metaverse White Paper by TENTUPLAYEunkyoung Lee
 
Why Live Ops Matters for Casual Games: 3 Stategic Mindset for POs
Why Live Ops Matters for Casual Games: 3 Stategic Mindset for POsWhy Live Ops Matters for Casual Games: 3 Stategic Mindset for POs
Why Live Ops Matters for Casual Games: 3 Stategic Mindset for POsTimShepherd83
 
【de:code 2020】 2020 年も最高のゲームをつくろう! Game Stack でゲーム開発をしよう! ~ マルチプレイサーバー編 ~
【de:code 2020】 2020 年も最高のゲームをつくろう! Game Stack でゲーム開発をしよう! ~ マルチプレイサーバー編 ~【de:code 2020】 2020 年も最高のゲームをつくろう! Game Stack でゲーム開発をしよう! ~ マルチプレイサーバー編 ~
【de:code 2020】 2020 年も最高のゲームをつくろう! Game Stack でゲーム開発をしよう! ~ マルチプレイサーバー編 ~日本マイクロソフト株式会社
 
The Metaverse Economy - by Tommaso Di Bartolo
The Metaverse Economy - by Tommaso Di BartoloThe Metaverse Economy - by Tommaso Di Bartolo
The Metaverse Economy - by Tommaso Di BartoloTommaso Di Bartolo
 
Animoca Brands Presentation on TGV
Animoca Brands Presentation on TGVAnimoca Brands Presentation on TGV
Animoca Brands Presentation on TGVAnimoca Brands
 
Node canvasで作るプロトタイプ
Node canvasで作るプロトタイプNode canvasで作るプロトタイプ
Node canvasで作るプロトタイプH T
 
デジタルゲームにおけるマルチエージェント操作技術
デジタルゲームにおけるマルチエージェント操作技術デジタルゲームにおけるマルチエージェント操作技術
デジタルゲームにおけるマルチエージェント操作技術Youichiro Miyake
 
02. 게임 전투 공식
02. 게임 전투 공식02. 게임 전투 공식
02. 게임 전투 공식Chanman Jo
 
Game Presentation
Game PresentationGame Presentation
Game Presentationguest25b2f3
 
実践イカパケット解析α
実践イカパケット解析α実践イカパケット解析α
実践イカパケット解析αYuki Mizuno
 
eSports: The Biggest Sport You've Probably Never Heard Of
eSports: The Biggest Sport You've Probably Never Heard OfeSports: The Biggest Sport You've Probably Never Heard Of
eSports: The Biggest Sport You've Probably Never Heard Ofsparks & honey
 

What's hot (20)

Shihtzu Exchange, NFT Minting & Metaverse Platform
Shihtzu Exchange, NFT Minting & Metaverse PlatformShihtzu Exchange, NFT Minting & Metaverse Platform
Shihtzu Exchange, NFT Minting & Metaverse Platform
 
The foundations of metaverse
The foundations of metaverseThe foundations of metaverse
The foundations of metaverse
 
20 Game Ideas You Should Steal
20 Game Ideas You Should Steal20 Game Ideas You Should Steal
20 Game Ideas You Should Steal
 
Powering-Up Your Game with Twitch
Powering-Up Your Game with TwitchPowering-Up Your Game with Twitch
Powering-Up Your Game with Twitch
 
Esports Playbook: A Guide for Brands and Marketers
Esports Playbook: A Guide for Brands and MarketersEsports Playbook: A Guide for Brands and Marketers
Esports Playbook: A Guide for Brands and Marketers
 
Gamification in 2012: Trends in Consumer and Enterprise Markets with Metrics
Gamification in 2012: Trends in Consumer and Enterprise Markets with MetricsGamification in 2012: Trends in Consumer and Enterprise Markets with Metrics
Gamification in 2012: Trends in Consumer and Enterprise Markets with Metrics
 
랜덤다이스_라이브 게임 분석_강인중.pdf
랜덤다이스_라이브 게임 분석_강인중.pdf랜덤다이스_라이브 게임 분석_강인중.pdf
랜덤다이스_라이브 게임 분석_강인중.pdf
 
日本のeSportsのユーザー調査
日本のeSportsのユーザー調査日本のeSportsのユーザー調査
日本のeSportsのユーザー調査
 
Metaverse White Paper by TENTUPLAY
Metaverse White Paper by TENTUPLAYMetaverse White Paper by TENTUPLAY
Metaverse White Paper by TENTUPLAY
 
Why Live Ops Matters for Casual Games: 3 Stategic Mindset for POs
Why Live Ops Matters for Casual Games: 3 Stategic Mindset for POsWhy Live Ops Matters for Casual Games: 3 Stategic Mindset for POs
Why Live Ops Matters for Casual Games: 3 Stategic Mindset for POs
 
【de:code 2020】 2020 年も最高のゲームをつくろう! Game Stack でゲーム開発をしよう! ~ マルチプレイサーバー編 ~
【de:code 2020】 2020 年も最高のゲームをつくろう! Game Stack でゲーム開発をしよう! ~ マルチプレイサーバー編 ~【de:code 2020】 2020 年も最高のゲームをつくろう! Game Stack でゲーム開発をしよう! ~ マルチプレイサーバー編 ~
【de:code 2020】 2020 年も最高のゲームをつくろう! Game Stack でゲーム開発をしよう! ~ マルチプレイサーバー編 ~
 
The Metaverse Economy - by Tommaso Di Bartolo
The Metaverse Economy - by Tommaso Di BartoloThe Metaverse Economy - by Tommaso Di Bartolo
The Metaverse Economy - by Tommaso Di Bartolo
 
Animoca Brands Presentation on TGV
Animoca Brands Presentation on TGVAnimoca Brands Presentation on TGV
Animoca Brands Presentation on TGV
 
Node canvasで作るプロトタイプ
Node canvasで作るプロトタイプNode canvasで作るプロトタイプ
Node canvasで作るプロトタイプ
 
デジタルゲームにおけるマルチエージェント操作技術
デジタルゲームにおけるマルチエージェント操作技術デジタルゲームにおけるマルチエージェント操作技術
デジタルゲームにおけるマルチエージェント操作技術
 
02. 게임 전투 공식
02. 게임 전투 공식02. 게임 전투 공식
02. 게임 전투 공식
 
Game Presentation
Game PresentationGame Presentation
Game Presentation
 
実践イカパケット解析α
実践イカパケット解析α実践イカパケット解析α
実践イカパケット解析α
 
Neutrality in the new Cold War
Neutrality in the new Cold WarNeutrality in the new Cold War
Neutrality in the new Cold War
 
eSports: The Biggest Sport You've Probably Never Heard Of
eSports: The Biggest Sport You've Probably Never Heard OfeSports: The Biggest Sport You've Probably Never Heard Of
eSports: The Biggest Sport You've Probably Never Heard Of
 

Similar to Analysis on steam platform

Study on Evaluation Function Design of Mahjong using Supervised Learning
Study on Evaluation Function Design of Mahjong using Supervised LearningStudy on Evaluation Function Design of Mahjong using Supervised Learning
Study on Evaluation Function Design of Mahjong using Supervised Learningharmonylab
 
Anz Stadium Case Study - SeatShare Concept Test - CheckinLine
Anz Stadium Case Study - SeatShare Concept Test - CheckinLineAnz Stadium Case Study - SeatShare Concept Test - CheckinLine
Anz Stadium Case Study - SeatShare Concept Test - CheckinLineCheckinLine
 
Virtual Fandomonium: Considerations and Efficacy of Virtual Reality as a Plat...
Virtual Fandomonium: Considerations and Efficacy of Virtual Reality as a Plat...Virtual Fandomonium: Considerations and Efficacy of Virtual Reality as a Plat...
Virtual Fandomonium: Considerations and Efficacy of Virtual Reality as a Plat...Tunisha Singleton, PhD
 
V Jornadas eMadrid sobre “Educación Digital”. Ángel Serrano, Universidad Comp...
V Jornadas eMadrid sobre “Educación Digital”. Ángel Serrano, Universidad Comp...V Jornadas eMadrid sobre “Educación Digital”. Ángel Serrano, Universidad Comp...
V Jornadas eMadrid sobre “Educación Digital”. Ángel Serrano, Universidad Comp...eMadrid network
 
Tammy Levy at GDC 2019: Nature vs. Nurture: Unpacking Player Spending in F2P ...
Tammy Levy at GDC 2019: Nature vs. Nurture: Unpacking Player Spending in F2P ...Tammy Levy at GDC 2019: Nature vs. Nurture: Unpacking Player Spending in F2P ...
Tammy Levy at GDC 2019: Nature vs. Nurture: Unpacking Player Spending in F2P ...Kongregate
 
GDC Talk - Nature vs Nurture: Unpacking Player Spending in F2P Games
GDC Talk - Nature vs Nurture: Unpacking Player Spending in F2P GamesGDC Talk - Nature vs Nurture: Unpacking Player Spending in F2P Games
GDC Talk - Nature vs Nurture: Unpacking Player Spending in F2P GamesTamara (Tammy) Levy
 
Understanding and improving games through machine learning - Natasha Latysheva
Understanding and improving games through machine learning - Natasha LatyshevaUnderstanding and improving games through machine learning - Natasha Latysheva
Understanding and improving games through machine learning - Natasha LatyshevaLauren Cormack
 
Understanding Players Who are Payers | Nick Berry
Understanding Players Who are Payers | Nick BerryUnderstanding Players Who are Payers | Nick Berry
Understanding Players Who are Payers | Nick BerryJessica Tams
 
Harrigan guelph presentation march 20 2013 v02
Harrigan guelph presentation march 20 2013 v02Harrigan guelph presentation march 20 2013 v02
Harrigan guelph presentation march 20 2013 v02HRForum
 
Harrigan guelph presentation march 20 2013 v02
Harrigan guelph presentation march 20 2013 v02Harrigan guelph presentation march 20 2013 v02
Harrigan guelph presentation march 20 2013 v02HRForum
 
Slot Yield Presentation 2015
Slot Yield Presentation 2015Slot Yield Presentation 2015
Slot Yield Presentation 2015richlehman
 
Better service monitoring through histograms
Better service monitoring through histogramsBetter service monitoring through histograms
Better service monitoring through histogramsFred Moyer
 
Identifying Harm Among Machine Players: Findings from a Multi-Component Resea...
Identifying Harm Among Machine Players: Findings from a Multi-Component Resea...Identifying Harm Among Machine Players: Findings from a Multi-Component Resea...
Identifying Harm Among Machine Players: Findings from a Multi-Component Resea...Horizons RG
 
AIT Indoor Gaming Center.
AIT Indoor Gaming Center.AIT Indoor Gaming Center.
AIT Indoor Gaming Center.Roman Shrestha
 
Linguistic Analysis of Toxic Behavior in an Online Video Game
Linguistic Analysis of Toxic Behavior in an Online Video GameLinguistic Analysis of Toxic Behavior in an Online Video Game
Linguistic Analysis of Toxic Behavior in an Online Video GameHaewoon Kwak
 
Core Games, Real Numbers: Asian & Western Games
Core Games, Real Numbers: Asian & Western GamesCore Games, Real Numbers: Asian & Western Games
Core Games, Real Numbers: Asian & Western Gamesemily_greer
 
Multiplayer Online Gaming
Multiplayer Online GamingMultiplayer Online Gaming
Multiplayer Online Gamingchetnamistry
 

Similar to Analysis on steam platform (19)

Study on Evaluation Function Design of Mahjong using Supervised Learning
Study on Evaluation Function Design of Mahjong using Supervised LearningStudy on Evaluation Function Design of Mahjong using Supervised Learning
Study on Evaluation Function Design of Mahjong using Supervised Learning
 
Anz Stadium Case Study - SeatShare Concept Test - CheckinLine
Anz Stadium Case Study - SeatShare Concept Test - CheckinLineAnz Stadium Case Study - SeatShare Concept Test - CheckinLine
Anz Stadium Case Study - SeatShare Concept Test - CheckinLine
 
Virtual Fandomonium: Considerations and Efficacy of Virtual Reality as a Plat...
Virtual Fandomonium: Considerations and Efficacy of Virtual Reality as a Plat...Virtual Fandomonium: Considerations and Efficacy of Virtual Reality as a Plat...
Virtual Fandomonium: Considerations and Efficacy of Virtual Reality as a Plat...
 
Ninento play box
Ninento play boxNinento play box
Ninento play box
 
V Jornadas eMadrid sobre “Educación Digital”. Ángel Serrano, Universidad Comp...
V Jornadas eMadrid sobre “Educación Digital”. Ángel Serrano, Universidad Comp...V Jornadas eMadrid sobre “Educación Digital”. Ángel Serrano, Universidad Comp...
V Jornadas eMadrid sobre “Educación Digital”. Ángel Serrano, Universidad Comp...
 
Tammy Levy at GDC 2019: Nature vs. Nurture: Unpacking Player Spending in F2P ...
Tammy Levy at GDC 2019: Nature vs. Nurture: Unpacking Player Spending in F2P ...Tammy Levy at GDC 2019: Nature vs. Nurture: Unpacking Player Spending in F2P ...
Tammy Levy at GDC 2019: Nature vs. Nurture: Unpacking Player Spending in F2P ...
 
GDC Talk - Nature vs Nurture: Unpacking Player Spending in F2P Games
GDC Talk - Nature vs Nurture: Unpacking Player Spending in F2P GamesGDC Talk - Nature vs Nurture: Unpacking Player Spending in F2P Games
GDC Talk - Nature vs Nurture: Unpacking Player Spending in F2P Games
 
Understanding and improving games through machine learning - Natasha Latysheva
Understanding and improving games through machine learning - Natasha LatyshevaUnderstanding and improving games through machine learning - Natasha Latysheva
Understanding and improving games through machine learning - Natasha Latysheva
 
Understanding Players Who are Payers | Nick Berry
Understanding Players Who are Payers | Nick BerryUnderstanding Players Who are Payers | Nick Berry
Understanding Players Who are Payers | Nick Berry
 
Harrigan guelph presentation march 20 2013 v02
Harrigan guelph presentation march 20 2013 v02Harrigan guelph presentation march 20 2013 v02
Harrigan guelph presentation march 20 2013 v02
 
Harrigan guelph presentation march 20 2013 v02
Harrigan guelph presentation march 20 2013 v02Harrigan guelph presentation march 20 2013 v02
Harrigan guelph presentation march 20 2013 v02
 
Slot Yield Presentation 2015
Slot Yield Presentation 2015Slot Yield Presentation 2015
Slot Yield Presentation 2015
 
Better service monitoring through histograms
Better service monitoring through histogramsBetter service monitoring through histograms
Better service monitoring through histograms
 
Identifying Harm Among Machine Players: Findings from a Multi-Component Resea...
Identifying Harm Among Machine Players: Findings from a Multi-Component Resea...Identifying Harm Among Machine Players: Findings from a Multi-Component Resea...
Identifying Harm Among Machine Players: Findings from a Multi-Component Resea...
 
AIT Indoor Gaming Center.
AIT Indoor Gaming Center.AIT Indoor Gaming Center.
AIT Indoor Gaming Center.
 
Linguistic Analysis of Toxic Behavior in an Online Video Game
Linguistic Analysis of Toxic Behavior in an Online Video GameLinguistic Analysis of Toxic Behavior in an Online Video Game
Linguistic Analysis of Toxic Behavior in an Online Video Game
 
Core Games, Real Numbers: Asian & Western Games
Core Games, Real Numbers: Asian & Western GamesCore Games, Real Numbers: Asian & Western Games
Core Games, Real Numbers: Asian & Western Games
 
Multiplayer Online Gaming
Multiplayer Online GamingMultiplayer Online Gaming
Multiplayer Online Gaming
 
How cloud is fueling growth for online gaming
How cloud is fueling growth for online gamingHow cloud is fueling growth for online gaming
How cloud is fueling growth for online gaming
 

More from Yousef Fadila

Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterYousef Fadila
 
Synergy on the Blockchain! whitepaper
Synergy on the Blockchain!  whitepaperSynergy on the Blockchain!  whitepaper
Synergy on the Blockchain! whitepaperYousef Fadila
 
Synergy Platform Whitepaper alpha
Synergy Platform Whitepaper alphaSynergy Platform Whitepaper alpha
Synergy Platform Whitepaper alphaYousef Fadila
 
Recommandation systems -
Recommandation systems - Recommandation systems -
Recommandation systems - Yousef Fadila
 
interactive voting based map matching algorithm
interactive voting based map matching algorithminteractive voting based map matching algorithm
interactive voting based map matching algorithmYousef Fadila
 
co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.Yousef Fadila
 
Spot deceptive TripAdvisor Reviews
Spot deceptive TripAdvisor ReviewsSpot deceptive TripAdvisor Reviews
Spot deceptive TripAdvisor ReviewsYousef Fadila
 
Textual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsTextual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsYousef Fadila
 
Anomaly Detection - Catch me if you can
Anomaly Detection - Catch me if you canAnomaly Detection - Catch me if you can
Anomaly Detection - Catch me if you canYousef Fadila
 
Tweeting for Hillary - DS 501 case study 1
Tweeting for Hillary - DS 501 case study 1Tweeting for Hillary - DS 501 case study 1
Tweeting for Hillary - DS 501 case study 1Yousef Fadila
 
CS 548 KNOWLEDGE DISCOVERY AND DATA MINING Project 1
CS 548 KNOWLEDGE DISCOVERY AND DATA MINING Project 1CS 548 KNOWLEDGE DISCOVERY AND DATA MINING Project 1
CS 548 KNOWLEDGE DISCOVERY AND DATA MINING Project 1Yousef Fadila
 
Innovative thinking التفكير الابداعي
Innovative thinking التفكير الابداعيInnovative thinking التفكير الابداعي
Innovative thinking التفكير الابداعيYousef Fadila
 
Am i overpaying - business proposal
Am i overpaying - business proposal Am i overpaying - business proposal
Am i overpaying - business proposal Yousef Fadila
 

More from Yousef Fadila (13)

Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity Calorimeter
 
Synergy on the Blockchain! whitepaper
Synergy on the Blockchain!  whitepaperSynergy on the Blockchain!  whitepaper
Synergy on the Blockchain! whitepaper
 
Synergy Platform Whitepaper alpha
Synergy Platform Whitepaper alphaSynergy Platform Whitepaper alpha
Synergy Platform Whitepaper alpha
 
Recommandation systems -
Recommandation systems - Recommandation systems -
Recommandation systems -
 
interactive voting based map matching algorithm
interactive voting based map matching algorithminteractive voting based map matching algorithm
interactive voting based map matching algorithm
 
co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.
 
Spot deceptive TripAdvisor Reviews
Spot deceptive TripAdvisor ReviewsSpot deceptive TripAdvisor Reviews
Spot deceptive TripAdvisor Reviews
 
Textual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsTextual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie Reviews
 
Anomaly Detection - Catch me if you can
Anomaly Detection - Catch me if you canAnomaly Detection - Catch me if you can
Anomaly Detection - Catch me if you can
 
Tweeting for Hillary - DS 501 case study 1
Tweeting for Hillary - DS 501 case study 1Tweeting for Hillary - DS 501 case study 1
Tweeting for Hillary - DS 501 case study 1
 
CS 548 KNOWLEDGE DISCOVERY AND DATA MINING Project 1
CS 548 KNOWLEDGE DISCOVERY AND DATA MINING Project 1CS 548 KNOWLEDGE DISCOVERY AND DATA MINING Project 1
CS 548 KNOWLEDGE DISCOVERY AND DATA MINING Project 1
 
Innovative thinking التفكير الابداعي
Innovative thinking التفكير الابداعيInnovative thinking التفكير الابداعي
Innovative thinking التفكير الابداعي
 
Am i overpaying - business proposal
Am i overpaying - business proposal Am i overpaying - business proposal
Am i overpaying - business proposal
 

Recently uploaded

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 

Recently uploaded (20)

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 

Analysis on steam platform

  • 1. Analysis on Steam Platform Team 3 Jianjun Luo Sushma Tayanna Wen Liu Yousef Fadila Zhanfeng Huang
  • 2. Which of these games is more affordable? 59.99$ 9.99$ 4.99$ ?
  • 3. Background: Global Video Games Market is HUGE!
  • 4. Why Analyzing Steam’s Data? ● Steam is the dominant platform serving at the de facto PC game distribution hub. ● Total games owned: 3 Billions! ● Real time statistics of Steam’s users and games shed lights on a market of billions.
  • 5. Guiding Questions 1)How many valid users and active users there are on Steam? 2)How much time do Steam’s users spend on Steam? 3)How much money do Steam’s users spend on Steam? 4)What is the Price–performance ratio (Avg. Cost Per Hour) of Steam's games?
  • 6. Sampling Method ● Steam ID is assigned a 64-bit unique ID in a sequential manner ● Steam ID space length is 462,322,592 ● Query Steam web API by a generated steam ID can get this user’s summary and game list ● Question: how to sample a certain amount of users for our analysis? Simple Random Sampling? Stratified Sampling? Stratified Sampling with Random Sampled Strata?
  • 7. Estimating Total Valid Users ● Let Yi = 1 denote the i th ID is a valid user (or active users, private users...) and Yi = 0 otherwise ● N is the total number of IDs in ID space, so the variable of interest is: Y1 ,...,YN ● Let n denote the number of samples, so the variable of interest is: y1 ,..., yn ● For Stratified Sampling, let L denote the number of strata and in our case and h is a specific stratum ● For Stratified Sampling with Random Sampled Strata, let B denote total number of strata we randomly choosed in L Est. of Simple Random Sampling Est. of Stratified Sampling Est. of Stratified Sampling with Random Sampled Strata where Actually in our case, the strata are equal length and number of samples in each stratum or chosen bucket is also equal...
  • 8. Variance! The variance of estimator of population total is not same Simple Random Sampling Stratified Sampling Stratified Sampling with Random Sampled Strata
  • 9. Number of Invalid Users Samples in Each Stratum Split 462,322,592 (N) into 1000 Strata (L) and 150 (nh) Samples in each stratum
  • 10. Stratified Sampling and Estimation of Valid users ● The insight of Stratified Sampling ○ may produce a smaller error of estimation than a simple random sample with same size ○ estimates of population parameters may be desired for subgroups of the population ● For total valid users ○ ID samples: 150,000 from 150 samples (nh) * 1000 strata (L) ○ ID space length (Nh) in each stratum: 462322 ○ Number of invalid users in samples: 17541 ○ Total invalid users: 54,064,004 ○ Total valid users: 408,258,588 ● When the stratum sample sizes are at least 30, we can use z to approximate t in t distribution ○ the 90% confidence interval is: 408,258,588 ± 559,047 ○ If samples are from uniformed sampling, it is: 408,258,588 ± 799,945 Same analysis can be performed on estimating active users, private users ...
  • 11. Computational Experiments ● Distribution of Game Price ● Distribution of Number of Games Purchased by Users ● Gamer Behavior ● Price Performance Ratio (Average Cost/Hour)
  • 12. Distribution of Game Price 48,213 Games in Steam 23.14% of them are free 85.23% of them are less than $10
  • 13. Distribution of Game Price 99.73% less than $80
  • 14. Number of Games Purchased by User most users purchase less than 25 games 45.61% of users have not purchased any games
  • 15. Game Behavior - Account Value 25,260 sample users 46% of them never paid for games → $0 99.45% of them are less than $3,000
  • 16. 11.09% of all games bought on Steam are never played 90.07% of them are played less than 10 hours Game Behavior - Average Play Time
  • 17. Cost/Hour for user: Is playing game expensive? No, for most users(98%), they spent 2.72$/hour on game playing. 7.09$/h Ticket only 2.72$/h 11.57$/h Ticket only Economical and NOT one time consumption.
  • 18. Hour/User for Game: “White Elephant” VS “Worth Buying” The average playing time per game owner: <1: Unsatisfying Games (36%) 85% <10$, Average price is 7.47$ >=1: Satisfying Games (64%) 96%<10$, Average price is 11.48$ Higher Price cannot guarantee quality, but satisfying games on average sells more expensive.
  • 19. Cost/Hour,User for Game Case Study: overall gameplay experience 16.489$/h 49.99$ 4 hours to finish Definitely not worth $50
  • 20. 0.836$/h 59.99$ Urge to play again Not a waste, even at full price. Cost/Hour,User for Game Case Study: overall gameplay experience
  • 21. Back To The Question: Which game is more affordable? Price could be misleading!
  • 22. Cost VS Cost-Benefit - Cost per unit of time 59.99$ 9.99$ 4.99$ 0.485$/h 1.463$/h 1.921$/h
  • 23. Conclusion ● Apply the stratified sampling on Steam ID space, and compare it with simple random sampling and stratified sampling with random sampled strata ● Get the unbiased estimation of total valid user number and the its confidence interval ● 46% users only play free games or discard the account and play no games. ● Playing computer games is economical comparing to other leisure activities. ● Satisfying game --->Higher Price ● Higher price-performance ratio ---> Better overall gameplay experience

Editor's Notes

  1. Let me start with this question, which of these games is more affordable? Any idea? what is missing to answer? The price here is the price (click) what do you think now? for now, let's put this question on hold, we will back to it at the end of the presentation
  2. In our project, we are going to research these question, which we think are important measure the size and impact of the video game industry
  3. We found out that not many users are willing to try multiple games.
  4. the total money spent on owned games of each account 46% 11,577