SlideShare a Scribd company logo
1 of 46
Download to read offline
Copyright	
  ©	
  2014	
  Splunk	
  Inc.	
  
Data
Onboarding
Ingestion without the
Indigestion
Jeff	
  Meyers	
  
Sales	
  Engineer	
  
•  Major	
  components	
  involved	
  in	
  data	
  indexing	
  
•  What	
  happens	
  to	
  data	
  within	
  Splunk	
  
•  What	
  the	
  data	
  pipeline	
  is	
  &	
  how	
  to	
  influence	
  it	
  
•  Shaping	
  data	
  understanding	
  via	
  props.conf	
  
•  Configuring	
  data	
  inputs	
  via	
  inputs.conf	
  
•  What	
  goes	
  where	
  
•  Heavy	
  Forwarders	
  vs.	
  Universal	
  Forwarders	
  
•  How	
  to	
  get	
  your	
  data	
  into	
  Splunk	
  (mostly	
  correctly)	
  
	
  
~	
  60	
  minutes	
  from	
  now...	
  
•  SystemaMc	
  way	
  to	
  bring	
  new	
  data	
  sources	
  into	
  Splunk	
  
•  Make	
  sure	
  that	
  new	
  data	
  is	
  instantly	
  usable	
  	
  
&	
  has	
  maximum	
  value	
  for	
  users	
  
•  Goes	
  hand-­‐in-­‐hand	
  with	
  the	
  User	
  Onboarding	
  process	
  
(sold	
  separately)	
  
	
  
What	
  is	
  the	
  Data	
  Onboarding	
  Process?	
  
4	
  
Machine Data > Business Value
Index	
  Untapped	
  Data:	
  Any	
  Source,	
  Type,	
  Volume	
  
Online	
  
Services	
   Web	
  
Services	
  
Servers	
  
Security	
   GPS	
  
LocaMon	
  
Storage	
  
Desktops	
  
Networks	
  
Packaged	
  
ApplicaMons	
  
Custom	
  
ApplicaMons	
  Messaging	
  
Telecoms	
  
Online	
  
Shopping	
  
Cart	
  
Web	
  
Clickstream
s	
  
Databases	
  
Energy	
  
Meters	
  
Call	
  Detail	
  
Records	
  
Smartphones	
  
and	
  Devices	
  
RFID	
  
On-­‐	
  
Premises	
  
Private	
  	
  
Cloud	
  
Public	
  	
  
Cloud	
  
	
  Ask	
  Any	
  QuesMon	
  
ApplicaMon	
  Delivery	
  
Security,	
  Compliance	
  and	
  
Fraud	
  
IT	
  OperaMons	
  
Business	
  AnalyMcs	
  
Industrial	
  Data	
  and	
  
the	
  Internet	
  of	
  Things	
  
Flavors of Machine Data
Order	
  Processing	
  
TwiRer	
  
Care	
  IVR	
  
Middleware	
  	
  
Error	
  
Getting Data Into Splunk
6
Agent	
  and	
  Agent-­‐less	
  Approach	
  for	
  Flexibility	
  
perf	
  
shell	
  
code	
  
Mounted	
  File	
  Systems	
  
hostnamemount	
  
syslog	
  
TCP/UDP	
  
WMI	
  
Event	
  Logs	
  Performance	
  
AcMve	
  	
  
Directory	
  
syslog	
  compaMble	
  hosts	
  
and	
  network	
  devices	
  
Unix,	
  Linux	
  and	
  Windows	
  hosts	
  
Windows	
  hosts	
   Custom	
  apps	
  and	
  scripted	
  API	
  connecMons	
  
Local	
  File	
  Monitoring	
  
log	
  files,	
  config	
  files	
  
dumps	
  and	
  trace	
  files	
  
Windows	
  Inputs	
  
Event	
  Logs	
  
performance	
  counters	
  
registry	
  monitoring	
  
AcAve	
  Directory	
  monitoring	
  
virtual	
  
host	
  
Windows	
  hosts	
  
Scripted	
  Inputs	
  
shell	
  scripts	
  custom	
  
parsers	
  batch	
  loading	
  
	
  
Agent-­‐less	
  Data	
  Input	
   Splunk	
  Forwarder	
  
Splunk	
  Data	
  Ingest	
  
UF	
   UF	
   HF	
   UF	
  
IDX	
  
SH	
  
Splunk	
  Enterprise	
  
	
  (with	
  opMonal	
  configs)	
  
Splunk	
  Universal	
  Forwarder	
  
Summary:	
  when	
  it	
  comes	
  to	
  "core"	
  
Splunk,	
  there	
  are	
  two	
  dis8nct	
  
products:	
  Splunk	
  Universal	
  
Forwarder	
  and	
  Splunk	
  Enterprise.	
  	
  
	
  
"Everything	
  else"	
  –	
  Indexer,	
  Search	
  
Head,	
  License	
  Server,	
  Deployment	
  
Server,	
  Cluster	
  Master,	
  Deployer,	
  
Heavy	
  Forwarder,	
  etc.	
  are	
  all	
  
instances	
  of	
  Splunk	
  Enterprise	
  with	
  
varying	
  configs.	
  
Data Pipeline
(what the what?)
The	
  Data	
  Pipeline	
  
The	
  Data	
  Pipeline	
  
Any	
  QuesMons?	
  
The	
  Data	
  Pipeline	
  
•  Input	
  Processors:	
  Monitor,	
  FIFO,	
  UDP,	
  TCP,	
  Scripted	
  	
  	
  	
  	
  	
  
•  No	
  events	
  yet-­‐-­‐	
  just	
  a	
  stream	
  of	
  bytes	
  
•  Break	
  data	
  stream	
  into	
  64KB	
  blocks	
  
•  Annotate	
  stream	
  with	
  metadata	
  keys	
  (host,	
  source,	
  
sourcetype,	
  index,	
  etc.)	
  
•  Can	
  happen	
  on	
  UF,	
  HF	
  or	
  indexer	
  
Inputs–	
  Where	
  it	
  all	
  starts	
  
• Check	
  character	
  set	
  
• Break	
  lines	
  
• Process	
  headers	
  
• Can	
  happen	
  on	
  HF	
  or	
  indexer	
  
Parsing	
  
•  Merge	
  lines	
  for	
  mulM-­‐line	
  events	
  
•  IdenMfy	
  events	
  (finally!)	
  
•  Extract	
  Mmestamps	
  
•  Exclude	
  events	
  based	
  on	
  Mmestamp	
  (MAX_DAYS_AGO,	
  ..)	
  
•  Can	
  happen	
  on	
  HF	
  or	
  indexer	
  
AggregaMon/Merging	
  
•  Do	
  regex	
  replacement	
  (field	
  extracMon,	
  punctuaMon	
  
extracMon,	
  event	
  rouMng,	
  host/source/sourcetype	
  
overrides)	
  
•  Annotate	
  events	
  with	
  metadata	
  keys	
  	
  
(host,	
  source,	
  sourcetype,	
  ..)	
  
•  Can	
  happen	
  on	
  HF	
  or	
  indexer	
  
Typing	
  
•  Output	
  processors:	
  TCP,	
  syslog,	
  HTTP	
  
•  indexAndForward	
  
•  Sign	
  blocks	
  
•  Calculate	
  license	
  volume	
  and	
  throughput	
  metrics	
  
•  Index	
  
•  [Write	
  to	
  disk	
  ]	
  /	
  [forward	
  elsewhere]	
  /	
  ...	
  
•  Can	
  happen	
  on	
  HF	
  or	
  indexer	
  
Indexing	
  
The	
  Data	
  Pipeline	
  
Data	
  Pipeline:	
  UF	
  &	
  Indexer	
  
Data	
  Pipeline:	
  HF	
  &	
  Indexer	
  
Data	
  Pipeline:	
  UF,	
  IF	
  &	
  Indexer	
  
UF	
  vs.	
  HF	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:16]	
  "GET	
  /oldlink?itemId=EST-­‐6&JSESSIONID=SD0SL6FF7AD...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:17]	
  "GET	
  /product.screen?productId=BS-­‐AG-­‐G09&JSESSION...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:19]	
  "POST	
  /category.screen?categoryId=STRATEGY&JSESSI...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:20]	
  "GET	
  /product.screen?productId=FS-­‐SG-­‐G03&JSESSION...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:20]	
  "POST	
  /cart.do?acMon=addtocart&itemId=EST-­‐21&pro...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:21]	
  "POST	
  /cart.do?acMon=purchase&itemId=EST-­‐21&JSES...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:22]	
  "POST	
  /cart/success.do?JSESSIONID=SD0SL6FF7ADFF49...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:21]	
  "GET	
  /cart.do?acMon=remove&itemId=EST-­‐11&product...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:22]	
  "GET	
  /oldlink?itemId=EST-­‐14&JSESSIONID=SD0SL6FF7A...	
  
112.111.162.4	
  -­‐	
  -­‐	
  [23/Feb/2016:18:26:36]	
  "GET	
  /product.screen?productId=WC-­‐SH-­‐G04&JSESSION...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:16]	
  "GET	
  /oldlink?itemId=EST-­‐6&JSESSIONID=SD0SL6FF7AD...	
  
209.160.24.63	
  -­‐	
  -­‐	
  [23/Feb/2016:18:22:17]	
  "GET	
  /product.screen?productId=BS-­‐AG-­‐G09&SSN=xxxyyyzzz...	
  
sourcetype=access_combined,	
  _8me=1456251739,	
  index=foo,	
  host=bar,	
  …	
  
sourcetype=access_combined,	
  _8me=1456251739,	
  index=foo,	
  host=bar,	
  …	
  
sourcetype=access_combined,	
  index=foo,	
  host=bar,	
  …	
  
UF	
  	
  
HF	
  	
  emits	
  events	
  
emits	
  chunks	
  of	
  
data	
  
Splunk	
  Data	
  Ingest	
  
UF	
   UF	
   HF	
   UF	
  
IDX	
  
SH	
  
Parsing	
  
Not	
  Parsing	
  
Note:	
  the	
  data	
  is	
  parsed	
  at	
  the	
  first	
  
component	
  that	
  has	
  a	
  parsing	
  
engine	
  –	
  and	
  not	
  again	
  
	
  
This	
  effects	
  where	
  you	
  put	
  certain	
  
props.conf	
  and	
  transforms.conf	
  
files	
  (a.k.a.	
  some8mes	
  they	
  go	
  on	
  
the	
  forwarder)	
  
Data Onboarding Process
(bringing it together)
•  IdenMfy	
  the	
  specific	
  sourcetype(s)	
  -­‐	
  onboard	
  each	
  separately	
  
•  Check	
  for	
  pre-­‐exisMng	
  app/TA	
  on	
  splunk.com-­‐-­‐	
  don't	
  reinvent	
  the	
  wheel!	
  
•  Gather	
  info	
  
•  Where	
  does	
  this	
  data	
  originate/reside?	
  	
  How	
  will	
  Splunk	
  collect	
  it?	
  
•  Which	
  users/groups	
  will	
  need	
  access	
  to	
  this	
  data?	
  	
  Access	
  controls?	
  
•  Determine	
  the	
  indexing	
  volume	
  and	
  data	
  retenMon	
  requirements	
  
•  Will	
  this	
  data	
  need	
  to	
  drive	
  exisMng	
  dashboards	
  (ES,	
  PCI,	
  etc.)?	
  
•  Who	
  is	
  the	
  SME	
  for	
  this	
  data?	
  
•  Map	
  it	
  out	
  
•  Get	
  a	
  "big	
  enough"	
  sample	
  of	
  the	
  event	
  data	
  
•  IdenMfy	
  and	
  map	
  out	
  fields	
  
•  Assign	
  sourcetype	
  and	
  TA	
  names	
  according	
  to	
  CIM	
  convenMons	
  
On-­‐boarding	
  Process	
  
•  Dev	
  
•  Create	
  (or	
  use)	
  an	
  app	
  
•  Props	
  /	
  inputs	
  definiMon	
  
•  Sourcetype	
  definiMon	
  
•  Use	
  data	
  import	
  wizard	
  
•  Import,	
  tweak,	
  repeat	
  
•  Oneshot	
  
•  [hook	
  up	
  monitor]	
  
On-­‐boarding	
  Process	
  
•  Prod	
  
•  Deploy	
  app	
  
•  Validate	
  
•  Monitor	
  
•  Test	
  
•  Deploy	
  app	
  
•  Oneshost	
  
•  Validate	
  
•  Hook	
  up	
  monitor	
  
•  Validate	
  	
  
1	
   2	
  
3	
  
•  General:	
  
•  Use	
  apps	
  for	
  configs	
  
•  Use	
  TAs	
  /	
  add-­‐ons	
  from	
  Splunk	
  if	
  possible	
  
•  Use	
  dev,	
  test,	
  prod	
  
•  Dev	
  can	
  be	
  laptop,	
  test	
  can	
  be	
  ephemeral	
  
•  UF	
  when	
  possible	
  
•  HF	
  only	
  if	
  filtering	
  /	
  transforming	
  is	
  required	
  in	
  foreign	
  land	
  
•  Unique	
  Sourcetype	
  per	
  event	
  stream	
  
•  Don't	
  send	
  data	
  through	
  Search	
  Heads	
  
•  Don't	
  send	
  data	
  direct	
  to	
  Indexers	
  
Good	
  Hygiene	
  
•  inputs.conf	
  
•  As	
  specific	
  as	
  possible	
  
•  Set	
  sourcetype,	
  if	
  possible	
  
•  Don't	
  let	
  splunk	
  auto-­‐sourcetype	
  (no	
  ...too_small)	
  
•  Specify	
  index	
  if	
  possible	
  
•  props.conf	
  
•  Set:	
  TIME_PREFIX,	
  TIME_FORMAT,	
  
MAX_TIMESTAMP_LOOKAHEAD	
  
•  OpMmally:	
  SHOULD_LINEMERGE	
  =	
  false,	
  LINE_BREAKER,	
  
TRUNCATE	
  
Good	
  Hygiene	
  
Data Onboarding Process
(details)
•  IdenMfy	
  the	
  specific	
  sourcetype(s)	
  -­‐	
  onboard	
  each	
  separately	
  
•  Check	
  for	
  pre-­‐exisMng	
  app/TA	
  on	
  splunk.com-­‐-­‐	
  don't	
  reinvent	
  the	
  wheel!	
  
•  Gather	
  info	
  
•  Where	
  does	
  this	
  data	
  originate/reside?	
  	
  How	
  will	
  Splunk	
  collect	
  it?	
  
•  Which	
  users/groups	
  will	
  need	
  access	
  to	
  this	
  data?	
  	
  Access	
  controls?	
  
•  Determine	
  the	
  indexing	
  volume	
  and	
  data	
  retenMon	
  requirements	
  
•  Will	
  this	
  data	
  need	
  to	
  drive	
  exisMng	
  dashboards	
  (ES,	
  PCI,	
  etc.)?	
  
•  Who	
  is	
  the	
  SME	
  for	
  this	
  data?	
  
•  Map	
  it	
  out	
  
•  Get	
  a	
  "big	
  enough"	
  sample	
  of	
  the	
  event	
  data	
  
•  IdenMfy	
  and	
  map	
  out	
  fields	
  
•  Assign	
  sourcetype	
  and	
  TA	
  names	
  according	
  to	
  CIM	
  convenMons	
  
Pre-­‐Board	
  
•  The	
  Common	
  InformaMon	
  Model	
  (CIM)	
  defines	
  
relaMonships	
  in	
  the	
  underlying	
  data,	
  while	
  leaving	
  the	
  raw	
  
machine	
  data	
  intact	
  
•  A	
  naming	
  convenMon	
  for	
  fields,	
  evensypes	
  &	
  tags	
  
•  More	
  advanced	
  reporMng	
  and	
  correlaMon	
  requires	
  that	
  the	
  
data	
  be	
  normalized,	
  categorized,	
  and	
  parsed	
  
•  CIM-­‐compliant	
  data	
  sources	
  can	
  drive	
  CIM-­‐based	
  
dashboards	
  (ES,	
  PCI,	
  others)	
  
Tangent:	
  What	
  is	
  the	
  CIM	
  and	
  why	
  should	
  I	
  care?	
  
•  IdenMfy	
  necessary	
  configs	
  (inputs,	
  props	
  and	
  transforms)	
  
to	
  properly	
  handle:	
  
•  Mmestamp	
  extracMon,	
  Mmezone,	
  event	
  breaking,	
  
sourcetype/host/source	
  assignments	
  
•  Do	
  events	
  contain	
  sensiMve	
  data	
  (i.e.,	
  PII,	
  PAN,	
  etc.)?	
  
Create	
  masking	
  transforms	
  if	
  necessary	
  
•  Package	
  all	
  index-­‐Mme	
  configs	
  into	
  the	
  TA	
  
Build	
  the	
  index-­‐Mme	
  configs	
  
•  Assign	
  sourcetype	
  according	
  to	
  event	
  format;	
  events	
  with	
  
similar	
  format	
  should	
  have	
  the	
  same	
  sourcetype	
  
•  When	
  do	
  I	
  need	
  a	
  separate	
  index?	
  
•  When	
  the	
  data	
  volume	
  will	
  be	
  very	
  large,	
  or	
  when	
  it	
  will	
  
be	
  searched	
  exclusively	
  a	
  lot	
  
•  When	
  access	
  to	
  the	
  data	
  needs	
  to	
  be	
  controlled	
  
•  When	
  the	
  data	
  requires	
  a	
  specific	
  data	
  retenMon	
  policy	
  
•  Resist	
  the	
  temptaMon	
  to	
  create	
  lots	
  of	
  indexes	
  
Tangent:	
  Best	
  &	
  Worst	
  PracMces	
  
•  Always	
  specify	
  a	
  sourcetype	
  and	
  index	
  
•  Be	
  as	
  specific	
  as	
  possible:	
  use	
  /var/log/fubar.log,	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  not	
  /var/log/	
  
•  Arrange	
  your	
  monitored	
  filesystems	
  to	
  minimize	
  
unnecessary	
  monitored	
  logfiles	
  
•  Use	
  a	
  scratch	
  index	
  while	
  tesMng	
  new	
  inputs	
  
Best	
  &	
  Worst	
  PracMces	
  –	
  [monitor]	
  
•  Lookout	
  for	
  inadvertent,	
  runaway	
  monitor	
  clauses	
  
•  Don’t	
  monitor	
  thousands	
  of	
  files	
  unnecessarily–	
  	
  
that’s	
  the	
  NSA’s	
  job	
  
•  From	
  the	
  CLI:	
  splunk	
  show	
  monitor	
  
•  From	
  your	
  browser:	
  hsps://your_splunkd:8089/
services/admin/inputstatus/TailingProcessor:FileStatus	
  
Best	
  &	
  Worst	
  PracMces	
  –	
  [monitor]	
  
•  Find	
  &	
  fix	
  index-­‐Mme	
  problems	
  BEFORE	
  polluMng	
  your	
  index	
  
•  A	
  try-­‐it-­‐before-­‐you-­‐fry-­‐it	
  interface	
  for	
  figuring	
  out	
  
•  Event	
  breaking	
  
•  Timestamp	
  recogniMon	
  
•  Timezone	
  assignment	
  
•  Provides	
  the	
  necessary	
  props.conf	
  parameter	
  sewngs	
  
Your	
  friend,	
  the	
  Data	
  Previewer	
  Another	
  
Tangent!	
  
Data Onboarding Process,
continued
•  IdenMfy	
  "interesMng"	
  events	
  which	
  should	
  be	
  tagged	
  with	
  an	
  exisMng	
  CIM	
  tag	
  (hsp://
docs.splunk.com/DocumentaMon/CIM/latest/User/Alerts)	
  
•  Get	
  a	
  list	
  of	
  all	
  current	
  tags:	
  |	
  rest	
  splunk_server=local	
  /services/admin/tags	
  |	
  rename	
  
tag_name	
  as	
  tag,	
  field_name_value	
  AS	
  definiMon,	
  eai:acl.app	
  AS	
  app	
  |	
  eval	
  
definiMon_and_app=definiMon	
  .	
  "	
  ("	
  .	
  app	
  .	
  ")"	
  |	
  stats	
  values(definiMon_and_app)	
  as	
  
"definiMons	
  (app)"	
  by	
  tag	
  |	
  sort	
  +tag	
  
•  Get	
  a	
  list	
  of	
  all	
  evensypes	
  (with	
  associated	
  tags):	
  |	
  rest	
  splunk_server=local	
  /services/
admin/evensypes	
  |	
  rename	
  Mtle	
  as	
  evensype,	
  search	
  AS	
  definiMon,	
  eai:acl.app	
  AS	
  app	
  |	
  
table	
  evensype	
  definiMon	
  app	
  tags	
  |	
  sort	
  +evensype	
  
•  Examine	
  the	
  current	
  list	
  of	
  CIM	
  tags.	
  	
  For	
  each	
  "interesMng"	
  event,	
  idenMfy	
  which	
  tags	
  
should	
  be	
  applied	
  to	
  each.	
  	
  A	
  parMcular	
  event	
  may	
  have	
  mulMple	
  tags.	
  
•  Are	
  there	
  new	
  tags	
  which	
  should	
  be	
  created,	
  beyond	
  those	
  in	
  the	
  current	
  CIM	
  tag	
  library?	
  	
  
If	
  so,	
  add	
  them	
  to	
  the	
  CIM	
  library	
  
Build	
  the	
  search-­‐Mme	
  configs:	
  evenRypes	
  &	
  tags	
  
•  Extract	
  "interesMng"	
  fields	
  
•  If	
  already	
  in	
  your	
  CIM	
  library,	
  name	
  or	
  alias	
  appropriately	
  
•  If	
  not	
  already	
  in	
  your	
  CIM	
  library,	
  name	
  according	
  to	
  CIM	
  
convenMons	
  
•  Add	
  lookups	
  for	
  missing/desirable	
  fields	
  
•  Lookups	
  may	
  be	
  required	
  to	
  supply	
  CIM-­‐compliant	
  fields/field	
  
values	
  (for	
  example,	
  to	
  convert	
  'sev=42'	
  to	
  'severity=medium'	
  
•  Make	
  the	
  values	
  more	
  readable	
  for	
  humans	
  
•  Put	
  everything	
  into	
  the	
  TA	
  package	
  
Build	
  the	
  search-­‐Mme	
  configs:	
  extracMons	
  &	
  lookups	
  
•  Create	
  data	
  models.	
  	
  What	
  will	
  be	
  interesMng	
  for	
  end	
  users?	
  
•  Document!	
  	
  (Especially	
  the	
  fields,	
  evensypes	
  &	
  tags)	
  
•  Test	
  
•  Does	
  this	
  data	
  drive	
  relevant	
  exisMng	
  dashboards	
  correctly?	
  
•  Do	
  the	
  data	
  models	
  work	
  properly	
  /	
  produce	
  correct	
  results?	
  
•  Is	
  the	
  TA	
  packaged	
  properly?	
  
•  Check	
  with	
  originaMng	
  user/group;	
  is	
  it	
  OK?	
  
Keep	
  Going	
  
•  Determine	
  addiMonal	
  Splunk	
  infrastructure	
  required;	
  can	
  
exisMng	
  infrastructure	
  &	
  license	
  support	
  this?	
  	
  
•  Will	
  new	
  forwarders	
  be	
  required?	
  	
  If	
  so,	
  iniMate	
  CR	
  process(es)	
  
•  Will	
  firewall	
  changes	
  be	
  required?	
  	
  If	
  so,	
  iniMate	
  CR	
  process(es)	
  
•  Will	
  new	
  Splunk	
  roles	
  be	
  required?	
  	
  Create	
  &	
  map	
  to	
  AD	
  roles	
  
•  Will	
  new	
  app	
  contexts	
  be	
  required?	
  	
  Create	
  app(s)	
  as	
  necessary	
  
•  Will	
  new	
  users	
  be	
  added?	
  	
  Create	
  the	
  accounts	
  
Get	
  ready	
  to	
  deploy	
  
•  Deploy	
  new	
  search	
  heads	
  &	
  indexers	
  as	
  needed	
  
•  Install	
  new	
  forwarders	
  as	
  needed	
  
•  Deploy	
  new	
  app	
  &	
  TA	
  to	
  search	
  heads	
  &	
  indexers	
  
•  Deploy	
  new	
  TA	
  to	
  relevant	
  forwarders	
  
Bring	
  it!	
  
•  All	
  sources	
  reporMng?	
  
•  Event	
  breaking,	
  Mmestamp,	
  Mmezone,	
  host,	
  source,	
  
sourcetype?	
  
•  Field	
  extracMons,	
  aliases,	
  lookups?	
  
•  Evensypes,	
  tags?	
  
•  Data	
  model(s)?	
  
•  User	
  access?	
  
•  Confirm	
  with	
  original	
  requesMng	
  user/group:	
  looks	
  OK?	
  
Test	
  &	
  Validate	
  
Done!
•  Bring	
  new	
  data	
  sources	
  in	
  correctly	
  the	
  first	
  Mme	
  
•  Reduce	
  the	
  amount	
  of	
  “bad”	
  data	
  in	
  your	
  indexes–	
  and	
  
the	
  Mme	
  spent	
  dealing	
  with	
  it	
  
•  Make	
  the	
  new	
  data	
  immediately	
  useful	
  to	
  ALL	
  users–	
  not	
  
just	
  the	
  ones	
  who	
  originally	
  requested	
  it	
  
•  Allow	
  the	
  data	
  to	
  drive	
  all	
  sorts	
  of	
  dashboards	
  without	
  
extra	
  modificaMons	
  
Gee,	
  this	
  seems	
  like	
  a	
  lot	
  of	
  work…	
  
•  What	
  splunk	
  can	
  monitor:	
  
•  hsp://docs.splunk.com/DocumentaMon/Splunk/latest/Data/WhatSplunkcanmonitor	
  
•  How	
  data	
  moves	
  through	
  splunk:	
  
•  hsp://docs.splunk.com/DocumentaMon/Splunk/latest/Deploy/Datapipeline	
  
•  Components	
  of	
  the	
  data	
  pipeline:	
  
•  hsp://docs.splunk.com/DocumentaMon/Splunk/latest/Deploy/Componentsofadistributedenvironment	
  
•  Common	
  informaMon	
  model	
  app:	
  
•  hsps://splunkbase.splunk.com/app/1621	
  
•  Common	
  informaMon	
  model	
  docs:	
  
•  hsp://docs.splunk.com/DocumentaMon/CIM/latest/User/Overview	
  
•  Where	
  do	
  I	
  put	
  configs:	
  
•  hsp://wiki.splunk.com/Where_do_I_configure_my_Splunk_sewngs	
  	
  
Reference	
  
Copyright	
  ©	
  2014	
  Splunk	
  Inc.	
  
Thank You!
Jeff Meyers
jam@splunk.com

More Related Content

What's hot

Splunk 101
Splunk 101Splunk 101
Splunk 101Splunk
 
Splunk Enterprise Security
Splunk Enterprise SecuritySplunk Enterprise Security
Splunk Enterprise SecuritySplunk
 
Data Onboarding
Data Onboarding Data Onboarding
Data Onboarding Splunk
 
Where is my MQ message on z/OS?
Where is my MQ message on z/OS?Where is my MQ message on z/OS?
Where is my MQ message on z/OS?Matt Leming
 
Getting started with Splunk
Getting started with SplunkGetting started with Splunk
Getting started with SplunkSplunk
 
How Splunk connects Salesforce
How Splunk connects SalesforceHow Splunk connects Salesforce
How Splunk connects SalesforceMuleSoft
 
Splunk Search Optimization
Splunk Search OptimizationSplunk Search Optimization
Splunk Search OptimizationSplunk
 
Splunk Architecture overview
Splunk Architecture overviewSplunk Architecture overview
Splunk Architecture overviewAlex Fok
 
Introduction to AMQP Messaging with RabbitMQ
Introduction to AMQP Messaging with RabbitMQIntroduction to AMQP Messaging with RabbitMQ
Introduction to AMQP Messaging with RabbitMQDmitriy Samovskiy
 
Deploying Confluent Platform for Production
Deploying Confluent Platform for ProductionDeploying Confluent Platform for Production
Deploying Confluent Platform for Productionconfluent
 
Getting Started with Splunk (Hands-On)
Getting Started with Splunk (Hands-On) Getting Started with Splunk (Hands-On)
Getting Started with Splunk (Hands-On) Splunk
 
Building Business Service Intelligence with ITSI
Building Business Service Intelligence with ITSIBuilding Business Service Intelligence with ITSI
Building Business Service Intelligence with ITSISplunk
 
SplunkLive 2011 Advanced Session
SplunkLive 2011 Advanced SessionSplunkLive 2011 Advanced Session
SplunkLive 2011 Advanced SessionSplunk
 
Splunk Data Onboarding Overview - Splunk Data Collection Architecture
Splunk Data Onboarding Overview - Splunk Data Collection ArchitectureSplunk Data Onboarding Overview - Splunk Data Collection Architecture
Splunk Data Onboarding Overview - Splunk Data Collection ArchitectureSplunk
 
Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...
Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...
Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...Kai Wähner
 
Splunk conf2014 - Onboarding Data Into Splunk
Splunk conf2014 - Onboarding Data Into SplunkSplunk conf2014 - Onboarding Data Into Splunk
Splunk conf2014 - Onboarding Data Into SplunkSplunk
 
Splunk for Enterprise Security featuring User Behavior Analytics
Splunk for Enterprise Security featuring User Behavior AnalyticsSplunk for Enterprise Security featuring User Behavior Analytics
Splunk for Enterprise Security featuring User Behavior AnalyticsSplunk
 

What's hot (20)

Splunk-Presentation
Splunk-Presentation Splunk-Presentation
Splunk-Presentation
 
Splunk 101
Splunk 101Splunk 101
Splunk 101
 
Splunk Enterprise Security
Splunk Enterprise SecuritySplunk Enterprise Security
Splunk Enterprise Security
 
Data Onboarding
Data Onboarding Data Onboarding
Data Onboarding
 
Where is my MQ message on z/OS?
Where is my MQ message on z/OS?Where is my MQ message on z/OS?
Where is my MQ message on z/OS?
 
Apache Kafka
Apache KafkaApache Kafka
Apache Kafka
 
Getting started with Splunk
Getting started with SplunkGetting started with Splunk
Getting started with Splunk
 
How Splunk connects Salesforce
How Splunk connects SalesforceHow Splunk connects Salesforce
How Splunk connects Salesforce
 
Splunk Search Optimization
Splunk Search OptimizationSplunk Search Optimization
Splunk Search Optimization
 
Splunk Architecture overview
Splunk Architecture overviewSplunk Architecture overview
Splunk Architecture overview
 
Splunk overview
Splunk overviewSplunk overview
Splunk overview
 
Introduction to AMQP Messaging with RabbitMQ
Introduction to AMQP Messaging with RabbitMQIntroduction to AMQP Messaging with RabbitMQ
Introduction to AMQP Messaging with RabbitMQ
 
Deploying Confluent Platform for Production
Deploying Confluent Platform for ProductionDeploying Confluent Platform for Production
Deploying Confluent Platform for Production
 
Getting Started with Splunk (Hands-On)
Getting Started with Splunk (Hands-On) Getting Started with Splunk (Hands-On)
Getting Started with Splunk (Hands-On)
 
Building Business Service Intelligence with ITSI
Building Business Service Intelligence with ITSIBuilding Business Service Intelligence with ITSI
Building Business Service Intelligence with ITSI
 
SplunkLive 2011 Advanced Session
SplunkLive 2011 Advanced SessionSplunkLive 2011 Advanced Session
SplunkLive 2011 Advanced Session
 
Splunk Data Onboarding Overview - Splunk Data Collection Architecture
Splunk Data Onboarding Overview - Splunk Data Collection ArchitectureSplunk Data Onboarding Overview - Splunk Data Collection Architecture
Splunk Data Onboarding Overview - Splunk Data Collection Architecture
 
Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...
Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...
Event-Driven Stream Processing and Model Deployment with Apache Kafka, Kafka ...
 
Splunk conf2014 - Onboarding Data Into Splunk
Splunk conf2014 - Onboarding Data Into SplunkSplunk conf2014 - Onboarding Data Into Splunk
Splunk conf2014 - Onboarding Data Into Splunk
 
Splunk for Enterprise Security featuring User Behavior Analytics
Splunk for Enterprise Security featuring User Behavior AnalyticsSplunk for Enterprise Security featuring User Behavior Analytics
Splunk for Enterprise Security featuring User Behavior Analytics
 

Viewers also liked

Customer Presentation
Customer PresentationCustomer Presentation
Customer PresentationSplunk
 
Cisco UCS and Splunk Workshop
Cisco UCS and Splunk WorkshopCisco UCS and Splunk Workshop
Cisco UCS and Splunk WorkshopRobb Boyd
 
SplunkLive! Zürich 2016 - Use Case Swisscom
SplunkLive! Zürich 2016 - Use Case SwisscomSplunkLive! Zürich 2016 - Use Case Swisscom
SplunkLive! Zürich 2016 - Use Case SwisscomSplunk
 
Taking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – ArchitectureTaking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – ArchitectureSplunk
 
Conf2014_SplunkSearchOptimization
Conf2014_SplunkSearchOptimizationConf2014_SplunkSearchOptimization
Conf2014_SplunkSearchOptimizationSplunk
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionSplunk
 
Splunk for DevOps - Faster Insights - Better Code
Splunk for DevOps - Faster Insights - Better CodeSplunk for DevOps - Faster Insights - Better Code
Splunk for DevOps - Faster Insights - Better CodePhilipp Drieger
 
Hunk - Unlocking the Power of Big Data
Hunk - Unlocking the Power of Big DataHunk - Unlocking the Power of Big Data
Hunk - Unlocking the Power of Big DataSplunk
 
Customer Presentation
Customer PresentationCustomer Presentation
Customer PresentationSplunk
 
dlux - Splunk Technical Overview
dlux - Splunk Technical Overviewdlux - Splunk Technical Overview
dlux - Splunk Technical OverviewDavid Lutz
 
dlux splunk>live! 2012 Beginners Session
dlux splunk>live! 2012 Beginners Sessiondlux splunk>live! 2012 Beginners Session
dlux splunk>live! 2012 Beginners SessionDavid Lutz
 
Splunk Overview
Splunk OverviewSplunk Overview
Splunk OverviewSplunk
 
Deploying Splunk. Arquitetura e dimensionamento do Splunk
Deploying Splunk. Arquitetura e dimensionamento do SplunkDeploying Splunk. Arquitetura e dimensionamento do Splunk
Deploying Splunk. Arquitetura e dimensionamento do SplunkSplunk
 

Viewers also liked (14)

Customer Presentation
Customer PresentationCustomer Presentation
Customer Presentation
 
Exploring Splunk
Exploring SplunkExploring Splunk
Exploring Splunk
 
Cisco UCS and Splunk Workshop
Cisco UCS and Splunk WorkshopCisco UCS and Splunk Workshop
Cisco UCS and Splunk Workshop
 
SplunkLive! Zürich 2016 - Use Case Swisscom
SplunkLive! Zürich 2016 - Use Case SwisscomSplunkLive! Zürich 2016 - Use Case Swisscom
SplunkLive! Zürich 2016 - Use Case Swisscom
 
Taking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – ArchitectureTaking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – Architecture
 
Conf2014_SplunkSearchOptimization
Conf2014_SplunkSearchOptimizationConf2014_SplunkSearchOptimization
Conf2014_SplunkSearchOptimization
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Splunk for DevOps - Faster Insights - Better Code
Splunk for DevOps - Faster Insights - Better CodeSplunk for DevOps - Faster Insights - Better Code
Splunk for DevOps - Faster Insights - Better Code
 
Hunk - Unlocking the Power of Big Data
Hunk - Unlocking the Power of Big DataHunk - Unlocking the Power of Big Data
Hunk - Unlocking the Power of Big Data
 
Customer Presentation
Customer PresentationCustomer Presentation
Customer Presentation
 
dlux - Splunk Technical Overview
dlux - Splunk Technical Overviewdlux - Splunk Technical Overview
dlux - Splunk Technical Overview
 
dlux splunk>live! 2012 Beginners Session
dlux splunk>live! 2012 Beginners Sessiondlux splunk>live! 2012 Beginners Session
dlux splunk>live! 2012 Beginners Session
 
Splunk Overview
Splunk OverviewSplunk Overview
Splunk Overview
 
Deploying Splunk. Arquitetura e dimensionamento do Splunk
Deploying Splunk. Arquitetura e dimensionamento do SplunkDeploying Splunk. Arquitetura e dimensionamento do Splunk
Deploying Splunk. Arquitetura e dimensionamento do Splunk
 

Similar to Data Onboarding

SplunkLive! Frankfurt 2018 - Data Onboarding Overview
SplunkLive! Frankfurt 2018 - Data Onboarding OverviewSplunkLive! Frankfurt 2018 - Data Onboarding Overview
SplunkLive! Frankfurt 2018 - Data Onboarding OverviewSplunk
 
Getting Started with Splunk Enterprise Hands-On
Getting Started with Splunk Enterprise Hands-OnGetting Started with Splunk Enterprise Hands-On
Getting Started with Splunk Enterprise Hands-OnSplunk
 
SplunkLive! Munich 2018: Data Onboarding Overview
SplunkLive! Munich 2018: Data Onboarding OverviewSplunkLive! Munich 2018: Data Onboarding Overview
SplunkLive! Munich 2018: Data Onboarding OverviewSplunk
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseSplunk
 
Data Onboarding Breakout Session
Data Onboarding Breakout SessionData Onboarding Breakout Session
Data Onboarding Breakout SessionSplunk
 
MacSysAdmin Conference 2019 - Logging
MacSysAdmin Conference 2019 - Logging MacSysAdmin Conference 2019 - Logging
MacSysAdmin Conference 2019 - Logging Henry Stamerjohann
 
Splunk Discovery: Warsaw 2018 - Getting Data In
Splunk Discovery: Warsaw 2018 - Getting Data InSplunk Discovery: Warsaw 2018 - Getting Data In
Splunk Discovery: Warsaw 2018 - Getting Data InSplunk
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionSplunk
 
Machine Data Is EVERYWHERE: Use It for Testing
Machine Data Is EVERYWHERE: Use It for TestingMachine Data Is EVERYWHERE: Use It for Testing
Machine Data Is EVERYWHERE: Use It for TestingTechWell
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseSplunk
 
Splunk workshop-Machine Data 101
Splunk workshop-Machine Data 101Splunk workshop-Machine Data 101
Splunk workshop-Machine Data 101Splunk
 
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...Splunk
 
Machine Data 101 Workshop
Machine Data 101 Workshop Machine Data 101 Workshop
Machine Data 101 Workshop Splunk
 
Machine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into InsightMachine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into InsightSplunk
 
Machine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into InsightMachine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into InsightSplunk
 
Getting started with Splunk Breakout Session
Getting started with Splunk Breakout SessionGetting started with Splunk Breakout Session
Getting started with Splunk Breakout SessionSplunk
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseSplunk
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseShannon Cuthbertson
 
Splunk forwarders tech_brief
Splunk forwarders tech_briefSplunk forwarders tech_brief
Splunk forwarders tech_briefGreg Hanchin
 
SplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
SplunkLive! Amsterdam 2015 Breakout - Getting Started with SplunkSplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
SplunkLive! Amsterdam 2015 Breakout - Getting Started with SplunkSplunk
 

Similar to Data Onboarding (20)

SplunkLive! Frankfurt 2018 - Data Onboarding Overview
SplunkLive! Frankfurt 2018 - Data Onboarding OverviewSplunkLive! Frankfurt 2018 - Data Onboarding Overview
SplunkLive! Frankfurt 2018 - Data Onboarding Overview
 
Getting Started with Splunk Enterprise Hands-On
Getting Started with Splunk Enterprise Hands-OnGetting Started with Splunk Enterprise Hands-On
Getting Started with Splunk Enterprise Hands-On
 
SplunkLive! Munich 2018: Data Onboarding Overview
SplunkLive! Munich 2018: Data Onboarding OverviewSplunkLive! Munich 2018: Data Onboarding Overview
SplunkLive! Munich 2018: Data Onboarding Overview
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
 
Data Onboarding Breakout Session
Data Onboarding Breakout SessionData Onboarding Breakout Session
Data Onboarding Breakout Session
 
MacSysAdmin Conference 2019 - Logging
MacSysAdmin Conference 2019 - Logging MacSysAdmin Conference 2019 - Logging
MacSysAdmin Conference 2019 - Logging
 
Splunk Discovery: Warsaw 2018 - Getting Data In
Splunk Discovery: Warsaw 2018 - Getting Data InSplunk Discovery: Warsaw 2018 - Getting Data In
Splunk Discovery: Warsaw 2018 - Getting Data In
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Machine Data Is EVERYWHERE: Use It for Testing
Machine Data Is EVERYWHERE: Use It for TestingMachine Data Is EVERYWHERE: Use It for Testing
Machine Data Is EVERYWHERE: Use It for Testing
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
 
Splunk workshop-Machine Data 101
Splunk workshop-Machine Data 101Splunk workshop-Machine Data 101
Splunk workshop-Machine Data 101
 
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
Elevate your Splunk Deployment by Better Understanding your Value Breakfast S...
 
Machine Data 101 Workshop
Machine Data 101 Workshop Machine Data 101 Workshop
Machine Data 101 Workshop
 
Machine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into InsightMachine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into Insight
 
Machine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into InsightMachine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into Insight
 
Getting started with Splunk Breakout Session
Getting started with Splunk Breakout SessionGetting started with Splunk Breakout Session
Getting started with Splunk Breakout Session
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
 
Getting Started with Splunk Enterprise
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
 
Splunk forwarders tech_brief
Splunk forwarders tech_briefSplunk forwarders tech_brief
Splunk forwarders tech_brief
 
SplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
SplunkLive! Amsterdam 2015 Breakout - Getting Started with SplunkSplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
SplunkLive! Amsterdam 2015 Breakout - Getting Started with Splunk
 

More from Splunk

.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routineSplunk
 
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTVSplunk
 
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica).conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica)Splunk
 
.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank International.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank InternationalSplunk
 
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett .conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett Splunk
 
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär).conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)Splunk
 
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu....conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...Splunk
 
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever....conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...Splunk
 
.conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex).conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex)Splunk
 
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)Splunk
 
Splunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11ySplunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11ySplunk
 
Splunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go KölnSplunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go KölnSplunk
 
Splunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go KölnSplunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go KölnSplunk
 
Data foundations building success, at city scale – Imperial College London
 Data foundations building success, at city scale – Imperial College London Data foundations building success, at city scale – Imperial College London
Data foundations building success, at city scale – Imperial College LondonSplunk
 
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...Splunk
 
SOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security WebinarSOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security WebinarSplunk
 
.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session.conf Go 2022 - Observability Session
.conf Go 2022 - Observability SessionSplunk
 
.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - Keynote.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - KeynoteSplunk
 
.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform Session.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform SessionSplunk
 
.conf Go Zurich 2022 - Security Session
.conf Go Zurich 2022 - Security Session.conf Go Zurich 2022 - Security Session
.conf Go Zurich 2022 - Security SessionSplunk
 

More from Splunk (20)

.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine
 
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
 
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica).conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
 
.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank International.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank International
 
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett .conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
 
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär).conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
 
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu....conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
 
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever....conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
 
.conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex).conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex)
 
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
 
Splunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11ySplunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11y
 
Splunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go KölnSplunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go Köln
 
Splunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go KölnSplunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go Köln
 
Data foundations building success, at city scale – Imperial College London
 Data foundations building success, at city scale – Imperial College London Data foundations building success, at city scale – Imperial College London
Data foundations building success, at city scale – Imperial College London
 
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
 
SOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security WebinarSOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security Webinar
 
.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session
 
.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - Keynote.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - Keynote
 
.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform Session.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform Session
 
.conf Go Zurich 2022 - Security Session
.conf Go Zurich 2022 - Security Session.conf Go Zurich 2022 - Security Session
.conf Go Zurich 2022 - Security Session
 

Recently uploaded

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Recently uploaded (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

Data Onboarding

  • 1. Copyright  ©  2014  Splunk  Inc.   Data Onboarding Ingestion without the Indigestion Jeff  Meyers   Sales  Engineer  
  • 2. •  Major  components  involved  in  data  indexing   •  What  happens  to  data  within  Splunk   •  What  the  data  pipeline  is  &  how  to  influence  it   •  Shaping  data  understanding  via  props.conf   •  Configuring  data  inputs  via  inputs.conf   •  What  goes  where   •  Heavy  Forwarders  vs.  Universal  Forwarders   •  How  to  get  your  data  into  Splunk  (mostly  correctly)     ~  60  minutes  from  now...  
  • 3. •  SystemaMc  way  to  bring  new  data  sources  into  Splunk   •  Make  sure  that  new  data  is  instantly  usable     &  has  maximum  value  for  users   •  Goes  hand-­‐in-­‐hand  with  the  User  Onboarding  process   (sold  separately)     What  is  the  Data  Onboarding  Process?  
  • 4. 4   Machine Data > Business Value Index  Untapped  Data:  Any  Source,  Type,  Volume   Online   Services   Web   Services   Servers   Security   GPS   LocaMon   Storage   Desktops   Networks   Packaged   ApplicaMons   Custom   ApplicaMons  Messaging   Telecoms   Online   Shopping   Cart   Web   Clickstream s   Databases   Energy   Meters   Call  Detail   Records   Smartphones   and  Devices   RFID   On-­‐   Premises   Private     Cloud   Public     Cloud    Ask  Any  QuesMon   ApplicaMon  Delivery   Security,  Compliance  and   Fraud   IT  OperaMons   Business  AnalyMcs   Industrial  Data  and   the  Internet  of  Things  
  • 5. Flavors of Machine Data Order  Processing   TwiRer   Care  IVR   Middleware     Error  
  • 6. Getting Data Into Splunk 6 Agent  and  Agent-­‐less  Approach  for  Flexibility   perf   shell   code   Mounted  File  Systems   hostnamemount   syslog   TCP/UDP   WMI   Event  Logs  Performance   AcMve     Directory   syslog  compaMble  hosts   and  network  devices   Unix,  Linux  and  Windows  hosts   Windows  hosts   Custom  apps  and  scripted  API  connecMons   Local  File  Monitoring   log  files,  config  files   dumps  and  trace  files   Windows  Inputs   Event  Logs   performance  counters   registry  monitoring   AcAve  Directory  monitoring   virtual   host   Windows  hosts   Scripted  Inputs   shell  scripts  custom   parsers  batch  loading     Agent-­‐less  Data  Input   Splunk  Forwarder  
  • 7. Splunk  Data  Ingest   UF   UF   HF   UF   IDX   SH   Splunk  Enterprise    (with  opMonal  configs)   Splunk  Universal  Forwarder   Summary:  when  it  comes  to  "core"   Splunk,  there  are  two  dis8nct   products:  Splunk  Universal   Forwarder  and  Splunk  Enterprise.       "Everything  else"  –  Indexer,  Search   Head,  License  Server,  Deployment   Server,  Cluster  Master,  Deployer,   Heavy  Forwarder,  etc.  are  all   instances  of  Splunk  Enterprise  with   varying  configs.  
  • 10. The  Data  Pipeline   Any  QuesMons?  
  • 12. •  Input  Processors:  Monitor,  FIFO,  UDP,  TCP,  Scripted             •  No  events  yet-­‐-­‐  just  a  stream  of  bytes   •  Break  data  stream  into  64KB  blocks   •  Annotate  stream  with  metadata  keys  (host,  source,   sourcetype,  index,  etc.)   •  Can  happen  on  UF,  HF  or  indexer   Inputs–  Where  it  all  starts  
  • 13. • Check  character  set   • Break  lines   • Process  headers   • Can  happen  on  HF  or  indexer   Parsing  
  • 14. •  Merge  lines  for  mulM-­‐line  events   •  IdenMfy  events  (finally!)   •  Extract  Mmestamps   •  Exclude  events  based  on  Mmestamp  (MAX_DAYS_AGO,  ..)   •  Can  happen  on  HF  or  indexer   AggregaMon/Merging  
  • 15. •  Do  regex  replacement  (field  extracMon,  punctuaMon   extracMon,  event  rouMng,  host/source/sourcetype   overrides)   •  Annotate  events  with  metadata  keys     (host,  source,  sourcetype,  ..)   •  Can  happen  on  HF  or  indexer   Typing  
  • 16. •  Output  processors:  TCP,  syslog,  HTTP   •  indexAndForward   •  Sign  blocks   •  Calculate  license  volume  and  throughput  metrics   •  Index   •  [Write  to  disk  ]  /  [forward  elsewhere]  /  ...   •  Can  happen  on  HF  or  indexer   Indexing  
  • 18. Data  Pipeline:  UF  &  Indexer  
  • 19. Data  Pipeline:  HF  &  Indexer  
  • 20. Data  Pipeline:  UF,  IF  &  Indexer  
  • 21. UF  vs.  HF   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:16]  "GET  /oldlink?itemId=EST-­‐6&JSESSIONID=SD0SL6FF7AD...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:17]  "GET  /product.screen?productId=BS-­‐AG-­‐G09&JSESSION...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:19]  "POST  /category.screen?categoryId=STRATEGY&JSESSI...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:20]  "GET  /product.screen?productId=FS-­‐SG-­‐G03&JSESSION...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:20]  "POST  /cart.do?acMon=addtocart&itemId=EST-­‐21&pro...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:21]  "POST  /cart.do?acMon=purchase&itemId=EST-­‐21&JSES...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:22]  "POST  /cart/success.do?JSESSIONID=SD0SL6FF7ADFF49...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:21]  "GET  /cart.do?acMon=remove&itemId=EST-­‐11&product...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:22]  "GET  /oldlink?itemId=EST-­‐14&JSESSIONID=SD0SL6FF7A...   112.111.162.4  -­‐  -­‐  [23/Feb/2016:18:26:36]  "GET  /product.screen?productId=WC-­‐SH-­‐G04&JSESSION...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:16]  "GET  /oldlink?itemId=EST-­‐6&JSESSIONID=SD0SL6FF7AD...   209.160.24.63  -­‐  -­‐  [23/Feb/2016:18:22:17]  "GET  /product.screen?productId=BS-­‐AG-­‐G09&SSN=xxxyyyzzz...   sourcetype=access_combined,  _8me=1456251739,  index=foo,  host=bar,  …   sourcetype=access_combined,  _8me=1456251739,  index=foo,  host=bar,  …   sourcetype=access_combined,  index=foo,  host=bar,  …   UF     HF    emits  events   emits  chunks  of   data  
  • 22. Splunk  Data  Ingest   UF   UF   HF   UF   IDX   SH   Parsing   Not  Parsing   Note:  the  data  is  parsed  at  the  first   component  that  has  a  parsing   engine  –  and  not  again     This  effects  where  you  put  certain   props.conf  and  transforms.conf   files  (a.k.a.  some8mes  they  go  on   the  forwarder)  
  • 24. •  IdenMfy  the  specific  sourcetype(s)  -­‐  onboard  each  separately   •  Check  for  pre-­‐exisMng  app/TA  on  splunk.com-­‐-­‐  don't  reinvent  the  wheel!   •  Gather  info   •  Where  does  this  data  originate/reside?    How  will  Splunk  collect  it?   •  Which  users/groups  will  need  access  to  this  data?    Access  controls?   •  Determine  the  indexing  volume  and  data  retenMon  requirements   •  Will  this  data  need  to  drive  exisMng  dashboards  (ES,  PCI,  etc.)?   •  Who  is  the  SME  for  this  data?   •  Map  it  out   •  Get  a  "big  enough"  sample  of  the  event  data   •  IdenMfy  and  map  out  fields   •  Assign  sourcetype  and  TA  names  according  to  CIM  convenMons   On-­‐boarding  Process  
  • 25. •  Dev   •  Create  (or  use)  an  app   •  Props  /  inputs  definiMon   •  Sourcetype  definiMon   •  Use  data  import  wizard   •  Import,  tweak,  repeat   •  Oneshot   •  [hook  up  monitor]   On-­‐boarding  Process   •  Prod   •  Deploy  app   •  Validate   •  Monitor   •  Test   •  Deploy  app   •  Oneshost   •  Validate   •  Hook  up  monitor   •  Validate     1   2   3  
  • 26. •  General:   •  Use  apps  for  configs   •  Use  TAs  /  add-­‐ons  from  Splunk  if  possible   •  Use  dev,  test,  prod   •  Dev  can  be  laptop,  test  can  be  ephemeral   •  UF  when  possible   •  HF  only  if  filtering  /  transforming  is  required  in  foreign  land   •  Unique  Sourcetype  per  event  stream   •  Don't  send  data  through  Search  Heads   •  Don't  send  data  direct  to  Indexers   Good  Hygiene  
  • 27. •  inputs.conf   •  As  specific  as  possible   •  Set  sourcetype,  if  possible   •  Don't  let  splunk  auto-­‐sourcetype  (no  ...too_small)   •  Specify  index  if  possible   •  props.conf   •  Set:  TIME_PREFIX,  TIME_FORMAT,   MAX_TIMESTAMP_LOOKAHEAD   •  OpMmally:  SHOULD_LINEMERGE  =  false,  LINE_BREAKER,   TRUNCATE   Good  Hygiene  
  • 29. •  IdenMfy  the  specific  sourcetype(s)  -­‐  onboard  each  separately   •  Check  for  pre-­‐exisMng  app/TA  on  splunk.com-­‐-­‐  don't  reinvent  the  wheel!   •  Gather  info   •  Where  does  this  data  originate/reside?    How  will  Splunk  collect  it?   •  Which  users/groups  will  need  access  to  this  data?    Access  controls?   •  Determine  the  indexing  volume  and  data  retenMon  requirements   •  Will  this  data  need  to  drive  exisMng  dashboards  (ES,  PCI,  etc.)?   •  Who  is  the  SME  for  this  data?   •  Map  it  out   •  Get  a  "big  enough"  sample  of  the  event  data   •  IdenMfy  and  map  out  fields   •  Assign  sourcetype  and  TA  names  according  to  CIM  convenMons   Pre-­‐Board  
  • 30. •  The  Common  InformaMon  Model  (CIM)  defines   relaMonships  in  the  underlying  data,  while  leaving  the  raw   machine  data  intact   •  A  naming  convenMon  for  fields,  evensypes  &  tags   •  More  advanced  reporMng  and  correlaMon  requires  that  the   data  be  normalized,  categorized,  and  parsed   •  CIM-­‐compliant  data  sources  can  drive  CIM-­‐based   dashboards  (ES,  PCI,  others)   Tangent:  What  is  the  CIM  and  why  should  I  care?  
  • 31. •  IdenMfy  necessary  configs  (inputs,  props  and  transforms)   to  properly  handle:   •  Mmestamp  extracMon,  Mmezone,  event  breaking,   sourcetype/host/source  assignments   •  Do  events  contain  sensiMve  data  (i.e.,  PII,  PAN,  etc.)?   Create  masking  transforms  if  necessary   •  Package  all  index-­‐Mme  configs  into  the  TA   Build  the  index-­‐Mme  configs  
  • 32. •  Assign  sourcetype  according  to  event  format;  events  with   similar  format  should  have  the  same  sourcetype   •  When  do  I  need  a  separate  index?   •  When  the  data  volume  will  be  very  large,  or  when  it  will   be  searched  exclusively  a  lot   •  When  access  to  the  data  needs  to  be  controlled   •  When  the  data  requires  a  specific  data  retenMon  policy   •  Resist  the  temptaMon  to  create  lots  of  indexes   Tangent:  Best  &  Worst  PracMces  
  • 33. •  Always  specify  a  sourcetype  and  index   •  Be  as  specific  as  possible:  use  /var/log/fubar.log,                                                                                                  not  /var/log/   •  Arrange  your  monitored  filesystems  to  minimize   unnecessary  monitored  logfiles   •  Use  a  scratch  index  while  tesMng  new  inputs   Best  &  Worst  PracMces  –  [monitor]  
  • 34. •  Lookout  for  inadvertent,  runaway  monitor  clauses   •  Don’t  monitor  thousands  of  files  unnecessarily–     that’s  the  NSA’s  job   •  From  the  CLI:  splunk  show  monitor   •  From  your  browser:  hsps://your_splunkd:8089/ services/admin/inputstatus/TailingProcessor:FileStatus   Best  &  Worst  PracMces  –  [monitor]  
  • 35. •  Find  &  fix  index-­‐Mme  problems  BEFORE  polluMng  your  index   •  A  try-­‐it-­‐before-­‐you-­‐fry-­‐it  interface  for  figuring  out   •  Event  breaking   •  Timestamp  recogniMon   •  Timezone  assignment   •  Provides  the  necessary  props.conf  parameter  sewngs   Your  friend,  the  Data  Previewer  Another   Tangent!  
  • 37. •  IdenMfy  "interesMng"  events  which  should  be  tagged  with  an  exisMng  CIM  tag  (hsp:// docs.splunk.com/DocumentaMon/CIM/latest/User/Alerts)   •  Get  a  list  of  all  current  tags:  |  rest  splunk_server=local  /services/admin/tags  |  rename   tag_name  as  tag,  field_name_value  AS  definiMon,  eai:acl.app  AS  app  |  eval   definiMon_and_app=definiMon  .  "  ("  .  app  .  ")"  |  stats  values(definiMon_and_app)  as   "definiMons  (app)"  by  tag  |  sort  +tag   •  Get  a  list  of  all  evensypes  (with  associated  tags):  |  rest  splunk_server=local  /services/ admin/evensypes  |  rename  Mtle  as  evensype,  search  AS  definiMon,  eai:acl.app  AS  app  |   table  evensype  definiMon  app  tags  |  sort  +evensype   •  Examine  the  current  list  of  CIM  tags.    For  each  "interesMng"  event,  idenMfy  which  tags   should  be  applied  to  each.    A  parMcular  event  may  have  mulMple  tags.   •  Are  there  new  tags  which  should  be  created,  beyond  those  in  the  current  CIM  tag  library?     If  so,  add  them  to  the  CIM  library   Build  the  search-­‐Mme  configs:  evenRypes  &  tags  
  • 38. •  Extract  "interesMng"  fields   •  If  already  in  your  CIM  library,  name  or  alias  appropriately   •  If  not  already  in  your  CIM  library,  name  according  to  CIM   convenMons   •  Add  lookups  for  missing/desirable  fields   •  Lookups  may  be  required  to  supply  CIM-­‐compliant  fields/field   values  (for  example,  to  convert  'sev=42'  to  'severity=medium'   •  Make  the  values  more  readable  for  humans   •  Put  everything  into  the  TA  package   Build  the  search-­‐Mme  configs:  extracMons  &  lookups  
  • 39. •  Create  data  models.    What  will  be  interesMng  for  end  users?   •  Document!    (Especially  the  fields,  evensypes  &  tags)   •  Test   •  Does  this  data  drive  relevant  exisMng  dashboards  correctly?   •  Do  the  data  models  work  properly  /  produce  correct  results?   •  Is  the  TA  packaged  properly?   •  Check  with  originaMng  user/group;  is  it  OK?   Keep  Going  
  • 40. •  Determine  addiMonal  Splunk  infrastructure  required;  can   exisMng  infrastructure  &  license  support  this?     •  Will  new  forwarders  be  required?    If  so,  iniMate  CR  process(es)   •  Will  firewall  changes  be  required?    If  so,  iniMate  CR  process(es)   •  Will  new  Splunk  roles  be  required?    Create  &  map  to  AD  roles   •  Will  new  app  contexts  be  required?    Create  app(s)  as  necessary   •  Will  new  users  be  added?    Create  the  accounts   Get  ready  to  deploy  
  • 41. •  Deploy  new  search  heads  &  indexers  as  needed   •  Install  new  forwarders  as  needed   •  Deploy  new  app  &  TA  to  search  heads  &  indexers   •  Deploy  new  TA  to  relevant  forwarders   Bring  it!  
  • 42. •  All  sources  reporMng?   •  Event  breaking,  Mmestamp,  Mmezone,  host,  source,   sourcetype?   •  Field  extracMons,  aliases,  lookups?   •  Evensypes,  tags?   •  Data  model(s)?   •  User  access?   •  Confirm  with  original  requesMng  user/group:  looks  OK?   Test  &  Validate  
  • 43. Done!
  • 44. •  Bring  new  data  sources  in  correctly  the  first  Mme   •  Reduce  the  amount  of  “bad”  data  in  your  indexes–  and   the  Mme  spent  dealing  with  it   •  Make  the  new  data  immediately  useful  to  ALL  users–  not   just  the  ones  who  originally  requested  it   •  Allow  the  data  to  drive  all  sorts  of  dashboards  without   extra  modificaMons   Gee,  this  seems  like  a  lot  of  work…  
  • 45. •  What  splunk  can  monitor:   •  hsp://docs.splunk.com/DocumentaMon/Splunk/latest/Data/WhatSplunkcanmonitor   •  How  data  moves  through  splunk:   •  hsp://docs.splunk.com/DocumentaMon/Splunk/latest/Deploy/Datapipeline   •  Components  of  the  data  pipeline:   •  hsp://docs.splunk.com/DocumentaMon/Splunk/latest/Deploy/Componentsofadistributedenvironment   •  Common  informaMon  model  app:   •  hsps://splunkbase.splunk.com/app/1621   •  Common  informaMon  model  docs:   •  hsp://docs.splunk.com/DocumentaMon/CIM/latest/User/Overview   •  Where  do  I  put  configs:   •  hsp://wiki.splunk.com/Where_do_I_configure_my_Splunk_sewngs     Reference  
  • 46. Copyright  ©  2014  Splunk  Inc.   Thank You! Jeff Meyers jam@splunk.com