Using Machine Learning to Debug complex Oracle RAC Issues
Anil Nair
Slide deck used at Oracle RAC DOAG 2019, Düsseldorf
Using Machine Learning to Debug complex Oracle RAC Issues
1.
Copyright © 2018, Oracle and/or its affiliates. All rights reserved.
Using Machine Learning to Debug Complex Oracle RAC Issues
Klaus Thielen, Architect
Anil Nair, Sr. Principal Product Manager, Oracle Real Application Clusters (RAC)
June 3, 2019
@RACMasterPM, @OracleRACpm
http://www.linkedin.com/in/anil-nair-01960b6
http://www.slideshare.net/AnilNair27/
2.
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.
3.
Program Agenda
1 Introduction
2 How do we diagnose issues?
3 What’s new?
4 Walk through some common scenarios
5 FAQ
4.
Program Agenda
1 Introduction
5.
Scalability without Application code change(s)
[Chart: SAP certified SD Benchmark results. X axis: # of cores across RAC nodes (4, 8, 32, 48, 64, 80). Y axis: users (data labels 2035, 4010, 15520, 22416, 30016, 37040). Series: 2 Nodes, 3 Nodes, 4 Nodes, 5 Nodes.]
Active-Active instances scale writes, reads, and hybrid workloads
6.
Oracle Real Application Cluster Family of Solutions
• Integrated set of tools that work cohesively to provide High Availability and Scalability
• The functionality provided by Oracle RAC Family of Solutions can be used by licensed Oracle RAC, Oracle RAC One Node and Single Instance customers without any additional charge
7.
Program Agenda
2 How do we diagnose issues?
8.
How do we diagnose issues?
9.
Let's walk through a sample problem resolution
10.
Let's walk through a sample problem resolution
There is a quiz at the end, so pay attention!
11.
Let's walk through a sample problem resolution
There is a quiz at the end, so pay attention!
Case studies covered last year: https://www.slideshare.net/AnilNair27/using-machine-learning-to-debug-oracle-rac-issues
12.
How do we diagnose issues?
• Reactive
• Detection and resolution requires diagnostic data from all the nodes along with
• Different types of issues such as performance slowdowns, hangs, deadlocks
[Diagram: Detect, React, Fix]
13.
How do we proactively detect and attempt to prevent issues?
14.
Automatic Notification and Diagnostic Collection using TFA
• Utilize TFA notification features
• $tfactl set smtp
• $tfactl set notificationAddress=".."
 – Prepend the owner for specific homes
• $tfactl set notificationAddress="salesora: John.Doe@oracle.com"
• Events include
 – Oracle Errors (ORA-)
 – Cluster Health Monitor events
• Diagnostic data is moved to DSC for further analysis by TFA Service
[Diagram: DBA 1 and DBA 2, each with a TFA Agent]
15.
Automatic Notification and Diagnostic Collection using TFA
• Utilize TFA notification features
• $tfactl set smtp
• $tfactl set notificationAddress=".."
 – Prepend the owner for specific homes
• $tfactl set notificationAddress="salesora: John.Doe@oracle.com"
• Events include
 – Oracle Errors (ORA-)
 – Cluster Health Monitor events
• Diagnostic data is moved to DSC for further analysis by TFA Service
[Diagram: DBA 1 and DBA 2, each with a TFA Agent]
Enable CHA Auto-collections
 tfactl set chaautocollect=ON
Enable Email Notifications in TFA
 tfactl set smtp
Enable Notifications for CHA events
 tfactl set chanotification=ON
Specify Email Address for Notifications (optional)
 tfactl set notificationAddress=chatfa:john.doe@oracle.com
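The four settings listed on this slide can be gathered into one reviewable list before anyone runs them on a node. The sketch below only assembles the command strings that appear on the slide; it does not invoke tfactl, and the helper name `cha_notification_commands` and its parameters are hypothetical, chosen for illustration.

```python
def cha_notification_commands(address=None, owner=None):
    """Return the tfactl command lines from the slide, in the order shown."""
    commands = [
        "tfactl set chaautocollect=ON",   # enable CHA auto-collections
        "tfactl set smtp",                # enable email notifications in TFA
        "tfactl set chanotification=ON",  # enable notifications for CHA events
    ]
    if address:
        # Optionally prepend the owner of a specific home, as in
        # "chatfa:john.doe@oracle.com" on the slide.
        target = f"{owner}:{address}" if owner else address
        commands.append(f"tfactl set notificationAddress={target}")
    return commands


for line in cha_notification_commands("john.doe@oracle.com", owner="chatfa"):
    print(line)
```

Keeping the commands as plain strings makes it easy to paste them into a change ticket or compare them across clusters before anything is executed.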
16.
Cluster Health Advisor TFA SMTP Notifications
Example Notification Email
From: <tfa@myserver1.example.com>
Date: Thursday, June 7, 2018 at 9:55 AM
To: <john.doe@example.com>
Subject: TFA CHA Event ASM Disk Utilization Cleared on myserver
At Thu Jun 07 09:55:35 PDT 2018 Oracle Cluster Health Analyzer detected ASM Cluster-wide Disk Utilization on host myserver1
Cluster: mycluster
CHA Problem Description : CHA detected that the ASM disk IO request rate is higher than expected.
CHA Problem Cause : The Cluster Health Advisor (CHA) detected slower than expected disk performance because the high disk I/O demand from the other servers increased the utilization of the shared disks.
CHA Problem Action : Review the CHA findings and corrective actions from the other servers and database instances in the cluster for IO issues. Add disks to the database disk groups.
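A notification like the one above has three labelled fields (Description, Cause, Action) that a mail filter or ticketing hook could pull out. The following is a minimal sketch; it assumes each "CHA Problem ..." field sits on its own line in the message body, which the flattened slide text does not confirm, and the function name `parse_cha_email` is hypothetical.

```python
import re

# Sample body assembled from the slide's example email (layout is an assumption).
SAMPLE = """Subject: TFA CHA Event ASM Disk Utilization Cleared on myserver
Cluster: mycluster
CHA Problem Description : CHA detected that the ASM disk IO request rate is higher than expected.
CHA Problem Cause : The Cluster Health Advisor (CHA) detected slower than expected disk performance because the high disk I/O demand from the other servers increased the utilization of the shared disks.
CHA Problem Action : Review the CHA findings and corrective actions from the other servers and database instances in the cluster for IO issues. Add disks to the database disk groups.
"""

FIELD = re.compile(
    r"^CHA Problem (Description|Cause|Action)\s*:\s*(.+)$", re.MULTILINE
)


def parse_cha_email(body):
    """Map 'description', 'cause' and 'action' to their text."""
    return {name.lower(): text.strip() for name, text in FIELD.findall(body)}


fields = parse_cha_email(SAMPLE)
for name in ("description", "cause", "action"):
    print(f"{name}: {fields[name]}")
```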
17.
Configure OraChk to run in daemon mode
• OraChk can help compare and contrast different configurations
 – Different I/O scheduler?
 – Centralized location makes it easier to identify diffs
• Daemon Mode
 – $orachk -d start/stop/status
• Configure Schedule
 – $orachk AUTORUN_SCHEDULE
• What’s the difference between OraChk and ExaChk?
[Diagram: DBA 1 and DBA 2, both labelled 11204. Oracle RAC Assessment Report: System Health Score is 89 out of 100. Oracle RAC Assessment Report: System Health Score is 98 out of 100.]
18.
Configure OraChk to run in daemon mode
• OraChk can help compare and contrast different configurations
 – Different I/O scheduler?
 – Centralized location makes it easier to identify diffs
• Daemon Mode
 – $orachk -d start/stop/status
• Configure Schedule
 – $orachk AUTORUN_SCHEDULE
• What’s the difference between OraChk and ExaChk?
[Diagram: DBA 1 and DBA 2, both labelled 11204]
19.
Let's walk through a sample problem resolution
20.
Node Eviction due to memory pressure
Memory pressure as reported by the OS
In the past, this would result in node eviction
21.
Misconfiguration!!!
[Diagram: three Application Servers]
22.
Memory Guard will proactively step in to prevent Node Eviction
• Monitors memory allocated and detects conditions that may result in swapping
• Issues an alert should any server be at risk
• Stops services transactionally on the server
• This prevents any new application connections
• Restarts the services once the excess memory allocation issue is cleared
[Diagram: Sales_svc, Sales_svc]
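The sequence in these bullets (detect risk, alert, stop services for new connections, restart once cleared) can be pictured as a two-state loop. The toy model below is purely illustrative and is not Oracle's implementation: the class name `ToyMemoryGuard`, the single `at_risk_fraction` threshold, and the event strings are all assumptions made for the sketch.

```python
from dataclasses import dataclass, field


@dataclass
class ToyMemoryGuard:
    # Hypothetical threshold: fraction of memory in use at which the
    # server is treated as "at risk". The real criteria are not on the slide.
    at_risk_fraction: float = 0.9
    services_accepting: bool = True
    events: list = field(default_factory=list)

    def observe(self, used_fraction):
        """Feed one memory sample; return whether services accept new connections."""
        at_risk = used_fraction >= self.at_risk_fraction
        if at_risk and self.services_accepting:
            self.events.append("alert")
            self.events.append("stop services for new connections")
            self.services_accepting = False
        elif not at_risk and not self.services_accepting:
            self.events.append("restart services")
            self.services_accepting = True
        return self.services_accepting


guard = ToyMemoryGuard()
states = [guard.observe(u) for u in (0.60, 0.95, 0.97, 0.70)]
print(states)
print(guard.events)
```

The point of the two-state shape is that repeated high samples do not re-trigger the stop action; services stay stopped for new connections until a sample falls back below the threshold.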
23.
Oracle Memory Guard
Memory stress detected by Memory Guard
[Screenshot labels: DB Alert Log, Audit Reports]
24.
Oracle Memory Guard
Memory Guard stops services for incoming connections
[Screenshot labels: DB Alert Log, Audit Reports]
25.
Oracle Memory Guard
Services for new connections are restarted on the node
[Screenshot labels: DB Alert Log, Audit Reports]
26.
So what is the root cause?
Hmmm… Who used all my memory?
27.
CHM provides aggregated data by Process Type
Use historical data to fix the root cause
28.
Program Agenda
3 What’s new?
29.
Oracle Autonomous Health Framework (AHF)
• Integrated next generation tools running as components - 24/7
• Discovers Potential Issues and takes Corrective Actions
• Speeds up Issue Diagnosis and Resolution
• Maintains Database Performance and Availability
30.
Applied Machine Learning Pipeline Diagnostics
• Generic ML-extracted data clusters are insufficient for diagnostics
• Operational data correlation does not determine root cause
• Trusted root cause determination is critical to swift corrective actions
• Algorithms selected and models built require domain expertise
• Models refined via field feedback
[Diagram labels: Subject Matter Expert; Log, ASH, Metrics; Scrub Data; ML Knowledge Extraction; Model Generation; Human Supervision; Application Optimized Models; Feedback]
31.
Data Pipeline Overview
Autonomous Health – Database Performance
32.
Program Agenda
4 Walk through some common scenarios
33.
Scenario 1 – Remember our Service Shutdown
34.
You could have reached the same conclusion using CHA
• CHAG (Cluster Health Advisor Graphical) Doc ID 2340062.1
• CHAG is the GUI to utilize the benefits of AHF
• Can be run on the cluster node – Not Recommended*
 – Set ORACLE_HOME to the GI home
 – Connects to the GIMR using wallets
 – Instead, install locally on a separate Linux host
$ export ORACLE_HOME=/u01/app/12.2.0/grid
$ ./chag
CHAG logging to log/chagout_20873.log
Initializing DB reader
Connect via 'jdbc:oracle:thin:@(DESCRIPTION=(ADDRESS_LIST=(ADDRESS=…..
CHAG MDB feed open. Timings:
Load JDBC driver: 147.50 ms, Connect to MDB: 1398.85 ms
1st Query with 60 minutes of data (15:00:56..16:00:56): 16546 ms
1st CLOB : (59290 lines, 1.484 MB) parsing time: 644 ms, 10.49 mics/line
35.
anair1
36.
Use CHAG on a Remote system
• Remote Mode requires the GIMR data to be exported
 – Execute the following on the cluster node
 – $chactl export repository -format mdb -start '' -end ''
• Copy the MDB to the local node and execute
 – chag -f <mdb_file>
• MDB file includes all the data for post mortem analysis from all the nodes
$chactl export repository -format mdb -start '2017-12-15 00:00:00' -end '2017-12-20 00:00:00'
successfully dumped the CHA statistics to location "/u01/app/gridbase/crsdata/anair/trace/chad/cha_dump_20171215_000000_20171215_010000.mdb"
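The export command takes a start and end timestamp in the `YYYY-MM-DD HH:MM:SS` form shown in the example. A small helper can format the window and catch a reversed range before the command is pasted onto a cluster node. This is a sketch only: the function name `chactl_export_command` is hypothetical, and it builds the command string without running chactl.

```python
from datetime import datetime


def chactl_export_command(start, end):
    """Build the 'chactl export repository' line for a time window."""
    if end <= start:
        raise ValueError("end must be later than start")
    fmt = "%Y-%m-%d %H:%M:%S"  # timestamp layout used in the slide's example
    return (
        "chactl export repository -format mdb "
        f"-start '{start.strftime(fmt)}' -end '{end.strftime(fmt)}'"
    )


print(chactl_export_command(datetime(2017, 12, 15), datetime(2017, 12, 20)))
```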
37.
Sample Problems and Resolution
Problem 2 – Why was my Session killed?
38.
Hang Manager interventions reported via ORA-32701
Dump file …/diag/rdbms/hm6/hm62/incident/incdir_5753/hm62_dia0_12656_i5753.trc
Oracle Database 12c Enterprise Edition Release 12.2.0.0.0 - 64bit Beta
With the Partitioning, Real Application Clusters, OLAP, Advanced Analytics and Real Application Testing options
Build label: RDBMS_MAIN_LINUX.X64_151013
ORACLE_HOME: …/3775268204/oracle
System name: Linux
Node name: slc05kyr
Release: 2.6.39-400.211.1.el6uek.x86_64
Version: #1 SMP Fri Nov 15 13:39:16 PST 2013
Machine: x86_64
VM name: Xen Version: 3.4 (PVM)
Instance name: hm62
Redo thread mounted by this instance: 2
Oracle process number: 19
Unix process pid: 12656, image: oracle@slc05kyr (DIA0)
*** 2015-10-13T16:47:59.541509+17:00
*** SESSION ID:(96.41299) 2015-10-13T16:47:59.541519+17:00
*** CLIENT ID:() 2015-10-13T16:47:59.541529+17:00
*** SERVICE NAME:(SYS$BACKGROUND) 2015-10-13T16:47:59.541538+17:00
*** MODULE NAME:() 2015-10-13T16:47:59.541547+17:00
*** ACTION NAME:() 2015-10-13T16:47:59.541556+17:00
*** CLIENT DRIVER:() 2015-10-13T16:47:59.541565+17:00
Alert log
2015-10-13T16:47:59.435039+17:00
Errors in file /oracle/log/diag/rdbms/hm6/hm6/trace/hm6_dia0_12433.trc (incident=7353):
ORA-32701: Possible hangs up to hang ID=1 detected
Incident details in: …/diag/rdbms/hm6/hm6/incident/incdir_7353/hm6_dia0_12433_i7353.trc
2015-10-13T16:47:59.506775+17:00
DIA0 requesting termination of session sid:40 with serial # 43179 (ospid:13031) on instance 2 due to a GLOBAL, HIGH confidence hang with ID=1.
Hang Resolution Reason: Automatic hang resolution was performed to free a significant number of affected sessions.
DIA0: Examine the alert log on instance 2 for session termination status of hang with ID=1.
2015-10-13T16:47:59.538673+17:00
Errors in file …/diag/rdbms/hm6/hm62/trace/hm62_dia0_12656.trc (incident=5753):
ORA-32701: Possible hangs up to hang ID=1 detected
Incident details in: …/diag/rdbms/hm6/hm62/incident/incdir_5753/hm62_dia0_12656_i5753.trc
2015-10-13T16:48:04.222661+17:00
DIA0 terminating blocker (ospid: 13031 sid: 40 ser#: 43179) of hang with ID = 1 requested by master DIA0 process on instance 1
Hang Resolution Reason: Automatic hang resolution was performed to free a significant number of affected sessions.
by terminating session sid:40 with serial # 43179 (ospid:13031)
ORA-32701: Possible hangs up to hang ID=1 detected
Incident details in: …/../hm62_dia0_12656_i5753.trc
DIA0 terminating blocker (ospid: 13031 sid: 40 ser#: 43179) requested by master DIA0 process on instance 1
Hang Resolution Reason: Automatic hang resolution was performed to free a significant number of affected sessions.
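To answer "why was my session killed?" across a long alert log, the two DIA0 messages shown above can be matched with regular expressions and reduced to the session identifiers involved. The sketch below is written against the exact wording on this slide and assumes each message sits on a single line; other releases may phrase these messages differently, and the function name `hang_events` is hypothetical.

```python
import re

# Lines copied from the slide's alert log excerpt.
ALERT_LOG = """\
ORA-32701: Possible hangs up to hang ID=1 detected
DIA0 requesting termination of session sid:40 with serial # 43179 (ospid:13031) on instance 2 due to a GLOBAL, HIGH confidence hang with ID=1.
DIA0 terminating blocker (ospid: 13031 sid: 40 ser#: 43179) of hang with ID = 1 requested by master DIA0 process on instance 1
"""

REQUEST = re.compile(
    r"DIA0 requesting termination of session sid:(?P<sid>\d+) "
    r"with serial # (?P<serial>\d+) \(ospid:(?P<ospid>\d+)\) "
    r"on instance (?P<instance>\d+) due to a "
    r"(?P<scope>\w+), (?P<confidence>\w+) confidence hang with ID=(?P<hang_id>\d+)"
)
TERMINATE = re.compile(
    r"DIA0 terminating blocker \(ospid: (?P<ospid>\d+) sid: (?P<sid>\d+) "
    r"ser#: (?P<serial>\d+)\) of hang with ID = (?P<hang_id>\d+)"
)


def hang_events(text):
    """Return one dict per Hang Manager request/termination line found."""
    events = []
    for line in text.splitlines():
        for kind, pattern in (("requested", REQUEST), ("terminated", TERMINATE)):
            match = pattern.search(line)
            if match:
                events.append({"kind": kind, **match.groupdict()})
    return events


events = hang_events(ALERT_LOG)
for event in events:
    print(event)
```

Matching on sid, serial number and ospid together lets the request on one instance be paired with the termination recorded on another.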
39.
Sample Problems and Resolution
Problem 3 – How long did the reconfiguration take?
40.
Reconfiguration Diagnosability

**************** BEGIN DLM RCFG HA STATS ****************
Total dlm rcfg time (inc 6): 3.586 secs (394926177, 394929763)
Begin step .........: 0.005 secs (394926177, 394926182)
Freeze step ........: 0.019 secs (394926182, 394926201)
Sync 1 step ........: 0.002 secs (394926264, 394926266)
Sync 2 step ........: 0.024 secs (394926266, 394926290)
Enqueue cleanup step: 0.002 secs (394926290, 394926292)
Sync pcm1 step .....: 0.004 secs (394926293, 394926297)
…… ….
Enqueue dubious step: 0.004 secs (394926432, 394926436)
Sync 5 step ........: 0.000 secs (394926436, 394926436)
Enqueue grant step .: 0.001 secs (394926436, 394926437)
Sync 6 step ........: 0.012 secs (394926437, 394926449)
Fixwrt replay step .: 0.885 secs (394928837, 394929722)
Sync 8 step ........: 0.040 secs (394929722, 394929762)
End step ...........: 0.001 secs (394929762, 394929763)
Number of replayed enqueues sent / received .......: 2246 / 893
Number of replayed fusion locks sent / received ...: 124027 / 0
Number of enqueues mastered before / after rcfg ...: 2058 / 1384
**************** END DLM RCFG HA STATS *****************

Detailed timing breakdown available in LMON trace file
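To answer "how long did it take, and where did the time go", the step lines of this block can be parsed and ranked. The sketch below uses only the lines visible on the slide (the elided steps are left out), and the regular expressions and helper name `parse_steps` are illustrative assumptions rather than a documented trace format. In these numbers each pair in parentheses differs by the reported seconds times 1000, which suggests millisecond tick values; the code checks that observation instead of assuming it.

```python
import re

# Lines copied from the DLM RCFG HA STATS block on the slide (elided steps omitted).
RCFG_STATS = """\
Total dlm rcfg time (inc 6): 3.586 secs (394926177, 394929763)
Begin step .........: 0.005 secs (394926177, 394926182)
Freeze step ........: 0.019 secs (394926182, 394926201)
Sync 1 step ........: 0.002 secs (394926264, 394926266)
Sync 2 step ........: 0.024 secs (394926266, 394926290)
Enqueue cleanup step: 0.002 secs (394926290, 394926292)
Sync pcm1 step .....: 0.004 secs (394926293, 394926297)
Enqueue dubious step: 0.004 secs (394926432, 394926436)
Sync 5 step ........: 0.000 secs (394926436, 394926436)
Enqueue grant step .: 0.001 secs (394926436, 394926437)
Sync 6 step ........: 0.012 secs (394926437, 394926449)
Fixwrt replay step .: 0.885 secs (394928837, 394929722)
Sync 8 step ........: 0.040 secs (394929722, 394929762)
End step ...........: 0.001 secs (394929762, 394929763)
"""

# Hypothetical patterns, written against the layout above only.
TOTAL = re.compile(
    r"^Total dlm rcfg time \(inc (?P<inc>\d+)\): (?P<secs>\d+\.\d+) secs "
    r"\((?P<start>\d+), (?P<end>\d+)\)$"
)
STEP = re.compile(
    r"^(?P<name>.+? step)[ .]*: (?P<secs>\d+\.\d+) secs "
    r"\((?P<start>\d+), (?P<end>\d+)\)$"
)


def parse_steps(text):
    """Return (name, seconds, start_tick, end_tick) for every step line."""
    steps = []
    for line in text.splitlines():
        match = STEP.match(line.strip())
        if match:
            steps.append(
                (match["name"], float(match["secs"]), int(match["start"]), int(match["end"]))
            )
    return steps


total = TOTAL.match(RCFG_STATS.splitlines()[0])
total_secs = float(total["secs"])
steps = parse_steps(RCFG_STATS)

# Check the millisecond-tick observation on every parsed step.
ticks_are_ms = all(abs((end - start) / 1000 - secs) < 1e-9 for _, secs, start, end in steps)

slowest = max(steps, key=lambda step: step[1])
listed_secs = sum(secs for _, secs, _, _ in steps)

print(f"ticks look like milliseconds: {ticks_are_ms}")
print(f"slowest listed step: {slowest[0]} ({slowest[1]:.3f} s of {total_secs:.3f} s)")
print(f"time in listed steps: {listed_secs:.3f} s; remainder is in the elided steps")
```

Ranking by step duration points at the step worth investigating in the LMON trace, where the full breakdown is recorded.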
41.
Sample Problems and Resolution
Problem 5 – Is Dynamic Resource Management (DRM) helping my workload?
42.
DRM Diagnosability

Dynamic Remastering Statistics DB/Inst: SALES/sales1 Snaps: 393-452
-> Affinity objects - Affinity objects mastered at the begin/end snapshot
-> Read-mostly objects - Read-mostly objects mastered at the begin/end snapshot

                                                        per    Begin      End
Name                                    Total   Remaster Op     Snap     Snap
-------------------------------- ------------ ------------- -------- --------
remaster ops                               24          1.00
remastered objects                         24          1.00
remaster time (s)                         7.4          0.31
freeze time (s)                           1.5          0.06
cleanup time (s)                          2.4          0.10
replay time (s)                           0.3          0.01
fixwrite time (s)                         2.4          0.10
sync time (s)                             0.1          0.01
affinity objects                                        N/A        3       27
read-mostly objects                                     N/A        0        0
read-mostly objects (persistent)                        N/A        0        0

Detailed timing breakdown available in AWR Report
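A quick way to read this section is to recompute the per-operation cost from the Total column and compare it with the change in affinity objects. The sketch below is plain arithmetic on the slide's numbers; the variable names are illustrative. The report presumably divides unrounded totals, so a recomputed value can differ in the last digit (for example, sync time).

```python
# Totals copied from the Dynamic Remastering Statistics table on the slide.
remaster_ops = 24
totals_s = {
    "remaster time": 7.4,
    "freeze time": 1.5,
    "cleanup time": 2.4,
    "replay time": 0.3,
    "fixwrite time": 2.4,
    "sync time": 0.1,
}
affinity_begin, affinity_end = 3, 27

# Average seconds spent per remaster operation, recomputed from the rounded totals.
per_op = {name: total / remaster_ops for name, total in totals_s.items()}

# Net change in affinity objects between the begin and end snapshots.
affinity_growth = affinity_end - affinity_begin

for name, value in per_op.items():
    print(f"{name:<14} {value:.2f} s per remaster op")
print(f"affinity objects grew by {affinity_growth} over {remaster_ops} remaster ops")
```

Here the overhead side is about 0.31 s per remaster operation; whether that is worth it depends on how much cross-instance traffic the newly affinitized objects save, which this table alone does not show.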
43.
Program Agenda: FAQ
44.
Frequently Asked Question #1
Why does GIMR (mgmtdb) need so much space?
45.
GIMR space requirements

Cluster Type: Domain Services Cluster (2 Node DSC with 4 Member Clusters of 2 Nodes each)
  Redundancy    MGMT DG (GB)
  External       345
  Normal         690
  High/Flex     1035
Cluster Type: Standalone Cluster (4 Node Cluster)
  Redundancy    MGMT DG (GB)
  External        30
  Normal          60
  High/Flex       90

• The Oracle GI 18c feature AHF (Autonomous Health Framework) collects, correlates, and stores diagnostic data from the OS and DB in MGMT
• In a DSC, one PDB per member cluster is provisioned to store that member cluster's diagnostic data
• The DSC also stores the central TFA repository
• This data supports diagnosing issues in real time, accurate and quick post-mortem analysis, and historical data for workload characterization over time, performance review, and capacity review
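The six sizes in the table follow a simple pattern: Normal is twice the External figure and High/Flex is three times it. The sketch below encodes that observation so the table can be reproduced from the two External values. The multipliers are inferred from these numbers only and are not a documented Oracle sizing formula; the function name `mgmt_dg_gb` is hypothetical.

```python
# MGMT disk group sizes (GB) with External redundancy, copied from the slide.
EXTERNAL_GB = {
    "Domain Services Cluster": 345,  # 2 Node DSC with 4 Member Clusters of 2 Nodes each
    "Standalone Cluster": 30,        # 4 Node Cluster
}

# Assumption: multipliers inferred from the table itself, not from Oracle documentation.
MULTIPLIER = {"External": 1, "Normal": 2, "High/Flex": 3}


def mgmt_dg_gb(cluster_type, redundancy):
    """Return the MGMT disk group size in GB for the slide's two example clusters."""
    return EXTERNAL_GB[cluster_type] * MULTIPLIER[redundancy]


for cluster_type in EXTERNAL_GB:
    for redundancy in MULTIPLIER:
        print(f"{cluster_type:<24} {redundancy:<10} {mgmt_dg_gb(cluster_type, redundancy):>5} GB")
```

The two External figures apply only to the cluster shapes named on the slide, so other node counts or member-cluster counts need their own baseline.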
46.
Frequently Asked Question #2
Why is the Grid Infrastructure Management Repository mandatory?
47.
Is the Grid Infrastructure Management Repository (GIMR) mandatory?
• Starting with version 19, the GIMR is going to be optional!
  – You can choose whether to install it during the installation of the Grid Infrastructure
  – If you decide to go without the GIMR, we are going to write into the filesystem
  – As soon as we find a running repository (GIMR), we start writing into the repository
• Databases are also going to be monitored automatically
  – Pre version 19: The cluster nodes were monitored automatically
48.
Will we get the same functionality without the GIMR?
• No! There is no support for
  – Cluster Health Advisor GUI (CHAG)
    • Online and offline mode
  – CHACTL
    • Calibrate
    • Export repository -format
    • query diagnosis
  – Global view over the cluster
  – Good and Bad Neighbor analysis
• Trace File Analyzer will provide a limited graphical view of root causes
  – When chaautocollect and chanotification are set to ON
49.
GIMR is optional but recommended!!!
If you install it, you will get the advantage of the full functionality of CHA
50.
Frequently Asked Question #3
Should I continue to use OSWatcher?
51.
Should I continue to use OSWatcher?
• Hopefully by now, the benefits of AHF are clear
• AHF collects much of the data that OSWatcher also collects
• AHF continues to be enhanced to improve
  – Diagnosing large numbers of deployments
  – Correlating data to speed diagnosis
  – Preventing issues in the first place
  – Utilizing the latest technologies like machine learning
• However, customers can choose to use both OSW and AHF
52.
Frequently Asked Question #4
But it is still one more database for me to manage
• opatch automatically patches GIMR if required
• Clients of GIMR connect using encrypted credentials
• MGMT listener is automatically maintained
53.
For Further Information
• Oracle 19c Autonomous Health Framework User’s Guide
• Oracle 19c Clusterware Administration and Deployment Guide
• Oracle Autonomous Health Framework on OTN
• Oracle QoS Management on OTN
• Oracle 19c ORAchk
• Oracle 19c Trace File Analyzer
• MDBUtil: GI Management Repository configuration tool (MOS Doc ID 2065175.1)
• Cluster Health Advisor Graphical User Interface - CHAG (Doc ID 2340062.1)