8 Resolving Performance Degradation Over Time

This chapter describes how to resolve performance degradation over time with Oracle Database. Performance degradation of the database over time happens when your database was performing optimally in the past, such as 6 months ago, but has gradually degraded to a point where it becomes noticeable to the users.

The Automatic Workload Repository (AWR) Compare Periods report enables you to compare database performance between two periods of time. While an AWR report shows AWR data between two snapshots (or two points in time), the AWR Compare Periods report shows the difference between two periods (or two AWR reports, which equates to four snapshots).

Using the AWR Compare Periods report helps you to identify detailed performance attributes and configuration settings that differ between two time periods. For example, if the application workload is known to be stable between 10:00 p.m. and midnight every night, but the performance on a particular Thursday was poor between 10:00 p.m. and 11:00 p.m., generating an AWR Compare Periods report for Thursday from 10:00 p.m. to 11:00 p.m. and Wednesday from 10:00 p.m. to 11:00 p.m. should identify configuration settings, workload profile, and statistics that were different in these two time periods. Based on the differences identified, the cause of the performance degradation can be more easily diagnosed. The two time periods selected for the AWR Compare Periods Report can be of different durations, because the report normalizes the statistics by the amount of time spent on the database for each time period, and presents statistical data ordered by the largest difference between the periods.

This chapter contains the following sections:

Creating Baselines
Running the Automatic Workload Repository Compare Periods Reports
Using the Automatic Workload Repository Compare Periods Reports

See Also:

"Gathering Database Statistics Using the Automatic Workload Repository"

Creating Baselines

Baselines are an effective way to diagnose performance problems. AWR supports the capture of baseline data by enabling you to specify and preserve a pair or a range of snapshots as a baseline. The snapshots contained in a baseline are excluded from the automatic AWR purging process and are retained indefinitely.

Carefully consider the time period you choose as a baseline, because the baseline should represent the system operating at an optimal level. In the future, you can compare these baselines with the snapshots captured during periods of poor performance to analyze performance degradation over time.

To create baselines :

On the Database Performance page, under Additional Monitoring Links, click Snapshots.

The Snapshots page appears with a list of the most recent snapshots.

Description of the illustration baselines_snapshots.gif
To filter the snapshots and only display the snapshots taken at the start of the desired baseline, select the time for the starting snapshot in the Go To Time field and click Go.

In this example, 12:00AM on July 7, 2006 is selected.

Description of the illustration baselines_go_to_time.gif
Select the starting snapshot for the baseline.

In this example, snapshot 2282 is selected.

Description of the illustration baseline_snap_begin.gif
From the Actions list, select Create Preserved Snapshot Set and click Go.

The Create Preserved Snapshot Set page appears with a list of subsequent snapshots displayed. Note that a system generated value is assigned in the Preserved Snapshot Name field.
Under Select Ending Snapshot, select the ending snapshot for the baseline and click OK.

In this example, snapshot 2283 is selected.

Description of the illustration baseline_snap_end.gif
After the preserved snapshot set is taken, the Preserved Snapshot Sets page appears with a Confirmation message.

In this example, the preserved snapshot set that was created has a Preserved Snapshot Set ID of 2 and contains snapshots 2282 through 2283.

Description of the illustration baseline_presnapset.gif
The preserved snapshot set that was created can now be used as a baseline for comparison to other snapshots when performance problems occur. To view the statistics gathered for this baseline, click the Preserved Snapshot Set ID link.

The Preserved Snapshot Set Details page appears with the statistics gathered for the baseline displayed.

Description of the illustration baseline_presnapset_det.gif

Running the Automatic Workload Repository Compare Periods Reports

This section describes how to run the AWR Compare Periods reports using Oracle Enterprise Manager.

You can use AWR Compare Periods reports to compare the database performance between two time periods by:

Comparing a Baseline to Another Baseline or Pair of Snapshots
Comparing Two Pairs of Snapshots

Comparing a Baseline to Another Baseline or Pair of Snapshots

When performance degradation happens to a database over time, you should run the AWR Compare Periods report to compare the degraded performance, captured as a new baseline or a pair of snapshots, to an existing baseline that represents a time when the system was operating at an optimal level. To do so, you will need to have preserved a baseline that represents the system operating at an optimal level. If an existing baseline is not available, you can compare database performance between two periods of time by using two arbitrary pairs of snapshots, as described in "Comparing Two Pairs of Snapshots".

See Also:

"Creating Baselines"

To compare a baseline to another baseline :

On the Database Administration page, under Statistics Management, click Automatic Workload Repository.

The Automatic Workload Repository page appears.

Description of the illustration awr1.gif
Under Manage Snapshots and Preserved Snapshot Sets, click the link next to Preserved Snapshot Sets.

The Preserved Snapshot Sets page appears.

Description of the illustration awr_presnapset.gif
Select the baseline you want to use for the report. At least one existing preserved snapshot set must be available to be used as a baseline.

From the Actions list, select Compare Periods and click Go.

The Compare Periods: Second Period Start page appears. Under First Period, the selected baseline is displayed. In this example, the baseline named AWR_BASELINE_2006 is selected.

Description of the illustration baseline_first_period.gif
You can compare the baseline selected in the first period to another baseline or a pair of snapshots.
- To compare to another baseline, select Select a Preserved Snapshot Set and then the baseline you want to use in the second period.
  
  In this example, the preserved snapshot set named AWR_20061019 is selected.
  
  Description of the illustration awr_baseline2.gif
  
  Click Next. The Compare Periods: Review page appears. Proceed to Step 6.
- To compare to a pair of snapshots, select Select Beginning Snapshot and then the beginning snapshot you to use in the second period.
  
  In this example, snapshot 3693, taken on October 19, 2006 at 2:00 p.m., is selected.
  
  Description of the illustration awr_snapshot2_begin2.gif
  
  Click Next. The Compare Periods: Second Period End page appears. Continue with the next step.
Select the ending snapshot for the snapshot period that will be included in the report and click Next.

In this example, snapshot 3696, taken on October 19, 2006 at 5:00 p.m., is selected.

Description of the illustration awr_snapshot2_end2.gif

The Compare Periods: Review page appears.

Description of the illustration awr2_review.gif
Review the selected periods that will be included in the report and click Finish.

The Compare Periods: Results page appears. Data from the selected periods appears under the General subpage. Data can be viewed per second or per transaction by selecting the desired option from the View Data list.

Description of the illustration awr2_results_general.gif

In this example, parse time in the first period is much higher than the second.
To view the report, click the Report tab.

The Processing: View Report page appears while the report is being generated. After it completes, the report will appear, as shown in Figure 8-1. To change periods, click Change Periods. To save the report as an HTML file, click Save to File.

See Also:

"Using the Automatic Workload Repository Compare Periods Reports"

Comparing Two Pairs of Snapshots

If an existing baseline is not available, you can compare the database performance using two arbitrary pairs of snapshots, one pair taken when the database is performing optimally, and another pair when the database is performing poorly.

To compare performance using two pairs of snapshots :

On the Database Administration page, under Statistics Management, click Automatic Workload Repository.

The Automatic Workload Repository page appears.

Description of the illustration awr2.gif
Under Manage Snapshots and Preserved Snapshot Sets, click the link next to Snapshots.

The Snapshots page appears. At least four existing snapshots must be available.

Description of the illustration awr_snapshots.gif
To filter the snapshots and display only the snapshot taken at the start of the comparison period, in the Go To Time field, select the time for the starting snapshot and click Go.

In this example, 10:00 p.m. on Thursday, July 6, 2006, is selected.

Description of the illustration awr_gototime_1.gif
Under Select Beginning Snapshot, select the starting point for the first snapshot period that will be included in the report.

In this example, snapshot 2274, taken on Thursday, July 6, 2006 at 10:00 p.m., is selected.

Description of the illustration awr_snapshot1_begin.gif
From the Actions list, select Compare Periods and click Go.

The Compare Periods: First Period End page appears.
Select the ending point for the first snapshot period that will be included in the report and click Next.

In this example, snapshot 2278, taken on Thursday, July 6, 2006 at 11:00 p.m., is selected.

Description of the illustration awr_snapshot1_end.gif

The Compare Periods: Second Period Start page appears.
Select the starting point for the second snapshot period that will be included in the report and click Next.

In this example, snapshot 2302, taken on Friday, July 7, 2006 at 10:00 p.m., is selected.

Description of the illustration awr_snapshot2_begin.gif

The Compare Periods: Second Period End page appears.
Select the end point for the second period that will be included in the report and click Next.

In this example, snapshot 2304, taken on Friday, July 7, 2006 at 11:00 p.m., is selected.

Description of the illustration awr_snapshot2_end.gif

The Compare Periods: Review page appears.

Description of the illustration awr_review.gif
Review the selected periods that will be included in the report and click Finish.

The Compare Periods: Results page appears. Data from the selected periods appears under the General subpage. Data can be viewed per second or per transaction by selecting the desired option from the View Data list.

Description of the illustration awr_results_general.gif

In this example, logical reads in the first period are much higher than the second.
To view the report, click the Report tab.

The Processing: View Report page appears while the report is being generated. After it completes, the report will appear, as shown in Figure 8-1. To change periods, click Change Periods. To save the report as an HTML file, click Save to File.

Using the Automatic Workload Repository Compare Periods Reports

After an AWR Compare Periods report is generated for the time periods you want to compare, you can use it to perform an analysis of performance degradation with Oracle Database that may have happened over time. For information about generating AWR Compare Periods reports, see "Running the Automatic Workload Repository Compare Periods Reports".

Figure 8-1 shows an example of an AWR Compare Periods report.

Figure 8-1 AWR Compare Periods Report

Description of "Figure 8-1 AWR Compare Periods Report"

The contents of the AWR Compare Periods report are divided into the following sections:

Report Summary
Wait Events
Time Model Statistics
Operating System Statistics
Service Statistics
SQL Statistics
Instance Activity Statistics
I/O Statistics
Advisory Statistics
Wait Statistics
Latch Statistics
Segment Statistics
Dictionary Cache Statistics
Library Cache Statistics
SGA Statistics
init.ora Parameters

Report Summary

The report summary is at the beginning of the AWR Compare Periods report, and summarizes information about the snapshot sets and loads used in the report. The report summary contains the following sections:

Snapshot Sets
Configuration Comparison
Load Profile
Top 5 Timed Events

Snapshot Sets

The Snapshot Sets section, shown in Figure 8-2, displays information about the snapshot sets used for this report, such as instance, host, and snapshot information.

Figure 8-2 Snapshot Sets

Description of "Figure 8-2 Snapshot Sets"

In this example, the first snapshot period corresponds to the time when performance degradation was experienced on July 6, 2006 from 10:00 p.m. to 11:00 p.m. The second snapshot period represents a time when performance was stable on July 7, 2006 from 10:00 p.m. to 11:00 p.m.

Configuration Comparison

The Configuration Comparison section compares the configurations used in the two snapshot sets. Any differences in the configurations are quantified as percentages differed in the %Diff column.

Load Profile

The Load Profile section compares the loads used in the two snapshot sets. Any differences in the loads are quantified as percentages differed in the %Diff column.

Top 5 Timed Events

The Top 5 Timed Events section, shown in Figure 8-3, displays the five events or operations that consumed the most CPU time (represented as a percentage of total DB time) in each of the snapshot sets.

Figure 8-3 Top 5 Timed Events

Description of "Figure 8-3 Top 5 Timed Events"

In this example, CPU time is much higher in the first period than in the second. Waits for the db file sequential read event is also significantly higher in the first period than in the second.

Wait Events

The Wait Events section , shown in Figure 8-4, compares the wait events in the two snapshot sets. The wait events are ordered based on the difference in total DB time spent on the wait event between the two snapshot sets, and are listed in descending order. Wait events at the top of this section have the greatest differential between the two snapshot sets, and may be possible causes for performance degradation over time.

Figure 8-4 Wait Events

Description of "Figure 8-4 Wait Events"

In this example, the wait time for several events is much higher in the first period than in the second, such as db file sequential read, db file scattered read, latch free, and buffer busy waits.

See Also:

"Wait Event Statistics"

Time Model Statistics

The Time Model Statistics section , shown in Figure 8-5, compares time model statistics in the two snapshot sets. The time model statistics are ordered based on the difference in total DB time spent on a particular type of operation between the two snapshot sets, and are listed in descending order. Time model statistics at the top of this section have the greatest differential between the two snapshot sets, and the related operations may be possible causes for performance degradation over time.

Figure 8-5 Time Model Statistics

Description of "Figure 8-5 Time Model Statistics"

In this example, several statistics are much higher in the first period than in the second, such as DB time, DB CPU, and sql execute elapsed time. This information suggests that the majority of the database activity and CPU utilization are being used by the execution of SQL statements. The SQL Statistics section should be analyzed next to determine if a particular SQL statement is causing the performance degradation.

See Also:

"Time Model Statistics"

Operating System Statistics

The Operating System Statistics section compares operating system statistics in the two snapshot sets. This section provides an overall state of the operating system during each of the two periods being compared.

Service Statistics

The Service Statistics section compares services in the two snapshot sets. The services are ordered based on the difference in total DB time spent on a particular service between the two snapshot sets, and are listed in descending order.

SQL Statistics

The SQL Statistics section compares the top SQL statements in the two snapshot sets. The SQL statements are ordered based on different comparison methods, but in all cases, the top ten SQL statements with the greatest differential between the two snapshot sets are shown. These SQL statements may be possible causes for performance degradation over time, and are ordered based on the following categories:

Top 10 SQL Comparison by Execution Time
Top 10 SQL Comparison by CPU Time
Top 10 SQL Comparison by Buffer Gets
Top 10 SQL Comparison by Physical Reads
Top 10 SQL Comparison by Executions
Top 10 SQL Comparisons by Parse Calls
Complete List of SQL Text

Top 10 SQL Comparison by Execution Time

SQL statements in this section are ordered based on the difference in total DB time spent processing the SQL statement between the two snapshot sets and are listed in descending order, as shown in Figure 8-6.

Figure 8-6 Top 10 SQL Comparison by Execution Time

Description of "Figure 8-6 Top 10 SQL Comparison by Execution Time"

In this example, the time it took to execute the top two SQL statements consumed 87.93 percent of DB time in the first period, but not in the second. These two SQL statements are likely the high-load SQL statements that caused the performance degradation in the first period and should be investigated. Review the SQL statements in the Complete List of SQL Text subsection of the report and tune them, if necessary.

See Also:

Chapter 10, "Tuning SQL Statements" for information about tuning SQL statements

Top 10 SQL Comparison by CPU Time

SQL statements in this section are ordered based on the difference in CPU time spent processing the SQL statement between the two snapshot sets, and are listed in descending order.

Top 10 SQL Comparison by Buffer Gets

SQL statements in this section are ordered based on the difference in the number of total buffer cache reads or buffer gets made when processing the SQL statement between the two snapshot sets, and are listed in descending order.

Top 10 SQL Comparison by Executions

SQL statements in this section are ordered based on the difference in the number of executions per second (based on DB time) when processing the SQL statement between the two snapshot sets, and are listed in descending order.

Top 10 SQL Comparisons by Parse Calls

SQL statements in this section are ordered based on the difference in the number of total parses made when processing the SQL statement between the two snapshot sets, and are listed in descending order. Parsing is one stage in the processing of a SQL statement. When an application issues a SQL statement, the application makes a parse call to Oracle Database. Making parse calls can greatly affect the performance of a database and should be minimized as much as possible.

See Also:

Oracle Database Concepts for information about parsing

Complete List of SQL Text

This section displays the SQL text of all SQL statements listed in the SQL Statistics section.

Instance Activity Statistics

The Instance Activity Statistics section compares the statistic values of instance activity between the two snapshot sets. For each statistic, the value of the statistic is shown along with the differentials measured by DB time, elapsed time, and per transaction.

The instance activity statistics are categorized into the following sections:

Key Instance Activity Statistics
Other Instance Activity Statistics

Key Instance Activity Statistics

As the name suggests, the Key Instance Activity Statistics section displays the difference in key instance activity statistic values between the two snapshot sets.

Other Instance Activity Statistics

The Other Instance Activity Statistics sections displays the difference in instance activity for all other statistics between the two snapshot sets.

I/O Statistics

The I/O Statistics section compares the I/O operations performed on tablespaces and database files between the two snapshot sets. A drastic increase in I/O operations between the two snapshots may be the cause of performance degradation over time.

For each tablespace or database file, the difference in the number of reads, writes, and buffer cache waits (or buffer gets) are quantified as a percentage. The database files are ordered based on different comparison methods, but in all cases, the top 10 database files with the greatest differential between the two snapshot sets are shown.

I/O statistics comparison are divided into the following categories:

Tablespace I/O Statistics
Top 10 File Comparison by I/O
Top 10 File Comparison by Read Time
Top 10 File Comparison by Buffer Waits

Tablespace I/O Statistics

Tablespaces in this section are ordered by the difference in the number of normalized I/Os performed on the tablespace between the two snapshot sets, and are listed in descending order. Normalized I/Os are the sum of average reads and writes per second.

Top 10 File Comparison by I/O

Database files in this section are ordered by the difference in the number of normalized I/Os performed on the database file between the two snapshot sets, and are listed in descending order. Normalized I/Os are the sum of average reads and writes per second.

Top 10 File Comparison by Buffer Waits

Database files in this section are ordered by the difference in the number of buffer waits (waits caused during a free buffer lookup in the buffer cache) performed on the database file between the two snapshot sets, and are listed in descending order.

Advisory Statistics

The Advisory Statistics section compares program global area (PGA) memory statistics between the two snapshot sets, and is divided into the following categories:

PGA Aggregate Summary
PGA Aggregate Target Statistics

PGA Aggregate Summary

The PGA Aggregate Summary section compares the PGA cache hit ratio between the two snapshot sets.

PGA Aggregate Target Statistics

The PGA Aggregate Target Statistics section compares the key statistics related to the automatic PGA memory management between the two snapshot sets.

Wait Statistics

The Wait Statistics section compares statistics for buffer waits and enqueues between the two snapshot sets.

Wait statistics are divided into the following categories:

Buffer Wait Statistics
Enqueue Activity

Buffer Wait Statistics

The Buffer Wait Statistics section compares buffer waits between the two snapshot sets. Buffer waits happen during a free buffer lookup in the buffer cache.

Enqueue Activity

The Enqueue Activity section compares enqueue activities between the two snapshot sets. Enqueues are shared memory structures (or locks) that serialize access to database resources and can be associated with a session or transaction.

See Also:

Oracle Database Performance Tuning Guide
Oracle Database Reference for information about enqueues

Latch Statistics

The Latch Statistics section compares the number of total sleeps for latches between the two snapshot sets in descending order.

Latches are simple, low-level serialization mechanisms to protect shared data structures in the system global area (SGA). For example, latches protect the list of users currently accessing the database and the data structures describing the blocks in the buffer cache. A server or background process acquires a latch for a very short time while manipulating or looking up one of these structures. The implementation of latches is operating system dependent, particularly in regard to whether and how long a process will wait for a latch.

See Also:

Oracle Database Performance Tuning Guide for information about latches

Segment Statistics

The Segment Statistics section compares segments, or database objects (such as tables and indexes), between the two snapshot sets. The segments are ordered based on different comparison methods, but in all cases the top five segments with the greatest differential between the two snapshot sets are shown. These segments may be the causes of performance degradation over time, and are ordered based on the following categories:

Top 5 Segments Comparison by Logical Reads
Top 5 Segments Comparison by Physical Reads
Top 5 Segments by Row Lock Waits
Top 5 Segments by ITL Waits
Top 5 Segments by Buffer Busy Waits

Top 5 Segments by Row Lock Waits

Segments in this section are ordered based on the difference in the number of waits on row locks for the segment between the two snapshot sets, and are listed in descending order. Row-level locks are primarily used to prevent two transactions from modifying the same row. When a transaction needs to modify a row, a row lock is acquired.

See Also:

Oracle Database Concepts for information about row locks

Top 5 Segments by ITL Waits

Segments in this section are ordered based on the difference in the number of interested transaction list (ITL) waits for the segment between the two snapshot sets, and are listed in descending order.

See Also:

Oracle Database Performance Tuning Guide for information about ITL waits

Top 5 Segments by Buffer Busy Waits

Segments in this section are ordered based on the difference in the number of buffer busy waits for the segment between the two snapshot sets, and are listed in descending order.

See Also:

Oracle Database Performance Tuning Guide for information about buffer busy waits

Dictionary Cache Statistics

The Dictionary Cache Statistics section compares the number of get requests performed on the dictionary cache between the two snapshot sets in descending order. The difference is measured by the number of get requests per second of both total DB time and elapsed time. The dictionary cache is a part of the SGA that stores information about the database, its structures, and its users. The dictionary cache also stores descriptive information (or metadata) about schema objects, which is accessed by Oracle Database during the parsing of SQL statements.

See Also:

Oracle Database Performance Tuning Guide for information about the dictionary cache

Library Cache Statistics

The Library Cache Statistics section compares the number of get requests performed on the library cache between the two snapshot sets in descending order. The difference is measured by the number of get requests per second of both total DB time and elapsed time. The library cache is a part of the SGA that stores table information, object definitions, SQL statements, and PL/SQL programs.

See Also:

Oracle Database Performance Tuning Guide for information about the library cache

SGA Statistics

The SGA Statistics section compares SGA memory statistics between the two snapshot sets, and is divided into the following categories:

SGA Memory Summary
SGA Breakdown Difference

SGA Memory Summary

The SGA Memory Summary section summarizes the SGA memory configurations for the two snapshot sets.

SGA Breakdown Difference

The SGA Breakdown Difference section compares SGA memory usage for each of its subcomponents between the two snapshot sets. The difference is measured based on the percentage changed in the beginning and ending values of memory usage between the two snapshot sets.

init.ora Parameters

The init.ora Parameters section lists all the initialization parameter values for the first snapshot set. Any changes in the values of the initialization parameters between the two snapshot sets are listed for the second snapshot set with the changed value shown.

8 Resolving Performance Degradation Over Time

Creating Baselines

Running the Automatic Workload Repository Compare Periods Reports

Comparing a Baseline to Another Baseline or Pair of Snapshots

Comparing Two Pairs of Snapshots

Using the Automatic Workload Repository Compare Periods Reports

Report Summary

Snapshot Sets

Configuration Comparison

Load Profile

Top 5 Timed Events

Wait Events

Time Model Statistics

Operating System Statistics

Service Statistics

SQL Statistics

Top 10 SQL Comparison by Execution Time

Top 10 SQL Comparison by CPU Time

Top 10 SQL Comparison by Buffer Gets

Top 10 SQL Comparison by Physical Reads

Top 10 SQL Comparison by Executions

Top 10 SQL Comparisons by Parse Calls

Complete List of SQL Text

Instance Activity Statistics

Key Instance Activity Statistics

Other Instance Activity Statistics

I/O Statistics

Tablespace I/O Statistics

Top 10 File Comparison by I/O

Top 10 File Comparison by Read Time

Top 10 File Comparison by Buffer Waits

Advisory Statistics

PGA Aggregate Summary

PGA Aggregate Target Statistics

Wait Statistics

Buffer Wait Statistics

Enqueue Activity

Latch Statistics

Segment Statistics

Top 5 Segments Comparison by Logical Reads

Top 5 Segments Comparison by Physical Reads

Top 5 Segments by Row Lock Waits

Top 5 Segments by ITL Waits

Top 5 Segments by Buffer Busy Waits

Dictionary Cache Statistics

Library Cache Statistics

SGA Statistics

SGA Memory Summary

SGA Breakdown Difference

init.ora Parameters