I wrote this for SQLServerCentral.com, and since they do not ask to own the content of their authors, I figured I'd cross-post it on my blog:
In my experience, one of the biggest problems in building applications and databses is dealing with deadlocks, and other resource contention issues. A dilemma in the process of building a database application is the lack of load that may exist in the development environment; in the real world, there's plenty of built-in load that comes with multiple users using the database. It's not as easy in a simulated environment.
Sure, there are applications you can build to simulate the sort of activity that may exist in a customer's environment, but since everyone seems to be cash-strapped right now, I thought I would share with you one way I've come up with to test how deadlock and timeout resistent an application is.
The first step, in this method, is to find out which tables in the database are most commonly used. Assuming that the way the application has been used in your simulated environment is close to the way it is used in production environments, the below query (>SQL 2005) should suffice as a way to determine this:
DECLARE @YourDatabaseName varchar(125)
SET @YourDatabaseName='LC_OTORL'
exec('USE ' + @YourDatabaseName)
SELECT
DB_NAME(ius.database_id) AS DBName,
OBJECT_NAME(ius.object_id) AS TableName,
SUM(ius.user_seeks + ius.user_scans + ius.user_lookups) AS TimesAccessed
FROM sys.indexes i
INNER JOIN sys.dm_db_index_usage_stats ius
ON ius.object_id = i.object_id
AND ius.index_id = i.index_id
WHERE
ius.database_id = DB_ID()
AND DB_NAME(ius.database_id)=@YourDatabaseName
GROUP BY
DB_NAME(ius.database_id),
OBJECT_NAME(ius.object_id)
ORDER BY SUM(ius.user_seeks + ius.user_scans + ius.user_lookups) DESC
If your simulated environment does not provide accurate data about the most-used tables in a production environment, this method may include running the above query against one of your customers' databases.
In either case, assuming you've got accurate data from the above query about what your most frequently accessed tables are, you can build a simple TSQL-based job that leverages this information to create simulated contention in your database. This job can run every 30 seconds, minute, or several minutes. It depends on what makes sense in your test environment.
For a methodology one could use to create contention, the easiest way to do so is to disregard the long-held convention that one should abstain from holding a transaction open for too long. In other words, creating an unnecessarily long and ineffective transaction, and running them over-and-over again will wreak havoc on how your application interacts with the database, thus giving you a clear answer as to whether or not your application is deadlock resistent, or at the very least, can recover from a query timeout.
To create such an evil-looking script, I'll use a cursor to build it. Below is the concoction I've created:
DECLARE @TableName varchar(125)
DECLARE @NumberOfTimesAccessed int
DECLARE @YourDatabaseName varchar(125)
DECLARE @ThreshholdNumberOfTimesAccessed int
DECLARE @UpdateQueryText varchar(255)
DECLARE @TargetedColumnName varchar(255)
/*
*
* Use this variable to prevent locking on
* a table if it hasn't been accessed enough
* In this example, locking will not be
* simulated if the table has not been
* accessed at least 5 times
* This is a good candidate variable for parameterization,
* If you choose to make this script into a stored procedure
*
*/
SET @ThreshholdNumberOfTimesAccessed=5
SET @YourDatabaseName='ADT'
exec('USE ' + @YourDatabaseName)
BEGIN TRANSACTION
DECLARE MostUsedTables_cur CURSOR
FOR
/*
* This query gets the most recently used tables in the database.
* As time goes on, this script becomes more targeted, as the
* most frequently used tables in the database get used more and more
*/
SELECT
OBJECT_NAME(ius.object_id) AS TableName,
SUM(ius.user_seeks + ius.user_scans + ius.user_lookups) AS TimesAccessed
FROM sys.indexes i
INNER JOIN sys.dm_db_index_usage_stats ius
ON ius.object_id = i.object_id
AND ius.index_id = i.index_id
WHERE
ius.database_id = DB_ID()
AND DB_NAME(ius.database_id)=@YourDatabaseName
GROUP BY
DB_NAME(ius.database_id),
OBJECT_NAME(ius.object_id)
ORDER BY SUM(ius.user_seeks + ius.user_scans + ius.user_lookups) DESC
Open MostUsedTables_cur
Fetch NEXT FROM MostUsedTables_cur INTO @TableName, @NumberOfTimesAccessed
While (@@FETCH_STATUS = 0)
BEGIN
IF @NumberOfTimesAccessed >= @ThreshholdNumberOfTimesAccessed
BEGIN
/*
* The below query grabs an arbitrary, non-primary key column
* in the current table, and assembles an update statement
* to set the column value equal to itself, so this behavior
* should be relatively low-risk
*/
SET @TargetedColumnName = ( select top 1 co.[name] as ColumnName
FROM sys.indexes i
JOIN sys.objects o on i.object_id = o.object_id
JOIN sys.index_columns ic on ic.object_id = i.object_id
AND ic.index_id = i.index_id
JOIN sys.columns co on co.object_id = i.object_id
AND co.column_id = ic.column_id
WHERE i.is_primary_key = 0
AND o.[type] = 'U'
AND o.name = @TableName )
SET @UpdateQueryText = 'UPDATE ' + @TableName + ' SET ' + @TargetedColumnName + ' = ' + @TargetedColumnName
/*
* Commenting the below Print line out, but if you wish to debug
* this script, simply Uncomment the Print line, and
*comment the exec() line
*/
-- PRINT @UpdateQueryText
exec(@UpdateQueryText)
/*
* The below command waits 5 seconds. You can parameterize this to
* make it more configurable on-the-fly
* The effect is that the transaction stays open for
* a long time
*/
WAITFOR DELAY '00:00:05'
END
Fetch NEXT FROM MostUsedTables_cur INTO @TableName, @NumberOfTimesAccessed
END
CLOSE MostUsedTables_cur
DEALLOCATE MostUsedTables_cur
--Commit the transaction here, thus ending the long-held open transaction
COMMIT
The above script, when run as a job, and possibly parameterized into a stored procedure, can really wreak havoc on a database, particularly if your database is large. The considerations to keep in mind will revolve around the following:
1. Setting the @ThreshholdNumberOfTimesAccessed value correctly. If your database is highly used, the value of this variable should be high
2. Setting the WAITFOR DELAY value appropriately. Setting it too long will create a very long script, and if this script is being run in the context of a SQL Agent job, the script may not be done running before the next iteration begins
3. Frequency of running the script. If running as a SQL Agent job, make sure to couple the frequency of the job with how the system is being tested.
If used correctly, the above script can become another tool for appropriately testing your system, and could perhaps be a shell of your stress test methodology. Obviously, this script is not the be-all-end-all for all testing, but I believe that it is a high value tool that can definitely add value in diagnosing how well your application can recover from a bad database environment.
My struggles in understanding and learning about Object Oriented design, and the tools and knowledge I've taken from them.
Subscribe to:
Post Comments (Atom)
Followers
About Me
Search This Blog
Powered by Blogger.
No comments:
Post a Comment