What Is Data Purging?

By | In Database Administration | September 29th, 2015

If you’ve worked with database technologies for any length of time, you may have heard of a process known as data purging.

Although it might seem that an organization would want to keep all its data for data mining and analysis, there is a finite amount of storage available to warehouse those bits and bytes. When data is not purged regularly, the tables in a database can grow so large that performance begins to deteriorate. Purging is an important aspect of database performance tuning.

Techopedia defines it as:

Methods that permanently erase and remove data from a storage space. There are many different strategies and techniques for data purging, which is often contrasted with data deletion. Deletion is often seen as a temporary preference, whereas purging removes the data permanently and opens up memory or storage space for other uses.

After an interview I did with Plotting Success, Abe Selig says:

Purging is just what it sounds like — completely erasing data from your system. Unlike a simple delete function, purging renders the information completely unsalvageable once it’s been purged.

Data Retention Policies

Most organizations employ a framework for defining their data by need or age. An organization may also have explicit data retention policies based on specific business and legal requirements, such as the Health Insurance Portability and Accountability Act of 1996 (HIPAA) or the SarbanesOxley Act of 2002 in the United States. Some government agencies also have data purging policies, some of which have been called into question recently.

Data purging can be an automatic process, but there are some instances in which administrators have to manually purge data from the database. The process can differ incrementally between systems. Oracle may have slightly different steps than SQL Server, for example.

Although data purging may be a simple process, any changes can affect other parts of the remaining database system, such as tables. It is important to ensure the system continues functioning properly. Purging can be particularly problematic for relational database systems that may rely on some pieces of data across tables.

Money and Resources Saved

Data purging, if properly completed on a routine basis, can accelerate the database reporting process and ultimately enable an organization to save money and other resources, such as hardware.

Does your organization need assistance in assessing the performance gains and cost savings associated with creating an effective data purging routine? Or do you need a performance tuning analysis? Datavail can help. Contact us to find out how.

Contact Us
Chuck Ezell
Vice President – Data Integration Practice
With almost 20 years of development experience, Chuck Ezell leads the Development, Tuning and Automation Practice (DTA). Bringing his knowledge from supporting some of the world's largest infrastructures in database & application performance tuning. With a broad background in process automation, database application development, optimizing database performance and many DevOps tools. Although he and his team excel at optimizing and customizing Oracle systems, Chuck works with .NET, T-SQL, C#, PHP, javascript, Java, ANSI SQL, PL/SQL, APEX, and many other languages on systems by Oracle, HP, IBM, Linux and Windows. He uses SSMS, Oracle OEM, AppDynamics, Splunk, Visual VM along with many others, for performance tuning operations.

Leave a Reply

Your email address will not be published.
Required fields are marked (*).