You can use backups for things other than restoring

A customer wanted to know the internal file format of Visual SourceSafe databases. (That wasn’t the actual question, but I’ve translated it into something equivalent that requires less explanation.) They explained why they wanted this information:

We are doing some code engineering analysis on our project, so we need to extract data about every single commit to the project since its creation. Things like who did the commit, the number of lines of code changed, the time of day… We can then crank on all this data to determine things like “What time of day are most bugs introduced?” and possibly even try to identify bug farms. Since our project is quite large, we found that generating all these queries against the database creates high load on the server. To reduce the load on the server, we’d like to just access the database files directly, but in order to do that, we need to know the file format.

Oh great, directly accessing a program’s internal databases while they’re live. What could possibly go wrong?

I proposed an alternative:

Take a recent backup of your project and mount it on a temporary server as read-only. Run your data collection scripts against the temporary server. This will spike the load on the temporary server, but who cares? You’re the only person using the temporary server; the main server is unaffected. After you collect all your data from the temporary server, you can then perform a much smaller number of queries against the live server to get data on the commits that took place since the last backup.
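
The two-phase collection described above might be sketched roughly like this in Python. Everything here is an assumption made for illustration: the `Commit` record, the `FetchCommits` query signature, and the `make_fake_server` stand-in are hypothetical and do not correspond to any real source control API. The point is only the shape of the approach: one heavy full-history query against the restored backup, then one small query against the live server for whatever came after.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime
from typing import Callable, Dict, Iterable, List, Optional


@dataclass(frozen=True)
class Commit:
    # Hypothetical record; a real system would expose different fields.
    commit_id: int
    author: str
    timestamp: datetime
    lines_changed: int


# Hypothetical query: return commits newer than `since`, or all if None.
FetchCommits = Callable[[Optional[datetime]], Iterable[Commit]]


def make_fake_server(commits: List[Commit]) -> FetchCommits:
    """In-memory stand-in for a server, used only to exercise the sketch."""
    def fetch(since: Optional[datetime]) -> List[Commit]:
        return [c for c in commits if since is None or c.timestamp > since]
    return fetch


def collect_history(fetch_from_backup: FetchCommits,
                    fetch_from_live: FetchCommits) -> List[Commit]:
    # Phase 1: the expensive full-history query hits only the backup copy.
    history = list(fetch_from_backup(None))
    # The newest commit in the backup marks where the live server takes over.
    cutoff = max((c.timestamp for c in history), default=None)
    # Phase 2: a much smaller query against the live server.
    seen = {c.commit_id for c in history}
    history.extend(c for c in fetch_from_live(cutoff)
                   if c.commit_id not in seen)
    return sorted(history, key=lambda c: c.timestamp)


def commits_by_hour(history: Iterable[Commit]) -> Dict[int, int]:
    # Example analysis: how many commits landed in each hour of the day.
    return dict(Counter(c.timestamp.hour for c in history))
```

If the backup turns out to be empty, the cutoff is `None` and the sketch falls back to asking the live server for everything, which is exactly the load you were trying to avoid, so it is worth checking that the backup restored correctly before running the analysis.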

Author

Raymond Chen

Raymond has been involved in the evolution of Windows for more than 30 years. In 2003, he began a Web site known as The Old New Thing which has grown in popularity far beyond his wildest imagination, a development which still gives him the heebie-jeebies. The Web site spawned a book, coincidentally also titled The Old New Thing (Addison Wesley 2007). He occasionally appears on the Windows Dev Docs Twitter account to tell stories which convey no useful information.
