Data loss protection is a major concern to every industry. The software engineering process involves multiple points for potential data loss, as a number of parties are involved other than the client and software development team. The list may include external testing agencies, other software vendors, consulting agencies etc. From requirement analysis document to the source code and beyond, all the software artifacts contain information that the client may consider sensitive.
There has been considerable efforts for protecting application generated data using k-anonymity, L-diversity. A quick summary is here
But what are the options/best practices/tools available in protecting sensitive information that sit in software artifacts (e.g. analysis documents, source code, documentation etc) in a rather unstructured format? (Of course except NDA and good faith ...)