Anonymous Genetic Profiles Aren't Completely Anonymous

Genetic codes and DNA: Human genomes are a boon to medical research, but pose privacy risks.

(Image credit: JohnGoode via flickr | http://bit.ly/1eqICwJ)

(ISNS) -- Today it is easy for long-forgotten photos or personal information to live online indefinitely. But what if the most personal data about you – your genetic makeup – lived online? An individual's genome contains a vast amount of information about inherited diseases and physical traits, all stored in strands of DNA. The consequences of being able to search, cross-reference, and analyze this information are profound, experts say.

Hundreds of thousands of people have already had their genomes mapped in the U.S., either for research studies or through one of several private companies offering this service. In many cases, people want to know their risk of medical maladies like heart attack or breast cancer, or to identify the specific gene causing a disorder in their family. What these pioneers of personal genome mapping might not know, though, is how easily re-identifiable their anonymous data can be. And if that is the case, the question might not be whether to share, but rather how to regulate and protect what is being shared.

Yaniv Erlich, a fellow at the Whitehead Institute for Biomedical Research in Cambridge, Mass., brings a unique but apt background to genetic privacy research: He is a former hacker who was hired to expose weaknesses in the security systems of banks and credit card companies. He and his team took a similar approach to illustrate vulnerabilities within genetic databases. Their study, published in Science last January, recovered the identities of nearly 50 anonymous participants in the 1000 Genomes Project, and they did it using free, publicly accessible Internet resources.

From the narrowed-down list of about 12 males, the team was able to use Google and free services such as PeopleFinder.com to track down the owner of the unknown genome. A similar technique has been used by individuals who were adopted or conceived from sperm donation to trace their biological families. As more genetic data reaches online databases, Erlich said, new threats to privacy are keeping pace.

For example, in the future, it could be the norm for users to send their genetic data through a cloud service as an added precaution. Kristin Lauter, head of the cryptography research group at Microsoft Research, likens this method, called homomorphic encryption, to “not having to trust your jeweler,” since users would hand over their precious information, and allow a private service like hers to do calculations on it in an encrypted form.
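To make the "jeweler" analogy concrete, here is a minimal sketch of computing on encrypted data. The article does not say which scheme Lauter's group uses, so this example swaps in the textbook Paillier cryptosystem, which supports only addition of encrypted values. The tiny primes, the function names, and the allele-count scenario are all assumptions chosen for illustration. Nothing here is secure enough for real genetic data.

```python
import math
import secrets

# Toy Paillier cryptosystem (additively homomorphic).
# ASSUMPTION: illustrative only; real keys use primes of 1024+ bits.

def keygen(p=1789, q=1861):
    """Return (public modulus n, private key) from two small primes."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    mu = pow(lam, -1, n)  # modular inverse (Python 3.8+)
    return n, (lam, mu)

def encrypt(n, m):
    """Encrypt an integer 0 <= m < n under the public modulus n."""
    n2 = n * n
    while True:
        r = secrets.randbelow(n - 1) + 1  # random blinding value in [1, n-1]
        if math.gcd(r, n) == 1:
            break
    # With generator g = n + 1: c = g^m * r^n mod n^2
    return (pow(n + 1, m, n2) * pow(r, n, n2)) % n2

def decrypt(n, priv, c):
    """Recover the plaintext using the private key."""
    lam, mu = priv
    x = pow(c, lam, n * n)
    return ((x - 1) // n * mu) % n

def add_encrypted(n, c1, c2):
    """Multiplying ciphertexts adds the hidden plaintexts (mod n)."""
    return (c1 * c2) % (n * n)

if __name__ == "__main__":
    n, priv = keygen()
    # Hypothetical data: copies of a risk allele at four genome positions.
    allele_counts = [0, 1, 2, 1]
    ciphertexts = [encrypt(n, m) for m in allele_counts]

    # The "cloud service" sums the values without ever decrypting them.
    total_ct = ciphertexts[0]
    for ct in ciphertexts[1:]:
        total_ct = add_encrypted(n, total_ct, ct)

    # Only the key holder can read the result.
    print(decrypt(n, priv, total_ct))  # 4
```

In this sketch the service only ever sees ciphertexts, yet the user who holds the private key recovers the correct sum. Schemes that also support multiplication on encrypted data follow the same idea but are considerably more involved.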