Data firm left detailed profiles of 48 million people on a publicly accessible website

In the wake of Facebook's Cambridge Analytica scandal, another data firm was discovered to have amassed similar user profiles of millions of people.


A report published Wednesday reveals how a data firm built psychographic profiles on 48 million people, using data from Facebook, Twitter, LinkedIn, Zillow, and others—and then left that trove of data unprotected on a cloud storage repository.

The data was compiled by LocalBlox, a firm that “automatically crawls, discovers, extracts, indexes, maps and augments data in a variety of formats from the web and from exchange networks” to build consumer profiles that it sells to companies.  

In February, Chris Vickery, an ethical data breach hunter and director of cyber risk research at the security firm UpGuard, was able to access millions of these profiles on an unlisted and unprotected Amazon Web Services S3 bucket. The bucket contained a 151.3-gigabyte file that, when decompressed, amounted to a 1.2 terabyte that contained the user profiles. It was aptly named “final_people_data_2017_5_26_48m.json.”

“In the wake of the Facebook/Cambridge Analytica debacle, the importance of massive sets of psychographic data is becoming more and more apparent,” UpGuard’s report reads. “The exposed LocalBlox dataset combines standard personal information like name and address, with data about the person’s internet usage, such as their LinkedIn histories and Twitter feeds. This combination begins to build a three-dimensional picture of every individual affected—who they are, what they talk about, what they like, even what they do for a living—in essence, a blueprint from which to create targeted persuasive content, like advertising or political campaigning.”

The consumer profiles amassed by LocalBlox vary in level of detail. Much of the information can be harvested from public sources—the email address listed on your Facebook profile, or the city of residence shown on your Twitter page. Some of the information is believed to have been collected from non-public sources, such as purchased marketing data.

In a ZDNet article published Wednesday, LocalBlox’s chief technology officer Ashfaq Rahman said most of the data discovered by Vickery was fabricated for internal tests, and that Vickery had “hacked in” to the publicly accessible repository. But Vickery had informed LocalBlox that he accessed the repository after discovering the vulnerability in February, and it was reportedly secured soon after.

“Rahman would not say why he restricted the bucket’s permissions hours later,” reads the ZDNet article.

According to Rahman, “no other individual is believed to have accessed this file from the S3 bucket.”

LocalBlox didn’t break any laws in its harvesting of consumer data, though it’s not clear whether it violated the terms of websites like LinkedIn, Facebook, and Zillow, all of which explicitly prohibit data scraping.

In a 2013 article, LocalBlox’s president Sabira Arefin said it’s “up to the individual sites and system to determine the terms and conditions and then enforce any security mechanism in place if they want to prevent scraping.”

Vickery said that companies like LocalBlox should be more responsible in the way they handle and stores people’s data.

“Concentrating millions of people's details can become by its very nature a weaponized thing, and something that can lead to a lot of harm,” Vickery said.

UpGuard’s report concludes:

“The profitability gained by data must come with the responsibility of protecting its integrity and privacy. Cloud storage itself provides functionality and speed at a reasonable cost, but cloud assets require careful configuration—the thin line between private and public can be erased with the flip of a single switch. The lack of controls around common IT processes are what allow critical errors like this to slip into production, eroding the privacy of millions of people.”

'Upstreamism': Your zip code affects your health as much as genetics

Upstreamism advocate Rishi Manchanda calls us to understand health not as a "personal responsibility" but a "common good."

Sponsored by Northwell Health
  • Upstreamism tasks health care professionals to combat unhealthy social and cultural influences that exist outside — or upstream — of medical facilities.
  • Patients from low-income neighborhoods are most at risk of negative health impacts.
  • Thankfully, health care professionals are not alone. Upstreamism is increasingly part of our cultural consciousness.
Keep reading Show less

Meet the Bajau sea nomads — they can reportedly hold their breath for 13 minutes

The Bajau people's nomadic lifestyle has given them remarkable adaptions, enabling them to stay underwater for unbelievable periods of time. Their lifestyle, however, is quickly disappearing.

Wikimedia Commons
Culture & Religion
  • The Bajau people travel in small flotillas throughout the Phillipines, Malaysia, and Indonesia, hunting fish underwater for food.
  • Over the years, practicing this lifestyle has given the Bajau unique adaptations to swimming underwater. Many find it straightforward to dive up to 13 minutes 200 feet below the surface of the ocean.
  • Unfortunately, many disparate factors are erasing the traditional Bajau way of life.
Keep reading Show less

Golden blood: The rarest blood in the world

We explore the history of blood types and how they are classified to find out what makes the Rh-null type important to science and dangerous for those who live with it.

Abid Katib/Getty Images
Surprising Science
  • Fewer than 50 people worldwide have 'golden blood' — or Rh-null.
  • Blood is considered Rh-null if it lacks all of the 61 possible antigens in the Rh system.
  • It's also very dangerous to live with this blood type, as so few people have it.
Keep reading Show less

Scientists create a "lifelike" material that has metabolism and can self-reproduce

An innovation may lead to lifelike evolving machines.

Shogo Hamada/Cornell University
Surprising Science
  • Scientists at Cornell University devise a material with 3 key traits of life.
  • The goal for the researchers is not to create life but lifelike machines.
  • The researchers were able to program metabolism into the material's DNA.
Keep reading Show less