In a world where our decisions are increasingly influenced by data, understanding the information we encounter has never been more essential. Dr. Talithia Williams explains the case for data literacy.
Dr. Talithia Williams, a math professor and science communicator, shares her take on why understanding data is now more important than ever. Using examples like noticing targeted ads after a conversation, Williams shows how data shapes our everyday experiences. But she also warns of the dangers, like biases in data-driven models that can lead to unfair outcomes. While AI and machine learning offer powerful insights, it’s up to us to ensure these tools are used fairly and accurately.
Dr. Williams also emphasizes that by deepening our understanding of data, we can better navigate the challenges that arise in our daily lives. She encourages us to see data not just as numbers, but as a tool for making more informed, fairer decisions in our bewilderingly complex world.
Talithia Williams: Literacy of the 21st century is more than just reading and writing. It really is understanding information and understanding data. If you've ever felt like your phone is listening to you, often it is. You may talk about green rugs and then all of a sudden you pull up Facebook or Instagram and you scroll down, it's like, "Whoa. Here's an advertisement for a green rug." That data is being transferred real time and then turned into a commercial or an ad for you.
With the rise of AI, it's easy to use that for corruption. And so as a society, when we help move everyone towards statistical literacy and data literacy, it forces us to be very objective in our analysis, to take in all different viewpoints and different pieces of information, and it empowers everyone because we can all make our own informed rational choice instead of depending on the belief of someone else or trusting the belief of someone else because we don't understand the information that's right in front of us.
My name is Dr. Talithia Williams, and I am a math professor and a science communicator. So every day we're doing math and working with numbers just to live in society, and so really all of us are statisticians and data scientists and mathematicians. If I want to get a ride to the airport and I do Uber or Lyft, I'm looking at it and I'm interacting, and it's telling me a distance, and a time, and a price. Or exercise data, right? You know, the number of steps that you took per day, or what was the range of your heart rate today? How much movement did you have? All of these things can not only help us make decisions, but can help our doctor understand when things are going wrong in our system or things are working right.
There are so many ways that math and statistics show up in the beauty of nature. The Fibonacci sequence is a wonderful example of how it shows up in the pattern of leaves on a plant or things like fractals. Fractals are seen in the human body with the way that blood vessels sort of grow and move around and expand. So many of these mathematical concepts that we study and understand have direct occurrences in the world around us in ways that are just beautiful and awe-inspiring. And so part of my excitement is to make those connections and help people see that if they walk outside, there's mathematics right in front of them that they can be really excited about.
What's really critical to understand when you analyze data is first, what does it have the power to tell you, and what does it not have the power to tell you? People think a statistical model is the end-all be-all, and it's not. If I wanted to paint a narrative or tell a story that was not true, I could very easily do that and many people would not know, especially if it feeds an underlying belief, right? Then people will take that story whether it's true or not, and latch onto it. And so it's almost like being a detective, right? When you gimme a set of data, it is my job to uncover what's hidden in it. The first thing I'm looking for is, where was the data collected? Is it a reputable source? Is it accurate? The other thing that I'm often looking for is understanding that correlation does not imply causation.
For example, if you mapped the ability to tie shoes relative to age, right, you would see that you tie them much more effectively when you go from zero to 10 years old, but it's not just because your age is increasing, right? It's because you're becoming nimble with your hands and now you can reach down and tie your shoes. And so I have to be careful that once I uncover those relationships, I understand the true sort of root cause of them.
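Editor's illustration: the shoe-tying example can be sketched as a small simulation. This is only a toy model under stated assumptions, not anything from the talk: the variable names (`ages`, `dexterity`, `skill`), the noise levels, and the linear relationships are all made up for illustration. Skill is generated from dexterity alone, yet it still correlates strongly with age, and that correlation largely disappears once dexterity is accounted for.

```python
import random
from math import sqrt
from statistics import mean

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)

def residuals(ys, xs):
    """What is left of ys after a least-squares line on xs is removed."""
    mx, my = mean(xs), mean(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return [(y - my) - slope * (x - mx) for x, y in zip(xs, ys)]

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Assumption: hand dexterity grows with age, with individual variation.
ages = [rng.uniform(0, 10) for _ in range(2000)]
dexterity = [a + rng.gauss(0, 2) for a in ages]

# Assumption: shoe-tying skill depends on dexterity only, never on age directly.
skill = [d + rng.gauss(0, 1) for d in dexterity]

# Raw correlation: age looks like a strong predictor of skill.
raw = pearson(ages, skill)

# Partial correlation: compare age and skill after removing what
# dexterity explains in each of them.
partial = pearson(residuals(ages, dexterity), residuals(skill, dexterity))

print(f"age vs. skill, raw:                     {raw:.2f}")
print(f"age vs. skill, controlling for dexterity: {partial:.2f}")
```

In this toy setup the raw correlation is strong while the partial correlation sits near zero, which is the pattern you would expect when age only matters through dexterity.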
Then I wanna know, do I have a representative sample? Often data that we collect is data that has come from society in some way, and so the same biases that we might possess end up in the dataset. We call that confounding variables, and so as a statistician, if there's a bias in the data, it's my job to find it, uncover it, pull it out, and get rid of it.
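Editor's illustration: one way to see why a representative sample matters is to compare a random sample with a skewed one. The sketch below is hypothetical throughout: the population of daily step counts, the "office" and "active" groups, and the assumption that active people are more likely to show up in the data (say, because they are more likely to wear a tracker) are all invented for the example.

```python
import random
from statistics import mean

rng = random.Random(7)  # fixed seed so the sketch is repeatable

# Hypothetical population of daily step counts: 70% "office", 30% "active".
population = (
    [("office", rng.gauss(5000, 1000)) for _ in range(7000)]
    + [("active", rng.gauss(11000, 1500)) for _ in range(3000)]
)
true_mean = mean(steps for _, steps in population)

# A simple random sample: everyone is equally likely to be included.
random_sample = rng.sample(population, 500)
random_mean = mean(steps for _, steps in random_sample)

# A skewed sample. Assumption: active people are four times as likely
# to end up in the dataset as office workers.
def is_recorded(kind):
    return rng.random() < (0.8 if kind == "active" else 0.2)

biased_sample = [(kind, steps) for kind, steps in population if is_recorded(kind)]
biased_mean = mean(steps for _, steps in biased_sample)

print(f"population mean:    {true_mean:8.0f}")
print(f"random sample mean: {random_mean:8.0f}")
print(f"skewed sample mean: {biased_mean:8.0f}")
```

The skewed sample is much larger than the random one, yet its average overstates the population's, which suggests that more data does not fix a sample that misrepresents who is in it.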
There was a healthcare algorithm that was developed so that if you were to show up and present with symptoms, it would use this historical data to predict what your treatment should be, and it was pretty accurate. It was very good. It didn't take into account patient race, which made the statisticians believe that it was a very unbiased model. Once the model was in practice, they found that it was biased because built into the data were some underlying racist practices in terms of how Black and brown patients showed up and would often get assigned to lesser treatments. The model actually perpetuated that bias. And so if we don't uncover those biases, those confounding variables, the correlations that don't imply causation beforehand, we end up implementing it in a model, putting it into a real-life situation that affects people or outcomes or decisions, and those decisions then have consequences.
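Editor's illustration: the sketch below shows, in a deliberately simplified form, how a model that never sees a protected attribute can still reproduce a historical disparity through a correlated feature. It is not the algorithm described above. The groups "A" and "B", the `clinic` feature, the size of the historical penalty, and the grouped-mean "model" are all hypothetical choices made for the example.

```python
import random
from statistics import mean

rng = random.Random(42)  # fixed seed so the sketch is repeatable

def make_patient():
    """One hypothetical historical record."""
    group = rng.choice(["A", "B"])
    need = rng.gauss(50, 10)  # true need: same distribution in both groups
    # Assumption: clinic is a proxy for group (90% of A attend "north",
    # 90% of B attend "south").
    usual = "north" if group == "A" else "south"
    other = "south" if group == "A" else "north"
    clinic = usual if rng.random() < 0.9 else other
    # Assumption: historically, group B received less treatment for equal need.
    penalty = 10 if group == "B" else 0
    treatment = need - penalty + rng.gauss(0, 3)
    return {"group": group, "need": need, "clinic": clinic, "treatment": treatment}

history = [make_patient() for _ in range(5000)]

# "Train" a toy model that is blind to group: it learns, for each clinic,
# how far historical treatment fell from need.
offset = {
    clinic: mean(p["treatment"] - p["need"] for p in history if p["clinic"] == clinic)
    for clinic in ("north", "south")
}

def predict(need, clinic):
    """Recommended treatment level; note that group is never an input."""
    return need + offset[clinic]

# Audit: average recommended treatment relative to need, per group.
shortfall = {
    group: mean(predict(p["need"], p["clinic"]) - p["need"]
                for p in history if p["group"] == group)
    for group in ("A", "B")
}
gap = shortfall["A"] - shortfall["B"]

print(f"group A, recommendation minus need: {shortfall['A']:.1f}")
print(f"group B, recommendation minus need: {shortfall['B']:.1f}")
print(f"gap between groups:                 {gap:.1f}")
```

Leaving the group label out of the inputs does not remove the gap here, because the clinic feature carries much of the same information. An audit like the last step needs the group label even though the model itself does not use it.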
When you have a sort of a data-literate component of society moving in the direction of AI and machine learning, and folks in society who couldn't care less about it, who are more maybe just consumers, we end up with people making decisions for us that we don't understand or we've had very little influence on. Because so many of our daily decisions depend on numbers and math and data, it's important that as a society, we are data literate, we're statistically literate, so that we are not only aware of the decisions that are being made on our behalf, but we're a part of them.