

Better manage risks inherent in Big Data

By Ernest Davis | China Daily | Updated: 2017-02-13 08:00

A man tries out a VR (virtual reality) device during the ongoing Big Data Expo 2016 in Guiyang, capital of Southwest China's Guizhou province, May 25, 2016. [Photo/Xinhua]

In the last 15 years, we have witnessed an explosion in the amount of digital data available (from the Internet, social media, scientific equipment, smartphones, surveillance cameras, and many other sources) and in the computer technologies used to process it. "Big Data", as it is known, will undoubtedly deliver important scientific, technological, and medical advances. But Big Data also poses serious risks if it is misused or abused.

But having more data is no substitute for having high-quality data. For example, a recent article in Nature reports that election pollsters in the United States are struggling to obtain representative samples of the population, because they are legally permitted to call only landline telephones, whereas Americans increasingly rely on cellphones. And while one can find countless political opinions on social media, these aren't reliably representative of voters, either. In fact, a substantial share of tweets and Facebook posts about politics are computer-generated.

A Big Data program that used this search result to evaluate hiring and promotion decisions might penalize black candidates who resembled the pictures in the results for "unprofessional hairstyles," thereby perpetuating traditional social biases. And this isn't just a hypothetical possibility. Last year, a ProPublica investigation of "recidivism risk models" demonstrated that a widely used methodology to determine sentences for convicted criminals systematically overestimates the likelihood that black defendants will commit crimes in the future, and underestimates the risk that white defendants will do so.

Another hazard of Big Data is that it can be gamed. When people know that a data set is being used to make important decisions that will affect them, they have an incentive to tip the scales in their favor. For example, teachers who are judged according to their students' test scores may be more likely to "teach to the test," or even to cheat.

Similarly, college administrators who want to move their institutions up in the U.S. News & World Report rankings have made unwise decisions, such as investing in extravagant gyms at the expense of academics. Worse, they have made grotesquely unethical decisions, such as the effort by Mount Saint Mary's University to boost its "retention rate" by identifying and expelling weaker students in the first few weeks of school.

A third hazard is privacy violations, because so much of the data now available contains personal information. In recent years, enormous collections of confidential data have been stolen from commercial and government sites; and researchers have shown how people's political opinions or even sexual preferences can be accurately gleaned from seemingly innocuous online postings, such as movie reviews, even when they are published pseudonymously.

Finally, Big Data poses a challenge for accountability. Someone who feels that he or she has been treated unfairly by an algorithm's decision often has no way to appeal it, either because specific results cannot be interpreted, or because the people who have written the algorithm refuse to provide details about how it works. And while governments or corporations might intimidate anyone who objects by describing their algorithms as "mathematical" or "scientific," they, too, are often awed by their creations' behavior. The European Union recently adopted a measure guaranteeing people affected by algorithms a "right to an explanation"; but only time will tell how this will work in practice.

When people who are harmed by Big Data have no avenues for recourse, the results can be toxic and far-reaching, as data scientist Cathy O'Neil demonstrates in her recent book Weapons of Math Destruction.

The good news is that the hazards of Big Data can be largely avoided. But they won't be unless we zealously protect people's privacy, detect and correct unfairness, use algorithmic recommendations prudently, and maintain a rigorous understanding of algorithms' inner workings and the data that informs their decisions.

The author is a professor of computer science at the Courant Institute of Mathematical Sciences, New York University.

Project Syndicate
