Showing posts with label Big Data. Show all posts

Tuesday, January 7, 2020

UK Government Plans To Open Public Transport Data To Third Parties; Forbes, December 31, 2019

Simon Chandler, Forbes; UK Government Plans To Open Public Transport Data To Third Parties

"The launch is a significant victory for big data. Occasionally derided as a faddish megatrend or empty buzzword, the announcement of the Bus Open Data Service shows that national governments are willing to harness masses of data and use them to create new services and economic opportunities. Similarly, it's also a victory for the internet of things, insofar as real-time data from buses will be involved in providing users with up-to-date travel info.

That said, the involvement of big data inevitably invites fears surrounding privacy and surveillance."

Friday, November 9, 2018

In Favor of the Caselaw Access Project; The Harvard Crimson, November 7, 2018

The Crimson Editorial Board, The Harvard Crimson; In Favor of the Caselaw Access Project

"We hope that researchers will use these court opinions to further advance academic scholarship in this area. In particular, we hope that computer programmers are able to take full advantage of this repository of information. As Ziegler noted, no lawyer will be able to take full advantage of the millions of pages in the database, but computers have an advantage in this regard. Like Ziegler, we are hopeful that researchers using the database will be able to learn more about less understood aspects of the legal system — such as how courts influence each other and deal with disagreements. Those big-picture questions could not have been answered as well without the information provided by this new database.

This project is a resounding success for the Harvard Library, which happens also to be looking for a new leader. We hope that the person hired for the job will be similarly committed to projects that increase access to information — a key value that all who work in higher education should hold near and dear. In addition to maintaining the vast amounts of histories and stories already in the system, Harvard’s libraries should seek to illuminate content that may have been erased or obscured. There is always more to learn."

Thursday, November 8, 2018

Harvard Converts Millions of Legal Documents into Open Data; Government Technology, November 2, 2018

Theo Douglas, Government Technology; Harvard Converts Millions of Legal Documents into Open Data

[Kip Currier: Discovered the recent launch of this impressive Harvard University-anchored Caselaw Access Project, while updating a lecture for next week on Open Data.

The free site provides access to highly technical data, full text cases, and even "quirky" but fascinating legal info...like the site's Gallery, highlighting instances in which "witchcraft" is mentioned in legal cases throughout the U.S.

Check out this new site...and spread the word about it!]

"A new free website spearheaded by the Library Innovation Lab at the Harvard Law School makes available nearly 6.5 million state and federal cases dating from the 1600s to earlier this year, in an initiative that could alter and inform the future availability of similar areas of public-sector big data.

Led by the Lab, which was founded in 2010 as an arena for experimentation and exploration into expanding the role of libraries in the online era, the Caselaw Access Project went live Oct. 29 after five years of discussions, planning and digitization of roughly 100,000 pages per day over two years.

The effort was inspired by the Google Books Project; the Free Law Project, a California 501(c)(3) that provides free, public online access to primary legal sources, including so-called “slip opinions,” or early but nearly final versions of legal opinions; and the Legal Information Institute, a nonprofit service of Cornell University that provides free online access to key legal materials."

Sunday, March 4, 2018

China has shot far ahead of the US on deep-learning patents; Quartz, March 2, 2018

Echo Huang, Quartz; China has shot far ahead of the US on deep-learning patents

"China is outdoing the US in some kinds of AI-related intellectual property, according to a report published in mid-February by US business research firm CB Insights. The number of patents with the words “artificial intelligence” and “deep learning” published in China has grown faster than those published in the US, particularly in 2017, the firm found. Publication is a step that comes after applications are filed but before a patent is granted. The firm looked at data from the European patent office.

When it comes to deep learning—an advanced subset of machine learning, which uses algorithms to identify complex patterns in large amounts of data—China has six times more patent publications than the US, noted the report (pdf, p.7)...

...[W]hen it comes to patents using the term “machine learning,” often conflated with the term AI, China still lags behind. Searching patents for “machine learning” found the US had 882 related patent publications while China had 77 in 2017."

Thursday, March 1, 2018

Professor Tells UN, Governments Of Coming “Tsunami” Of Data And Artificial Intelligence; Intellectual Property Watch, February 21, 2018

William New, Intellectual Property Watch; Professor Tells UN, Governments Of Coming “Tsunami” Of Data And Artificial Intelligence

"[Prof. Shmuel (Mooly) Eden of the University of Haifa, Israel] said this fourth revolution in human history is made up of four factors. First, computing power is at levels that were unimaginable. This power is what makes artificial intelligence now possible. The smartphone in your hand has 1,000 times the components of the first rocket to the moon, he said, which led to a chorus of “wows” from the audience.

Second is big data. Every time you speak on the phone or go on the internet, someone records it, he said. The amount of data is unlimited. Eden said he would be surprised if we use 2 percent of the data we generate, but in the future “we will.”

Third is artificial intelligence (AI). No one could analyse all of that data, so AI came into play.

Fourth is robots. He noted that they don’t always look like human forms. Most robots are just software doing some function...

Eden ended by quoting a hero of his, former Israeli Prime Minister Shimon Peres, who told him: “Technology without ethics is evil. Ethics without technology is poverty. That’s why we have to combine the two.”

Eden challenged the governments, the UN and all others to think about how to address this rapid change and come up with ideas.

He challenged the governments, the UN and all others to think about how to address this rapid change and come up with ideas. Exponentially."

Sunday, July 16, 2017

How can we stop algorithms telling lies?; Guardian, July 16, 2017

Cathy O'Neil, Guardian;

How can we stop algorithms telling lies?

[Kip Currier: Cathy O'Neil is shining much-needed light on the little-known but influential power of algorithms on key aspects of our lives. I'm using her thought-provoking 2016 Weapons of Math Destruction: How Big Data Increases Inequality And Threatens Democracy as one of several required reading texts in my Information Ethics graduate course at the University of Pittsburgh's School of Computing and Information.]

"A proliferation of silent and undetectable car crashes is harder to investigate than when it happens in plain sight.

I’d still maintain there’s hope. One of the miracles of being a data sceptic in a land of data evangelists is that people are so impressed with their technology, even when it is unintentionally creating harm, they openly describe how amazing it is. And the fact that we’ve already come across quite a few examples of algorithmic harm means that, as secret and opaque as these algorithms are, they’re eventually going to be discovered, albeit after they’ve caused a lot of trouble.

What does this mean for the future? First and foremost, we need to start keeping track. Each criminal algorithm we discover should be seen as a test case. Do the rule-breakers get into trouble? How much? Are the rules enforced, and what is the penalty? As we learned after the 2008 financial crisis, a rule is ignored if the penalty for breaking it is less than the profit pocketed. And that goes double for a broken rule that is only discovered half the time...

It’s time to gird ourselves for a fight. It will eventually be a technological arms race, but it starts, now, as a political fight. We need to demand evidence that algorithms with the potential to harm us be shown to be acting fairly, legally, and consistently. When we find problems, we need to enforce our laws with sufficiently hefty fines that companies don’t find it profitable to cheat in the first place. This is the time to start demanding that the machines work for us, and not the other way around."

Monday, June 19, 2017

Amazon has a patent to keep you from comparison shopping while you’re in its stores; Washington Post, June 16, 2017

Brian Fung, Washington Post; Amazon has a patent to keep you from comparison shopping while you’re in its stores

"Amazon was awarded a patent May 30 that could help it choke off a common issue faced by many physical stores: Customers’ use of smartphones to compare prices even as they walk around a shop. The phenomenon, often known as mobile “window shopping,” has contributed to a worrisome decline for traditional retailers.

But Amazon now has the technology to prevent that type of behavior when customers enter any of its physical stores and log onto the WiFi networks there. Titled “Physical Store Online Shopping Control,” Amazon’s patent describes a system that can identify a customer’s Internet traffic and sense when the smartphone user is trying to access a competitor’s website. (Amazon chief executive Jeffrey P. Bezos is also the owner of The Washington Post.)...

Just because a company wins a patent doesn’t necessarily mean it’ll use it. Sometimes companies file for patents to ensure they have the option to put the idea into practice later, or to keep other companies from implementing the concept. So, a system such as the kind Amazon’s envisioning might never be rolled out. And even if it is, chances are shoppers could still get around the system by staying off the in-store WiFi."

Monday, March 6, 2017

Patent Data – The Modern Investor’s Crystal Ball; Intellectual Property Watch, March 6, 2017

Sirena Rubinoff, Intellectual Property Watch;

Patent Data – The Modern Investor’s Crystal Ball

"What if there was a crystal ball that could tell you where and when to invest your money? It sounds like science fiction, but engineers at MIT have actually developed a formula that can predict future events in tech development. The formula is based on a combination of big data from patent applications and smart analytics which, when put together, can estimate how fast a technology is advancing.

Why patent applications?

If you want to know where technology is headed, a great place to look is in a patent application database like the USPTO. One of the qualifications for getting a patent granted is “novelty,” which means new, similar innovations won’t appear anywhere else. Once enough data is collected from the database, it can be used to map out and predict unique advancements in specific areas of technology."

Friday, February 24, 2017

Second Internet of Things National Institute; American Bar Association, Washington, DC, May 10-11, 2017

Second Internet of Things National Institute

"A game-changer has emerged for businesses, policymakers, and lawyers, and it's called the "Internet of Things" (IoT). It's one of the most transformative and fast-paced technology developments in recent years. Billions of vehicles, buildings, process control devices, wearables, medical devices, drones, consumer/business products, mobile phones, tablets, and other "smart" objects are wirelessly connecting to, and communicating with, each other - and raising unprecedented legal and liability issues.

Recognized as a top new law practice area, and with global spending projected to hit $1.7 trillion by 2020, IoT will require businesses, policymakers, and lawyers (M&A, IP, competition, litigation, health law, IT/outsourcing, and privacy/cybersecurity) to identify and address the escalating legal risks of doing business in a connected world. Join us in Washington, D.C., on May 10 - 11, 2017, for our second IoT National Institute, which will feature:

Overviews and demos of the powerful technology driving the legal and liability issues

Practical guidance and the latest insights on the product liability, mass tort, big data, privacy, data security, intellectual property, cloud, and regulatory issues raised by IoT

Dynamic new additions: a mock trial, a tabletop exercise, a corporate counsel roundtable, and niche issue mini-updates.

Two full days of CLE credit (including ethics credit), plus two breakfasts, two lunches (with keynote speakers), and a cocktail reception.

Our distinguished faculty includes prominent legal and technical experts and thought-leaders from companies, government entities, universities, think-tanks, advocacy organizations, and private practice. Organized by the American Bar Association's Section of Science & Technology Law, the IoT National Institute offers an unparalleled learning and networking opportunity. With billions of devices and trillions of dollars in spending, IoT is a rapidly growing market that everyone wants to get in on."

Tuesday, August 9, 2016

There’s No Such Thing as Innocuous Personal Data; Slate, 8/8/16

Elizabeth Weingarten, Slate; There’s No Such Thing as Innocuous Personal Data:

"The way you walk can be as unique as your fingerprint; a couple of studies show that gait can help verify the identity of smartphone users. And gait can also predict whether someone is at risk for dementia. Seemingly useless pieces of data may let experts deduce or predict certain behaviors or conditions now, but the big insights will come in the next few years, when companies and consumers are able to view a tapestry of different individual data points and contrast them with data across the entire population. That’s when, according to a recent report from Berkeley’s Center for Long-Term Cybersecurity, we’ll be able to “gain deep insight into human emotional experiences.”
But it’s the data that you’re creating now that will fuel those insights. Far from meaningless, it’s the foundation of what you (and everyone else) may be able to learn about your future self."

Sunday, July 24, 2016

Uncle Sam Wants You — Or at Least Your Genetic and Lifestyle Information; New York Times, 7/23/16

Robert Pear, New York Times; Uncle Sam Wants You — Or at Least Your Genetic and Lifestyle Information:

"People can sign up through academic medical centers at Columbia University, Northwestern University in Illinois, the University of Arizona and the University of Pittsburgh, each of which is working with local partners. Columbia, for example, is collaborating with NewYork-Presbyterian Hospital, Harlem Hospital and Weill Cornell Medicine.
Participants will be recruited to reflect the geographic, racial, ethnic and socioeconomic diversity of the nation. To help achieve that goal, officials have enlisted community health centers, where more than 90 percent of patients have annual incomes less than twice the poverty level (less than $23,760 for an individual). About one-third of health center patients are Latinos, and about one-fourth are African-Americans.
Officials said they wanted patients to be partners in the research, not just “human subjects.” To that end, patients will have access to all the information about themselves, including laboratory and genetic test results. Doctors could eventually use the data to shape treatment for an individual patient, rather than using standard treatments that may not work for everyone. Patients will help guide the research, sitting on its steering committee and advisory board."

Tuesday, May 10, 2016

Biden calls for open-data research; Politico, 5/10/16

David Pittman, Politico; Biden calls for open-data research:

"BIDEN GETS TOUGH AT HEALTH DATAPALOOZA:
Vice President Joe Biden issued some of his strongest words yet in support of sharing clinical and research data, in remarks to data scientists Monday at Health Datapalooza. He said science was at an inflection point, with the ease of genomic sequencing, massive increases in computing power and digitization of health records. “You told me that this is the way we can make great progress, by sharing more data, breaking down the silos,” Biden told a standing-room only crowd in the ballroom of the Grand Hyatt. “Imagine what we could, you could do to help in the fight against cancer if you had access to millions of cancer pathologies, genomic sequences, family histories and treatment outcomes.”
Calls for a national research database:
The country needs a way to share and make public underlying data from medical research, Biden said, a one point criticizing the New Journal of Medicine editor for saying such policy would breed “data parasites.”
Flying records cross country:
The Biden family had to literally fly Beau’s medical records to Houston’s MD Anderson Cancer Center because EHR systems couldn’t talk to each out. And this was the vice president’s son. “We spent $35 billion to avoid that kind of thing from happening.”"

Saturday, March 12, 2016

Analytics key to agencies in big data explosion; FedScoop, 3/10/16

Billy Mitchell, FedScoop; Analytics key to agencies in big data explosion:

Lots of leading edge info and thought-provoking commentary from an impressive array of speakers at FedScoop and Hitachi's 3/10/16 Social Innovation Summit I attended at the Newseum in D.C. Good overview of Summit by FedScoop's Billy Mitchell:
"The federal government has seen an explosion of data at its disposal and has needed powerful analytics tools to put it to use, federal IT officials and industry executives said.
A single statistic drove the bulk of the conversation at Thursday’s Hitachi Data Systems Social Innovation Summit, produced by FedScoop: By 2020, analysts predict there will be more than 30 billion network-connected digital devices globally, all producing unprecedented volumes of data in a concept called the Internet of Things.
“Those devices, whether it be the phones we use, the cars we drive in, the medical devices used to keep us healthy, the buildings we work in, the ships and airplanes that protect our country, they’re all generating data, and it’s a question of how do we take that data and really put it to use?” said Mike Tanner, president and CEO of federal for Hitachi Data Systems...
While that data brings with it endless opportunities, it also complicates things, particularly because humans alone are unable to do much with such massive data sets."

Wednesday, February 24, 2016

Sara Fine Institute presents: Christine Borgman, "Big Data, Open Data, and Scholarship": Mon Feb 29th 3.00pm - 5.00pm, University of Pittsburgh

Sara Fine Institute presents: Christine Borgman, "Big Data, Open Data, and Scholarship" :

"Monday Feb 29th 3.00pm - 5.00pm
University Club, Ballroom A, 123 University Pl, Pittsburgh, PA 15260
"Big Data, Open Data, and Scholarship"
by Christine L. Borgman
Distinguished Professor & Presidential Chair in Information Studies
University of California, Los Angeles
Scholars gathered data long before the emergence of books, journals, libraries, publishers, or the Internet. Until recently, data were considered part of the process of scholarship, essential but largely invisible. In the “big data” era, the products of these research processes have become valuable objects in themselves to be captured, shared, reused, and sustained for the long term. Data also has become contentious intellectual property to be protected, whether for proprietary, confidentiality, competition, or other reasons. Public policy leans toward open access to research data, but rarely with the public investment necessary to sustain access. Enthusiasm for big data is obscuring the complexity and diversity of data in scholarship and the challenges for stewardship. Data practices are local, varying from field to field, individual to individual, and country to country. This talk will explore the stakes and stakeholders in research data and implications for policy and practice.
Join us Feb. 29, 2016 at 3pm at the University of Pittsburgh’s University Club (Ballroom A). This event is free to attend and no RSVP is required. A reception will follow."

Wednesday, February 17, 2016

Balancing Benefits and Risks of Immortal Data Participants’ Views of Open Consent in the Personal Genome Project; Hastings Center Report, 12/17/15

Oscar A. Zarate, Julia Green Brody, Phil Brown, Monica D. Ramirez-Andreotta, Laura Perovich andJacob Matz, Hastings Center Report; Balancing Benefits and Risks of Immortal Data: Participants’ Views of Open Consent in the Personal Genome Project:

"Abstract
An individual's health, genetic, or environmental-exposure data, placed in an online repository, creates a valuable shared resource that can accelerate biomedical research and even open opportunities for crowd-sourcing discoveries by members of the public. But these data become “immortalized” in ways that may create lasting risk as well as benefit. Once shared on the Internet, the data are difficult or impossible to redact, and identities may be revealed by a process called data linkage, in which online data sets are matched to each other. Reidentification (re-ID), the process of associating an individual's name with data that were considered deidentified, poses risks such as insurance or employment discrimination, social stigma, and breach of the promises often made in informed-consent documents. At the same time, re-ID poses risks to researchers and indeed to the future of science, should re-ID end up undermining the trust and participation of potential research participants.
The ethical challenges of online data sharing are heightened as so-called big data becomes an increasingly important research tool and driver of new research structures. Big data is shifting research to include large numbers of researchers and institutions as well as large numbers of participants providing diverse types of data, so the participants’ consent relationship is no longer with a person or even a research institution. In addition, consent is further transformed because big data analysis often begins with descriptive inquiry and generation of a hypothesis, and the research questions cannot be clearly defined at the outset and may be unforeseeable over the long term. In this article, we consider how expanded data sharing poses new challenges, illustrated by genomics and the transition to new models of consent. We draw on the experiences of participants in an open data platform—the Personal Genome Project—to allow study participants to contribute their voices to inform ethical consent practices and protocol reviews for big-data research."

Friday, February 5, 2016

Cops will adapt big data platform to secure Super Bowl; FedScoop.com, 2/5/16

Alex Koma, FedScoop.com; Cops will adapt big data platform to secure Super Bowl:

"Law enforcement agents and first responders in Northern California are turning to some software that harnesses the power of data to help keep fans safe at the Super Bowl, one of the most daunting security challenges of the year.
The state first started using the program last year — known as the “California Common Operating Picture” and powered by Haystax Technology’s “Constellation” analytics platform — and now law enforcement agencies of all shapes and sizes are preparing to use it to collect thousands of pieces of data about potential threats ahead of the big matchup in Santa Clara’s Levi’s Stadium.
In a briefing here at Haystax’s headquarters, Chief Technology Officer Bryan Ware laid out just how federal, state and local agents across the region have been using the system to keep a close eye on potential trouble makers and targets ahead of the Super Bowl, and how 13 different monitoring centers run by various government agencies will use it the night of the game to stay ahead of any security concerns."

Tuesday, March 17, 2015

Pitt, CMU and UPMC hope to remake health care via new big data alliance; Pittsburgh Post-Gazette, 3/16/15

Bill Toland, Pittsburgh Post-Gazette; Pitt, CMU and UPMC hope to remake health care via new big data alliance:

"Pittsburgh is making a big bet on big data.
UPMC, the University of Pittsburgh and Carnegie Mellon University on Monday announced the formation of the Pittsburgh Health Data Alliance to “revolutionize health care and wellness” by using data to detect potential outbreaks as well as create health care innovations that will spawn spinoff companies.
The clinical goal, the leaders of the three institutions said, is to remake health care so that it is at once more computerized, yet more personalized, using millions of gigabytes of accumulated health records to predict and treat patients’ health issues in a manner far more specific than is possible today.
And the business development goal, the leaders said, is no less than a Pittsburgh-based “moonshot” for health information technology, one that could make Pittsburgh the global epicenter for such research.
If the alliance unfolds as outlined, it someday could rival the scope of the nation’s largest university-led data-sharing projects (such as the ongoing Dartmouth Atlas health policy research partnership with Dartmouth College) and its biggest artificial clinical intelligence projects (such as the IBM Watson team’s foray into the health care realm)."