6 dataset lists curated by data scientists

The list is a collection of six great dataset lists from both famous data scientists and those who aren’t well-known.

Reference: 6 dataset lists curated by data scientists


Government develops software differently

… I suspect their approach works for projects like flying astronauts to the moon, or shooting ICBMs at the Soviet Union, where there’s no chance to try things out before you go “live.” They have to work the first time. And in some cases security is a huge issue, and you don’t want too many people to understand the whole project.

Norway To Digitize All Norwegian Books, Allowing Domestic IP Addresses To Read All Of Them, Irrespective Of Copyright Status

The National Library of Norway is planning to digitize all the books by the mid 2020s. Yes. All. The. Books. In Norwegian, at least. Hundreds of thousands of them. Every book in the library’s holdings.

If you happen to be in Norway, as measured by your IP address, you will be able to access all 20th-century works, even those still under copyright. Non-copyrighted works from all time periods will be available for download.


10 Online Tools for Creating Infographics and Charts

Check out the following article about 10 online tools for creating infographics and charts.

10 Online Tools for Creating Infographics and Charts

Data Mining Exposes Embarrassing Problems for Massive Open Online Courses

Earlier this month, researchers at the University of Pennsylvania reported that the online classes it offered had failed miserably. Only about half of the students who registered ever viewed a lecture and only 4 percent completed a course.

Today, Christopher Brinton at Princeton University and a few pals offer their view. These guys have studied the behaviour in online discussion forums of over 100,000 students taking massive open online courses (or MOOCs).

Brinton and co say they’ve found various correlations with the drop. One of these is the amount of peer-graded homework on the course, a factor which moderately increases the rate of decline. More worrying is the discovery that teacher involvement in a thread seems to accelerate the decline (although it also increases the number of posts).

After Setbacks, Online Courses Are Rethought

A recent University of Pennsylvania study of a million users of massive open online courses (MOOCs) found that, on average, only about 50 percent of those who registered for a course ever viewed a lecture, and only about 4 percent completed the courses. Although MOOCs were started with the goal of providing courses for students in poor countries with little access to higher education, the study found that about 80 percent of those taking MOOCs had already earned a degree of some kind. In response to some of the initial shortcomings of several MOOC programs, their designers are making changes to broaden their appeal. For example, edX is producing videos to use in some high school Advanced Placement classes, and Coursera is experimenting with using its courses, along with a facilitator, in small discussion classes at some U.S. consulates. In addition, Udacity is revamping its software so future students could have more time to work through the courses. “We are seeing significant improvement in learning outcomes and student engagement,” says Udacity founder and Stanford University professor Sebastian Thurn. Meanwhile, some MOOC pioneers are developing a connectivist MOOC model, which is more about the connections and communications among students than about the content delivered by a professor.


Gates Foundation Big Data Grants Stress Open Data

The Bill and Melinda Gates Foundation has allocated six $100,000 grants to projects for the purpose of “increasing interoperability of social good data.” The grants mark the first time the foundation has made data an element in its Grand Challenges. Among the grants’ recipients are NetHope, a U.S. nonprofit that aims to improve information management following a natural disaster, and Development Initiatives Poverty Research, a U.K. nonprofit striving to devise an open-resource toolkit to create interoperable datasets for ending poverty. Also receiving a grant is a University of Michigan research team led by professor H.V. Jagadish, who is developing tools to link incompatible data sets. Jagadish says improving policymakers’ decisions is the goal of his project, and he plans to employ data available through the data.gov clearinghouse to develop a prototype to show that his effort can generate meaningful ways to compare public data, and to make queries such as “Does better primary school education lower crime rates?” The Gates Foundation’s Victoria Vrana says the overriding goal of the grants is to have foundations, nonprofit leaders, and citizens more informed by data in what they fund and how they do their work by combining people’s observations with thorough analysis.