Quantcast
Channel: Big Data – Blog of Leonid Mamchenkov
Browsing all 24 articles
Browse latest View live

In Head-Hunting, Big Data May Not Be Such a Big Deal

Very interesting interview with Laszlo Block, senior vice president of people operations at Google.  Here are some of my favorite bits. Years ago, we did a study to determine whether anyone at Google...

View Article


GitHub’s Data Challenge II winners announced

GitHub, being a massive data store, is constantly looking for new and improved ways of extracting knowledge from its data. In April we announced the second annual GitHub data challenge. [...] After...

View Article


Elasticsearch – distributed restful search and analytics

#

View Article

πfs – the data-free filesystem!

πfs – the data-free filesystem! πfs is a revolutionary new file system that, instead of wasting space storing your data on your hard drive, stores your data in π! You’ll never run out of space again –...

View Article

ECharts 2.0 – an Open Source charts solution from the Baidu team

ECharts 2.0 – an Open Source charts solution from the Baidu team

View Article


Free Data Science Books

I came across a collection of free data science books: Pulled from the web, here is a great collection of eBooks (most of which have a physical version that you can purchase on Amazon) written on the...

View Article

Image may be NSFW.
Clik here to view.

400,000 GitHub repositories, 1 billion files, 14 terabytes of code: Spaces or...

Here is an interesting bit of research – do people prefer tabs or spaces when programming the most popular languages? Tabs or spaces. We are going to parse a billion files among 14 programming...

View Article

Image may be NSFW.
Clik here to view.

Amazon Snowmobile – a truck with up to 100 Petabytes of storage

Back in my college days, I had a professor who frequently used Andrew Tanenbaum‘s quote in the networking class: Never underestimate the bandwidth of a station wagon full of tapes hurtling down the...

View Article


Grakn and Graql – a database for AI

From the grakn.ai website: Grakn is a distributed hyper-relational database for knowledge-oriented systems. Grakn enables machines to manage complex data that serves as a knowledge base for...

View Article


Image may be NSFW.
Clik here to view.

A Comparative Analysis of Top 6 BI and Data Visualization Tools in 2018

A Comparative Analysis of Top 6 BI and Data Visualization Tools in 2018 looks at 6 most popular business intelligence and data visualization tools and compares pros, cons, features, pricing and more....

View Article

Image may be NSFW.
Clik here to view.

Registry of Open Data on AWS

AWS News Blog covers the Registry of Open Data on AWS: Almost a decade ago, my colleague Deepak Singh introduced the AWS Public Datasets in his post Paging Researchers, Analysts, and Developers. I’m...

View Article

Image may be NSFW.
Clik here to view.

Metabase – Open Source business intelligence and analytics

Metabase is an Open Source business intelligence and analytics tool.  It supports a variety of databases and services as sources for data, and provides a number of data querying and processing tools....

View Article

Image may be NSFW.
Clik here to view.

Cyprus National Internet Portal for Open Data

It is via this Cyprus Mail article that I’ve learned that not only Cyprus has an official Open Data portal, but that it’s also the best in Europe: Cyprus is one of the top five European Union...

View Article


Image may be NSFW.
Clik here to view.

Data Science Cheatsheets

Here’s a good collection of cheatsheets for anyone involved with Big Data and machine learning. Whether you are already well versed in the subject, or just starting, I’m sure you’ll find something...

View Article

ECharts 2.0 – an Open Source charts solution from the Baidu team

ECharts 2.0 – an Open Source charts solution from the Baidu team

View Article


Free Data Science Books

I came across a collection of free data science books: Pulled from the web, here is a great collection of eBooks (most of which have a physical version that you can purchase on Amazon) written on the...

View Article

Image may be NSFW.
Clik here to view.

400,000 GitHub repositories, 1 billion files, 14 terabytes of code: Spaces or...

Here is an interesting bit of research – do people prefer tabs or spaces when programming the most popular languages? Tabs or spaces. We are going to parse a billion files among 14 programming...

View Article


Image may be NSFW.
Clik here to view.

Amazon Snowmobile – a truck with up to 100 Petabytes of storage

Back in my college days, I had a professor who frequently used Andrew Tanenbaum‘s quote in the networking class: Never underestimate the bandwidth of a station wagon full of tapes hurtling down the...

View Article

Grakn and Graql – a database for AI

From the grakn.ai website: Grakn is a distributed hyper-relational database for knowledge-oriented systems. Grakn enables machines to manage complex data that serves as a knowledge base for...

View Article

Image may be NSFW.
Clik here to view.

A Comparative Analysis of Top 6 BI and Data Visualization Tools in 2018

A Comparative Analysis of Top 6 BI and Data Visualization Tools in 2018 looks at 6 most popular business intelligence and data visualization tools and compares pros, cons, features, pricing and more....

View Article
Browsing all 24 articles
Browse latest View live