Tag - There

corrr 0.2.1 now on CRAN

(This article was first published on blogR, and kindly contributed to R-bloggers) @drsimonj here to discuss the latest CRAN release of corrr (0.2.1), a package for exploring correlations in a tidy R framework. This post will describe corrr features a...

Cute Gibbs sampling for rounded observations

I was attending a course on Bayesian Statistics where this problem showed up: There are a number of individuals, say 12, who take a pass/fail test 15 times. For each individual we have recorded the number of passes, which can go from 0 to 15. Because...

Data Driven Cheatsheets

Guest post by Jonathan Sidi. Cheatsheets are currently built and used exclusively as a teaching tool. We want to try and change this and produce a cheat sheet that gives a roadmap to build a known product, but also is built as a...

Data Science and Big Data, Explained

This article is meant to give the non-data scientist a solid overview of the many concepts and terms behind data science? What is big data? What do these terms mean and why is it important to find out? These are hot topics indeed, but are often mis...

Data Science and the Imposter Syndrome

You are not the only one who wonders how much longer they can get away with pretending to be a... I am not a real... Even Ewoks feel like imposters sometimes. (Photo courtesy of Diane Rohrer.) What a real data scientist looks like: “Data science” i...

Data Science: (not) the preferred nomenclature

The term data... could be called “Data-Driven Science”. Whatever your preferred term, reinforcing the distinction will help establish the Science OF data and doing Science WITH data as bona-fide disciplines. By Peter Flach, U. of Bristol. dat...

Dataframes and the tidyverse

The data frame is the primary structure for working with data in R. Whenever you have data that is arranged in a spreadsheet-like fashion, the default receptacle for that data in R is the data frame. In a data frame, each column contains...

Deep Learning Can be Applied to Natural Language Processing

This post is a rebuttal to a recent article suggesting that neural networks cannot be applied to natural language, given that language is not produced as a result of a continuous function. The post delves into some additional points on deep learning a...

Deploying Production-grade Data Products – Special Report

Dataiku launched a survey a few months back to find out how companies handled going from designing to deploying a... Report: How do Companies Really Manage to Put Data Science into Production? Deploying a... The result? There are four ways that companies...

Deploying R, RStudio and Shiny applications on Ubuntu Server

In this post, we are going to see how to deploy R, RStudio, and Shiny apps on a virtual server. The post won’t focus on the Shiny programming aspect, nor on Ubuntu programming; however, the tutorial is self-sufficient. Only by following the differen...