Tag - Suppose

A Data Analyst guide to A/B testing

A/B testing is key to improving results in any marketing campaign. We examine the issues involved in its 3 main components: message variants, user group selection, and choosing the winning version. By Jacob Joseph, CleverTap. The primary aim...

About Risks and Side-Effects… Consult your Purrr-Macist

Capture errors, warnings and messages, but keep your list operations going. In a recent post about text mining, I discussed some solutions to web scraping the contents of our STATWORX blog using the purrr package. However, while preparing t...

adam kelleher

The Data Processing Inequality. If you look at the Wikipedia article for the data processing inequality, it’s really just a stub (as of the time this article was published). The inequality is given, but there is little context. The data processing ine...

Advanced Base Graphics Exercises

(This article was first published on R-exercises, and kindly contributed to R-bloggers) Being able to visualize information through plots is essential for statistical analysis. A simple and clean graph can explain much more than words. In this set...

Calculating a fuzzy kmeans membership matrix with R and Rcpp

by Błażej Moska, computer science student and data science intern. Suppose that we have performed K-means clustering in R and are satisfied with our results, but later we realize that it would also be useful to have a membership matrix. Of...

Introduction to Optimization with Genetic Algorithm

This article gives a brief introduction to evolutionary algorithms (EAs) and describes the genetic algorithm (GA), which is one of the simplest random-based EAs. By Ahmed Gad, KDnuggets Contributor. Selection of the optimal parameter values f...

Is Blockchain the Ultimate Enabler of Data Monetization?

Is blockchain the ultimate enabler of __data and analytic insights directly with others? By William Schmarzo, Dell EMC. Special thanks for the help on this blog to the coolest, most hip group of industry experts that I have ever met: the Pathfinders....

Putting the “Science” Back in Data Science

In my view, the scientific method is the best way to tackle a problem and offer the best solution. If you start your... By Rubens Zimbres. One of the things I learned with the scientific method was to get rid o...

setting ggplot2 background with ggbackground

ggimage 0.1.4 is available on CRAN. This release introduces a new function called ggbackground for setting an image background as the ggplot canvas. require(ggplot2) p <- ggplot(iris) + aes(x = Sepal.Length, y = Sepal.Width, color=Species) + ...