Statistics

Statistics Fri Jan 19 2018 08:30:00 GMT+0000 (UTC)
WHERE THE PARTY'S AT -- This afternoon, Alexander Ly is defending his PhD thesis at the University of Amsterdam. While I cannot attend the event, I want to celebrate the event and a remarkable thesis around the Bayes factor [even though we disagree on its role!] and the Jef
Kaiser Fung, of Junk Charts and Principal Analytics Prep, finds much to admire about this chart showing tremendous progress in social wellbeing across the globe over the last few decades. I came across this chart from the OurWorldinData website, and t
The sparklyr package from RStudio provides a high-level interface to Spark from R. This means you can create R objects that point to data frames stored in the Spark cluster and apply some familiar R paradigms (like dplyr) to the data, all the while l
When fitting a Bayesian model using MCMC (often via JAGS/BUGS/Stan), a number of checks are typically performed to make sure your model is worth interpreting without further manipulation (remember: all models are wrong, some are useful!): R-hat (AKA
A new record for the largest known prime was announced yesterday: This number has 23,249,425 digits when written in base 10. In base 2, 2^p - 1 is a sequence of p ones. For example, 31 = 2^5 - 1 which is 11111 in binary. So in binary, the new r
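A minimal sketch of the binary claim in the item above, for illustration only (it is not code from the linked post). The helper names `mersenne` and `decimal_digits` are hypothetical, and only small exponents are used because the excerpt does not state the exponent of the record prime.

```python
def mersenne(p: int) -> int:
    """Return 2^p - 1, computed with a bit shift."""
    return (1 << p) - 1


def binary_string(n: int) -> str:
    """Return the base-2 digits of n without Python's '0b' prefix."""
    return bin(n)[2:]


def decimal_digits(n: int) -> int:
    """Count base-10 digits exactly (fine for small n; slow for record-sized primes)."""
    return len(str(n))


if __name__ == "__main__":
    for p in (2, 3, 5, 7, 13):
        m = mersenne(p)
        # In base 2, 2^p - 1 is written as exactly p ones.
        print(p, m, binary_string(m), decimal_digits(m))
```

For p = 5 this prints 31 with binary form 11111, matching the example in the excerpt.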
Do women & men prefer to support different charitable causes? This analysis of celebrity endorsements reveals an interesting pattern. » See the graphic
Kaiser Fung, creator of Junk Charts and Principal Analytics Prep, discusses a popular chart by the New York Times on the impact of the Trump tax bill on middle-class households. The above chart, when it was unveiled at the end of November last year,
This landing page interactive for Zendesk, outlining how live chat functionality can be a powerful tool to convert customers online, offered us an opportunity to bring the story to life through animated illustrations.
If you've ever wondered how neural networks work behind the scenes, check out this guide to implementing neural networks from scratch with R, by David Selby. You may be surprised how with just a little linear algebra and a few R functions, you can trai
An eye-catching 'scaly-dragon' map of Berlin's public transport system in 1927
After last year's successful premiere, the eoda R-Akademie on the Road will be back in Hamburg this year for four inspiring days of professional R training for data scientists, programmers and R enthusiasts. The program for German-speaking users will
Retiring professor Leonard Fabiano contacted me looking to give away a set of technical books, mostly chemical engineering books. If you’re interested please email him. Here are the books: Click on the image to see a larger version. Two tit
Plastiq is the only service that lets customers pay businesses with their favorite credit or debit cards. This bright, fun, and character-driven video was directed at business owners. It was an on-point way to provide them with an overview of the
A question that came out on X validated today kept me busy for most of the day! It relates to an earlier question on the best unbiased nature of a maximum likelihood estimator, to which I pointed out the simple case of the Normal variance when the es
caret has been able to utilize parallel processing for some time (before it was on CRAN in October 2007) using slightly different versions of the package. Around September of 2011, caret started using the foreach package to "harmonize" the p
This paper aims at solving the Bartlett-Lindley-Jeffreys paradox, i.e., the difficulty connected with improper priors in Bayes factors. The introduction is rather lengthy since by page 9 we are still (dis-)covering the Lindley paradox, along with the
Kaiser Fung, creator of Junk Charts and founder of Principal Analytics Prep, appreciates data journalism on U.S. gun crime by the Guardian graphics team. I like the Guardian's feature (undated) on gun violence in American cities a lot. The fol
The R language surged to 8th place in the 2017 TIOBE language rankings, up 8 places from a year before. Fellow data science language Python also saw an increase in rankings, taking the 4th spot (one ahead of its January 2016 ranking). (Click
These have been the most popular math-related posts here this year: Golden powers are nearly integers; How efficient is Morse code?; Finding numbers in pi; Common words used as technical terms; Sierpinski triangle strikes again. See also a list of the top
The consensus from 120 top 10 lists via Metacritic » See the graphic » Spotify playlist | Open Spotify
Mazama Science has released the first official version (1.0) of the PWFSLSmoke R package for working with PM2.5 monitoring data. A beta version was released last year, along with an accompanying blog post. In this post, we discuss the purpose
During his talk on unbiased MCMC in Dauphine today, Pierre Jacob provided a nice illustration of the convergence modes of MCMC algorithms. With the stationary target achieved after 100 Metropolis iterations, while the mean of the target taking much m
Long-time reader Daniel L. said this made him laugh. This prompted me to revive a feature I used to run on here called "Light entertainment." Dataviz works that are so easy to ridicule that one wonders if they weren't just made...
Type "Harry Potter" as a text on your phone. Now press the predictive text button a few times. I got "Harry Potter was the time to come", but my phone has been trained on my texts and you'll likely get something different. But if you train the predic
I’ll be presenting at a webinar on Wednesday, December 13 at 1:00 PM Eastern. The title of the presentation is “Seven questions a statistician can answer for an attorney.” I will discuss, among other things, when common
Star Wars: The Last Jedi is the biggest movie ever to create such a dramatic divide between critics & audience on Rotten Tomatoes. Currently, Jedi's aggregate critical score is 93%. The audience score is just 56%. A 37% chasm. In fact, by our
I've just shared a short webcast on data reshaping in R using the cdata package. (link) We also have two really nifty articles on the theory and methods: "Fluid data reshaping with cdata" and "Coordinatized Data: A Fluid Data Specification". Please give it a t
Kaiser Fung, founder of Junk Charts and Principal Analytics Prep, the premier data analytics bootcamp in New York City, finds omitted data in a chart used by Verge to illustrate the survey of attitudes toward social media brands. I'm not quite done wit
The Revolutions team is celebrating Christmas today, and we're taking a break with family and enjoying good food. And given the number of Eggnogs that are being prepared thanks to Hadley Wickham's eggnogr Shiny app it might be a good idea to take the
The Shannon wavelet has an interesting plot: Given the complexity of the plot, the function definition is surprisingly simple: The Fourier transform is even simpler: it’s the indicator function of [-2π, -π] ∪ [π, 2π], i.e. the function that is 1
[UPDATED] World’s Most Contagious Falsehoods. Floss your belief structures with our filterable charticle. Click on each bubble to read the underlying sources & data.
Bob Rudis (@hrbrmstr) is a famed expert, author and developer in Data Security and the Chief Security Data Scientist at Rapid7. Bob also creates the most deliciously vivid images of his meals documented by the #nom hashtag. I'm going to use a similar
Aware of my fascination for paradoxes (and heterodox publications), Ewan Cameron sent me the link to a recent arXival by Louis Lyons (Oxford) on different asymptotic distributions of the likelihood ratio. Which is full of approximations. The overall
Kaiser Fung, founder of Junk Charts and Principal Analytics Prep, finds new ways to visualize market research data coming from a study of consumer attitudes toward Facebook, Amazon, Apple, Microsoft, Google, and Twitter by Verge. Aleks J. wasn'
Sure, this is a promo for a movie, but I'd love to have a full-length single of this: Relatedly, if you want to settle an argument about which pop diva has the greatest vocal range, Giora Simchoni used R to perform frequency analysis of their hits: T
Last week I wrote about Jentzsch’s theorem. It says that if the power series of a function has a finite radius of convergence, the set of zeros of the partial sums of the series will cluster around the boundary of convergence. This post will
Toxins. Overfishing. Bycatch. From trawled tuna & cod to farmed salmon & tilapia – the latest data on sustainable fish visualized! Multi-sourced from the Marine Conservation Society, Marine Stewardship Council, Greenpeace
One of the best things about learning R is that no matter your skill level, there is always someone who can benefit from your experience. Topics in R ranging from complicated machine learning approaches to calculating a mean all find their relevant a
Kaiser Fung, founder of Junk Charts and Principal Analytics Prep, explains why simple analysis should be illustrated using simple graphics. A reader sent this tip in some time ago and I lost track of who he/she is. This graphic looks deceptively compl
The main international R user conference, useR!, is typically hosted in Europe every other year. Back in 2016 when useR! was held in the USA, an alternative conference was held in Poland. Next year (when the useR! conference will be in Australia) the
The last digits of Fibonacci numbers repeat with period 60. This is something I’ve written about before. The 61st Fibonacci number is 2504730781961. The 62nd is 4052739537881. Since these end in 1 and 1, the 63rd Fibonacci number must end in
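The period-60 claim in the item above can be checked with a short sketch (an illustration under stated assumptions, not code from the linked post). The function names `fib_last_digits` and `pisano_period` are hypothetical, and the sequence is indexed so that the 1st and 2nd Fibonacci numbers are both 1, which is consistent with the 61st and 62nd values quoted in the excerpt.

```python
def fib_last_digits(count: int) -> list[int]:
    """Last decimal digit of the 1st..count-th Fibonacci numbers (F1 = F2 = 1)."""
    digits = []
    a, b = 1, 1  # F1, F2
    for _ in range(count):
        digits.append(a % 10)
        a, b = b, a + b
    return digits


def pisano_period(m: int) -> int:
    """Smallest k > 0 with (F_k, F_{k+1}) == (0, 1) mod m, i.e. the period of F mod m."""
    if m < 2:
        raise ValueError("modulus must be at least 2")
    a, b = 0, 1  # F0, F1 reduced mod m
    k = 0
    while True:
        a, b = b, (a + b) % m
        k += 1
        if (a, b) == (0, 1):
            return k


if __name__ == "__main__":
    print(pisano_period(10))           # period of the last digits
    digits = fib_last_digits(120)
    print(digits[:60] == digits[60:])  # the second block of 60 repeats the first
```

Because the last digit of a sum depends only on the last digits of the two terms, the whole sequence of last digits repeats as soon as one consecutive pair of digits recurs.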
Problem overview This article will be divided into a series of posts. In this post, I will explain the basic concepts behind convolutional neural networks and how to build them using Keras. In the next post, I will focus on improving the perfor... (T
Along with Atheism: A Very Short Introduction, I also bought Humanism: A Very Short Introduction, as they come by two at the Warwick campus bookstore (!). Written by Stephen Lee, the book is much less irritating than Atheism. Maybe because it is constructed
Ray Vella (link) asked me to comment on a chart about regional wealth distribution, which I wrote about here. He also asked students in his NYU infographics class to create their own versions. This effort caught my eye: This work...
The winners of the American Statistical Association Police Data Challenge have been announced. The ASA teamed up with the Police Data Initiative, which provides open data from local law enforcement agencies in the US, t