Euangelos Linardos

Welcome to my personal weblog! This is Euangelos Linardos, a passionate statistician and computer scientist, a keen runner and cyclist, and a wannabe environmentalist, traveler, and pianist. I enjoy writing about my experiences in machine learning and programming trading, with some helpful tutorials and tips along the way. To find out more about my professional background, please have a look at my LinkedIn and Xing profiles. I hope you find this weblog useful and leave a comment or two!

02 April 2016 ~ 0 Comments

Apache Spark Workshop, Apr. 2016


On 02 April 2016, I was invited to the Department of Information and Communication Systems Engineering, School of Engineering, University of the Aegean, to give a lecture on Apache Spark, the well-known open source cluster computing framework. The course consisted of three parts: a) installing the environment through Docker, b) an introduction to Spark as well [...]

17 December 2015 ~ 0 Comments

Data At Pollfish, Dec. 2015


I was honored to present Pollfish at the 2nd Data Science Athens Meetup on 17 December 2015. This was the first time my team and I presented to the outside world what we are doing from a technology standpoint. The main topics I presented were how data flow through our system, the tools [...]

01 July 2015 ~ 0 Comments

How to Free Up Hard Disk Space


Every time Ubuntu updates or you install some software, the package manager downloads all of the packages to the system and then caches them there in case they need to be installed again. Unfortunately, this can often mean a lot of wasted space. So, if your Ubuntu machine has only a few megabytes of [...]

01 June 2015 ~ 0 Comments

Challenge Next Word Predictor


The goal of this challenge is to build a simple next-word predictor in the style of a particular author. Concretely, given as input a sentence like “The warm summer”, the code continues the sentence by providing the next word, which in this particular example might be “breeze”, “afternoon”, “nights”, “was”, “ended”, or even punctuation such [...]
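
As a rough illustration of the idea (not necessarily the approach taken in the full post), the sketch below assumes a plain bigram model: it counts which word follows which in a sample text and suggests the most frequent followers of the last word typed. The names `tokenize`, `build_bigram_model`, `predict_next`, and the tiny `corpus` are hypothetical and exist only for this example.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into words and punctuation marks."""
    return re.findall(r"[a-z']+|[.,;:!?]", text.lower())


def build_bigram_model(text):
    """Map each token to a Counter of the tokens that follow it."""
    model = defaultdict(Counter)
    tokens = tokenize(text)
    for current, following in zip(tokens, tokens[1:]):
        model[current][following] += 1
    return model


def predict_next(model, sentence, k=3):
    """Return up to k most frequent followers of the sentence's last token."""
    tokens = tokenize(sentence)
    if not tokens or tokens[-1] not in model:
        return []  # nothing to go on: empty input or unseen word
    return [word for word, _ in model[tokens[-1]].most_common(k)]


# Hypothetical toy corpus; a real run would read an author's collected works.
corpus = (
    "The warm summer breeze drifted in. "
    "The warm summer afternoon was long. "
    "The summer breeze was gentle."
)
model = build_bigram_model(corpus)
suggestions = predict_next(model, "The warm summer", k=2)
print(suggestions)
```

A bigram model only looks at the last word, so it cannot tell "The warm summer" from "A cold summer"; longer n-grams with a back-off rule would be the natural next step.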

01 May 2015 ~ 0 Comments

Challenge Stack Exchange


The goal of this challenge is to get familiar with the Stack Exchange Data Explorer as well as to accomplish some exciting tasks. As we all know, Stack Exchange is a highly trafficked site, with thousands of posts per day across various programming languages. Each post has an associated comment thread where users can [...]

01 April 2015 ~ 0 Comments

Challenge Antivirus Products


The goal of this challenge is to build a predictive model as well as a recommendation engine for antivirus products. We start by collecting the required data from PC Magazine. More specifically, we crawl all{i} web pages for i:=1:33 and, for each of them, we extract the following three fields. Note that, although they [...]

01 March 2015 ~ 0 Comments

Challenge Extract Quotations


The goal of this challenge is to take as input a text file, such as the Mansfield Park novel, and output another text file containing every unquoted instance of short dialogue that was encountered. We define a short dialogue as anything that a character in the book says that is no more than one sentence [...]
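
One possible way to approach this (a minimal sketch, assuming dialogue is delimited by straight or curly double quotes and that "one sentence" means no sentence-ending punctuation before the final mark) is a regular expression pass. The helper name `extract_short_dialogue` and the sample text are hypothetical.

```python
import re

# Text between a pair of double quotes (straight or curly).
QUOTE_RE = re.compile(r'["“]([^"“”]+)["”]')
# Sentence-ending punctuation.
SENTENCE_END_RE = re.compile(r"[.!?]")


def extract_short_dialogue(text):
    """Return quoted passages, without quote marks, that span one sentence."""
    results = []
    for match in QUOTE_RE.finditer(text):
        quote = " ".join(match.group(1).split())  # collapse line breaks
        body = quote.rstrip(".!?,;: ")  # ignore the closing punctuation
        if body and not SENTENCE_END_RE.search(body):
            results.append(quote)
    return results


# Hypothetical sample; a real run would read the novel from a text file.
sample = (
    'She said, "I am tired." He replied, "So am I. Let us rest." '
    '"Indeed!" cried the third.'
)
short_quotes = extract_short_dialogue(sample)
print(short_quotes)
```

This heuristic treats the period in abbreviations such as "Mrs." as a sentence break, so such quotes would be wrongly dropped unless abbreviations are handled separately.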

01 February 2015 ~ 0 Comments

Challenge Yahoo! Finance


The goal of this challenge is to build a web application that takes two stock tickers and produces a time series chart of the rolling 30-day correlation between them. Data are fetched from Yahoo! Finance, Python Pandas is used to calculate the correlation, and Python Matplotlib is used to generate the chart. [...]
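
The correlation step could look roughly like the sketch below. It is an illustration under stated assumptions rather than the code from the full post: the two price series are synthetic stand-ins for data fetched from Yahoo! Finance, the correlation is computed on daily returns (the excerpt does not say whether prices or returns are used), and `rolling_correlation` is a hypothetical helper name.

```python
import numpy as np
import pandas as pd


def rolling_correlation(prices_a, prices_b, window=30):
    """Rolling Pearson correlation of the daily returns of two price series."""
    returns_a = prices_a.pct_change()
    returns_b = prices_b.pct_change()
    return returns_a.rolling(window).corr(returns_b)


# Synthetic prices for 120 business days (assumption: real data would be
# downloaded for the two tickers instead).
rng = np.random.default_rng(seed=0)
dates = pd.date_range("2015-01-01", periods=120, freq="B")
shared = rng.normal(0.0, 0.01, size=120)  # market move common to both stocks
noise_a = rng.normal(0.0, 0.005, size=120)
noise_b = rng.normal(0.0, 0.005, size=120)
prices_a = pd.Series(100 * np.exp(np.cumsum(shared + noise_a)), index=dates)
prices_b = pd.Series(50 * np.exp(np.cumsum(shared + noise_b)), index=dates)

corr = rolling_correlation(prices_a, prices_b, window=30)
print(corr.dropna().tail())
```

The first 30 values are NaN because the first return is undefined and each window needs 30 returns. Calling `corr.plot()` would hand the series to Matplotlib for charting.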

01 January 2015 ~ 0 Comments

Challenge Tourism and Hospitality


The goal of this challenge is to analyze a small set of data, with emphasis on feature selection techniques, to increase the overall accuracy of the model. As the name reveals, the data here are from the tourism and hospitality sector. I started by transforming (i.e. discretizing and normalizing) the data. Then, I [...]
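
To make the transformation step concrete, here is a minimal sketch assuming min-max normalization and equal-width discretization; the excerpt does not say which variants were actually used. The function names and the `nightly_rates` values are hypothetical.

```python
def min_max_normalize(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant feature: no spread to rescale
    return [(v - lo) / (hi - lo) for v in values]


def equal_width_bins(values, n_bins):
    """Assign each value to one of n_bins equally wide intervals (0-based)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value falls on the upper edge, so clamp it into the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


# Hypothetical feature column, for illustration only.
nightly_rates = [50.0, 75.0, 100.0, 150.0, 250.0]
normalized = min_max_normalize(nightly_rates)
bins = equal_width_bins(nightly_rates, n_bins=4)
print(normalized)
print(bins)
```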

01 December 2014 ~ 0 Comments

Ubuntu Apps that Boost your Productivity


Making the switch to Ubuntu – or any popular Linux distribution – is more than the mere act of changing operating systems. It’s quite a time-consuming process, and you may not even know which applications you should install to get work done. In this article, I’ll be sharing critical applications [...]