During the last months, I’ve probably run the t-test dozens of times but recently I realized that I did not fully understand some concepts such as why it is not possible to accept the null hypothesis or where the numbers in the t-tables come from. After doing some research, I found that several articles provide those answers but not so many gather all of the information together.
Therefore, I decided to write this article to explain step-by-step the t-test so anyone can use it as a reference whenever they have to run the test or review the concepts.
Ever since the introduction of Transformer in the work “Attention is all you need”, there has been a transition in the field of NLP towards replacing Recurrent Neural Networks (RNN) with attention-based networks. In current literature, there are already many great articles that describe this method. Here are two of the best ones I found during my review: The Annotated Transformer and Transformers explained Visually.
However, after researching how to implement attention in computer vision (Best found articles: Understanding Attention Modules, CBAM, Papers with Code-Attention, Self-Attention, Self-Attention and Conv), I noticed that only a few of them clearly describe the…
Last week, I had to reimplement an LSTM-based neural network. After checking the PyTorch documentation, I had to spend some time again reading and understanding all the input parameters.
Therefore, this time I have decided to write this article where I have made a summary of how to implement some basics LSTM- neural networks. Here is the structure of the article:
In 1966, John von Neumann, who is considered to be one of the top mathematicians of the 20th century, introduced the term Cellular Automaton and defined it as a dynamic system that evolves in discrete steps. Four years later, John Conway created the well-known Game of Life, whose main feature is that evolution is determined by its initial state and does not require further input.
Although over the past months we have finally started to receive a vaccine for the COVID-19, the leading approach over the last year for the effect commonly known as “flattening the curve” of infections has been social distancing. However, there has been a clear lack of consensus among the governmental administrations throughout the globe, being possible to dedicate a whole set of articles to the multiple measures implemented in each country.
Instead of describing those measures and assessing the optimal ones, we decided to leave the questions about when and how to implement lockdowns to a model that could optimize…
Probably, most of the people reading this article have already implemented some CNN-based neural networks and have wondered whether to use Conv1D or Conv2D when doing time series analysis. Along with this article, I will explain and compare both types of convolutions and answer the following question: Why am I saying that Conv1D is a subclass of Conv2D?
If you cannot easily answer this question, I think this article will become interesting to you. As always, any questions/comments will be well welcomed.
Several months ago, I was asked to create a neural network for a time series-based challenge. …
A few months ago, I started researching how to classify evoked emotions using EEG recordings when I rapidly faced the most challenging problem when developing Brain Imaging Methods: the poor homogeneity of EEG activity across participants. This problem can be easily explained using Figure 1, where the plots were obtained by first extracting features for each EEG recordings for each video and participant and then implementing the dimensionality reduction tool UMAP to embed the data. While on the left, colours indicate emotions; on the right, colours indicate to whom the data corresponds to. …
After running several statistical tests to assess my models, I decided to dig deeper into the theory and ask myself questions such as why the number of samples is relevant for the statistical test, why the standard deviation has a square root in the denominator, or why statisticians differentiate between Z- and t-distribution.
Since I did not find a blog post that answered all these questions, I decided to run some simulations in Python and post the results along with this article for people interested.
The central limit theorem states that if you sufficiently select random samples from a population…
This article is written for people who want to build a basic autoencoder using PyTorch. The dataset in which this article is based on is the Fashion-MNIST dataset.
The article is divided into the following sections:
As defined in Wikipedia:
An autoencoder is a type of neural network used to learn efficient data codings in an unsupervised manner.
In other words, the aim of an autoencoder is to learn a lower representation of a set of data, which is useful for feature…
This article is written for people who want to learn or review how to build a basic Convolutional Neural Network in Keras. The dataset in which this article is based is the Fashion-Mnist dataset.
Along with this article, we will explain how:
Fashion-MNIST is a dataset of Zalando’s article images — consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10 classes. We intend Fashion-MNIST…
Artificial Intelligence researcher and developer