Keep the gradient flowing

Handwritten digits and Locally Linear Embedding

May 4, 2011, 1:46 am

I decided to test my new Locally Linear Embedding (LLE) implementation against a real dataset. At first I didn't think this would turn out very well, since LLE seems to be somewhat fragile, yielding...

View Article

Manifold learning in scikit-learn

June 7, 2011, 12:19 am

The manifold module in scikit-learn is slowly progressing: the locally linear embedding implementation was finally merged along with some documentation. At about the same time but in a different...

View Article

LLE comes in different flavours

June 30, 2011, 7:22 am

I haven't worked on the manifold module since last time, yet thanks to Jake VanderPlas there are some cool features I can talk about. First off, the ARPACK backend is finally working and gives factor...

View Article

Ridge regression path

July 12, 2011, 12:21 am

Ridge coefficients for multiple values of the regularization parameter can be elegantly computed by updating the thin SVD decomposition of the design matrix: import numpy as np from scipy import linalg...
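The excerpt's own code is cut off, so here is a minimal sketch of the idea, not the post's implementation. It assumes the ridge objective ||Xw - y||² + alpha·||w||² and reuses a single thin SVD of X for every alpha. The function name `ridge_path` is hypothetical, and NumPy's SVD stands in for the `scipy.linalg` one imported in the excerpt.

```python
import numpy as np


def ridge_path(X, y, alphas):
    """Ridge coefficients for each alpha, reusing one thin SVD of X.

    With X = U diag(s) V^T, the minimizer of ||Xw - y||^2 + alpha ||w||^2
    is w = V diag(s / (s^2 + alpha)) U^T y.
    """
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    Uty = U.T @ y                                   # shape (k,)
    alphas = np.asarray(alphas, dtype=float)
    # One row of shrinkage factors per alpha: shape (n_alphas, k)
    shrink = s / (s ** 2 + alphas[:, None])
    # Row i holds the coefficients for alphas[i]: shape (n_alphas, n_features)
    return (shrink * Uty) @ Vt
```

The SVD is the expensive step; once it is available, each additional alpha only costs a rescaling and one matrix product.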

View Article

scikit-learn EuroScipy 2011 coding sprint -- day one

August 23, 2011, 12:38 pm

As a warm-up for the upcoming EuroScipy conference, some of the scikit-learn developers decided to gather and work together for a couple of days. Today was the first day and there was only a handful...

View Article

scikit-learn’s EuroScipy 2011 coding sprint -- day two

August 24, 2011, 3:33 pm

Today's coding sprint was a bit more crowded, with some notable scipy hackers such as Ralf Gommers, Stefan van der Walt, David Cournapeau and Fernando Perez from IPython joining in. On what got done:...

View Article

Reworked example gallery for scikit-learn

September 4, 2011, 11:09 am

I've been working lately on improving the scikit-learn example gallery to also show a small thumbnail of the plotted result. Here is what the gallery looks like now: And the real thing should be...

View Article

scikit-learn 0.9

October 2, 2011, 2:19 am

Last week we released a new version of scikit-learn. The Changelog is particularly impressive, yet personally this release is important for other reasons. This will probably be my last release as a...

View Article

qr_multiply function in scipy.linalg

October 14, 2011, 7:44 am

In scipy's development version there's a new function closely related to the QR-decomposition of a matrix and to the least-squares solution of a linear system. What this function does is to compute the...

View Article

Low rank approximation

November 6, 2011, 3:05 am

A little experiment to see what low rank approximation looks like. These are the best rank-k approximations (in the Frobenius norm) to a natural image for increasing values of k and an original...
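As a rough sketch of how such approximations can be computed: by the Eckart-Young theorem, truncating the SVD to its k largest singular values gives a best rank-k approximation in the Frobenius norm. The function name `best_rank_k` is hypothetical, and any 2-D array (for instance a grayscale image) is assumed as input.

```python
import numpy as np


def best_rank_k(A, k):
    """Best rank-k approximation of a 2-D array A in the Frobenius norm."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    # Keep only the k leading singular triplets.
    return (U[:, :k] * s[:k]) @ Vt[:k]
```

The approximation error equals the square root of the sum of the squared discarded singular values, which is why images with fast-decaying spectra look reasonable even for small k.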

View Article

line-by-line memory usage of a Python program

April 23, 2012, 10:04 pm

My newest project is a Python library for monitoring memory consumption of an arbitrary process, and one of its most useful features is the line-by-line analysis of memory usage for Python code. I wrote a...

View Article

Learning to rank with scikit-learn: the pairwise transform

October 22, 2012, 3:00 pm

This tutorial introduces the concept of pairwise preference used in most ranking problems. I'll use scikit-learn for learning and matplotlib for visualization. In the ranking setting, training...

View Article

Singular Value Decomposition in SciPy

December 7, 2012, 3:00 pm

SciPy contains two methods to compute the singular value decomposition (SVD) of a matrix: scipy.linalg.svd and scipy.sparse.linalg.svds. In this post I'll compare both methods for the task of computing...

View Article

Memory plots with memory_profiler

January 3, 2013, 3:00 pm

Besides performing a line-by-line analysis of memory consumption, memory_profiler exposes some functions that allow retrieving the memory consumption of a function in real time, allowing e.g. to...

View Article

Loss Functions for Ordinal regression

February 26, 2013, 3:00 pm

Note: this post contains a fair amount of LaTeX; if the math doesn't render correctly, visit its original location. In machine learning it is common to formulate the classification task as a...

View Article

Householder matrices

March 29, 2013, 4:00 pm

Householder matrices are square matrices of the form $$ P = I - \beta v v^T$$ where $\beta$ is a scalar and $v$ is a vector. They have the useful property that for suitably chosen $v$ and $\beta$ it...
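The excerpt is cut off before stating the property, so the following is only an illustration of one standard choice, not necessarily the one the post goes on to describe. Taking $v = x + \operatorname{sign}(x_1)\|x\| e_1$ and $\beta = 2 / (v^T v)$ makes $P$ orthogonal and maps a nonzero vector $x$ to a multiple of the first basis vector. The function name `householder_vector` is hypothetical.

```python
import numpy as np


def householder_vector(x):
    """Return (v, beta) such that (I - beta v v^T) x is a multiple of e_1.

    Assumes x is a nonzero 1-D array.
    """
    x = np.asarray(x, dtype=float)
    v = x.copy()
    # Adding (rather than subtracting) the norm avoids cancellation in v[0].
    sign = 1.0 if x[0] >= 0 else -1.0
    v[0] += sign * np.linalg.norm(x)
    beta = 2.0 / (v @ v)
    return v, beta
```

In practice $P$ is rarely formed explicitly; applying it as `x - beta * v * (v @ x)` costs only O(n) operations.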

View Article

Isotonic Regression

April 15, 2013, 3:00 pm

My latest contribution for scikit-learn is an implementation of the isotonic regression model that I coded with Nelle Varoquaux and Alexandre Gramfort. This model finds the best least squares fit to a...

View Article

Logistic Ordinal Regression

May 1, 2013, 3:00 pm

TL;DR: I've implemented a logistic ordinal regression or proportional odds model. Here is the Python code. The logistic ordinal regression model, also known as the proportional odds model, was introduced in...

View Article

Numerical optimizers for Logistic Regression

May 19, 2013, 3:00 pm

In this post I compare several implementations of Logistic Regression. The task was to implement a Logistic Regression model using standard optimization tools from scipy.optimize and compare them...

View Article

Different ways to get memory consumption or lessons learned from...

July 24, 2013, 3:00 pm

As part of the development of memory_profiler I've tried several ways to get memory usage of a program from within Python. In this post I'll describe the different alternatives I've tested. The psutil...

View Article

Surrogate Loss Functions in Machine Learning

June 19, 2014, 3:00 pm

TL;DR: These are some notes on calibration of surrogate loss functions in the context of machine learning. But mostly it is an excuse to post some images I made. In the binary-class classification...

View Article

Plot memory usage as a function of time

November 6, 2014, 3:00 pm

One of the lesser-known features of the memory_profiler package is its ability to plot memory consumption as a function of time. This was implemented by my friend Philippe Gervais, previously a...

View Article

Data-driven hemodynamic response function estimation

December 4, 2014, 3:00 pm

My latest research paper[1] deals with the estimation of the hemodynamic response function (HRF) from fMRI data. This is an important topic since the knowledge of a hemodynamic response function is...

View Article

PyData Paris - April 2015

April 6, 2015, 3:00 pm

Last Friday was PyData Paris, in the words of the organizers, "a gathering of users and developers of data analysis tools in Python". The organizers did a great job in putting together and the event...

View Article

IPython/Jupyter notebook gallery

April 20, 2015, 3:00 pm

Due to lack of time and interest, I'm no longer maintaining this project. Feel free to grab the sources from https://github.com/fabianp/nbgallery and fork the project. TL;DR I created a gallery for...

View Article

Holdout cross-validation generator

August 19, 2015, 3:00 pm

Cross-validation iterators in scikit-learn are simply generator objects, that is, Python objects that implement the __iter__ method and that for each call to this method return (or more precisely,...
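To make the description concrete, here is a minimal sketch of an object whose `__iter__` method yields a single (train, test) pair of index arrays. The class name `HoldOut` and its parameters are hypothetical, chosen for illustration; this is not the code from the post and not a scikit-learn class.

```python
import numpy as np


class HoldOut:
    """Hypothetical hold-out splitter: yields one (train, test) index pair."""

    def __init__(self, n_samples, test_size=0.25, seed=0):
        self.n_samples = n_samples
        self.test_size = test_size
        self.seed = seed

    def __iter__(self):
        # Re-seeding on every call makes each iteration reproduce the same split.
        rng = np.random.default_rng(self.seed)
        perm = rng.permutation(self.n_samples)
        n_test = int(round(self.test_size * self.n_samples))
        yield perm[n_test:], perm[:n_test]

    def __len__(self):
        return 1
```

Usage would look like `for train, test in HoldOut(100): ...`, the same loop shape used with any other iterable of (train, test) index pairs.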

View Article

On the consistency of ordinal regression methods

October 8, 2015, 3:00 pm

My latest work (with Francis Bach and Alexandre Gramfort) is on the consistency of ordinal regression methods. It has the wildly imaginative title of "On the Consistency of Ordinal Regression...

View Article

SAGA algorithm in the lightning library

February 21, 2016, 3:00 pm

Recently I've implemented, together with Arnaud Rachez, the SAGA[1] algorithm in the lightning machine learning library (which, by the way, has recently been moved to the new scikit-learn-contrib...

View Article

scikit-learn-contrib, an umbrella for scikit-learn related projects.

March 5, 2016, 3:00 pm

Together with other scikit-learn developers we've created an umbrella organization for scikit-learn-related projects named scikit-learn-contrib. The idea is for this organization to host projects that...

View Article

Lightning v0.1

March 24, 2016, 4:00 pm

Announcement: first public release of lightning, a library for large-scale linear classification, regression and ranking in Python. The library was started a couple of years ago by Mathieu Blondel who...

View Article

Hyperparameter optimization with approximate gradient

May 24, 2016, 3:00 pm

TL;DR: I describe a method for hyperparameter optimization by gradient descent. Most machine learning models rely on at least one hyperparameter to control for model complexity. For example, logistic...

View Article

A fully asynchronous variant of the SAGA algorithm

October 11, 2016, 3:00 pm

My friend Rémi Leblond has recently uploaded to arXiv our preprint on an asynchronous version of the SAGA optimization algorithm. The main contribution is to develop a parallel (fully asynchronous, no...

View Article

Optimization inequalities cheatsheet

January 10, 2017, 3:00 pm

Most proofs in optimization consist in using inequalities for a particular function class in some creative way. This is a cheatsheet with inequalities that I use most often. It considers class of...

View Article

Notes on the Frank-Wolfe Algorithm, Part I

March 20, 2018, 4:00 pm

This blog post is the first in a series discussing different theoretical and practical aspects of the Frank-Wolfe algorithm...

View Article

Three Operator Splitting

September 5, 2018, 3:00 pm

View Article

Notes on the Frank-Wolfe Algorithm, Part II: A Primal-dual Analysis

November 16, 2018, 3:00 pm

This blog post extends the convergence theory from the first part of my notes on the Frank-Wolfe (FW) algorithm with convergence guarantees on the primal-dual gap which generalize and strengthen the...

View Article

How to Evaluate the Logistic Loss and not NaN trying

September 26, 2019, 3:00 pm

A naive implementation of the logistic regression loss can result in numerical indeterminacy even for moderate values. This post takes a closer look into the source of these instabilities and...
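A small illustration of the kind of instability described, assuming the per-sample loss is written as log(1 + exp(-z)) for a margin z. The stable variant below relies on NumPy's `np.logaddexp`, which is a swapped-in remedy and not necessarily the formulation the post arrives at. Both function names are hypothetical.

```python
import numpy as np


def logistic_loss_naive(z):
    """log(1 + exp(-z)) computed literally; exp overflows for very negative z."""
    with np.errstate(over="ignore"):
        return np.log(1.0 + np.exp(-z))


def logistic_loss_stable(z):
    """Same quantity via log(exp(0) + exp(-z)), evaluated without overflow."""
    return np.logaddexp(0.0, -z)
```

For a margin such as z = -1000 the naive version returns infinity, while the stable one returns a finite value close to 1000.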

View Article

On the Link Between Polynomials and Optimization

April 6, 2020, 3:00 pm

There's a fascinating link between minimization of quadratic functions and polynomials. A link that goes deep and makes it possible to phrase optimization problems in the language of polynomials and vice versa....

View Article

On the Link Between Optimization and Polynomials, Part 2

December 20, 2020, 3:00 pm

An analysis of momentum can be tightened using a combination of Chebyshev polynomials of the first and second kind. Through this connection we'll derive one of the most iconic methods in optimization:...

View Article

On the Link Between Optimization and Polynomials, Part 3

March 1, 2021, 3:00 pm

I've seen things you people wouldn't believe. Valleys sculpted by trigonometric functions. Rates on fire off the shoulder of divergence. Beams glitter in the dark near the Polyak gate. All those...

View Article

On the Link Between Optimization and Polynomials, Part 4

April 12, 2021, 3:00 pm

While the most common accelerated methods like Polyak and Nesterov incorporate a momentum term, a little-known fact is that simple gradient descent –no momentum– can achieve the same rate through only...

View Article

Optimization Nuggets: Exponential Convergence of SGD

December 14, 2021, 3:00 pm

This is the first of a series of blog posts on short and beautiful proofs in optimization (let me know what you think in the comments!). For this first post in the series I'll show that stochastic...

View Article

Optimization Nuggets: Implicit Bias of Gradient-based Methods

January 9, 2022, 3:00 pm

When an optimization problem has multiple global minima, different algorithms can find different solutions, a phenomenon often referred to as the implicit bias of optimization algorithms. In this post...

View Article

On the Link Between Optimization and Polynomials, Part 5

May 26, 2022, 3:00 pm

Six: All of this has happened before. Baltar: But the question remains, does all of this have to happen again? Six: This time I bet no. Baltar: You know, I've never known you to play the optimist. Why...

View Article

Notes on the Frank-Wolfe Algorithm, Part III: backtracking line-search

August 25, 2022, 3:00 pm

Backtracking step-size strategies (also known as adaptive step-size or approximate line-search) that set the step-size based on a sufficient decrease condition are the standard way to set the...

View Article

On the Convergence of the Unadjusted Langevin Algorithm

June 13, 2023, 3:00 pm

The Langevin algorithm is a simple and powerful method to sample from a probability distribution. It's a key ingredient of some machine learning methods such as diffusion models and differentially...

View Article

Optimization Nuggets: Stochastic Polyak Step-size

September 28, 2023, 3:00 pm

The stochastic Polyak step-size (SPS) is a practical variant of the Polyak step-size for stochastic optimization. In this blog post, we'll discuss the algorithm and provide a simple analysis for...

View Article

Optimization Nuggets: Stochastic Polyak Step-size, Part 2

November 18, 2023, 3:00 pm

This blog post discusses the convergence rate of the Stochastic Gradient Descent with Stochastic Polyak Step-size (SGD-SPS) algorithm for minimizing a finite sum objective. Building upon the proof of...

View Article

On the Link Between Optimization and Polynomials, Part 6.

May 3, 2024, 3:00 pm

Differentiating through optimization is a fundamental problem in hyperparameter optimization, dataset distillation, meta-learning and optimization as a layer, to name a few. In this blog post we'll...

View Article