Sergei Ivanov

850 Followers


Aug 3, 2022

Why you should not use randomSplit in PySpark to split data into train and test.

If you work with large-scale data and want to prepare a dataset for your TensorFlow/PyTorch model, don’t use the randomSplit function to split the data into train and test. The problem: you have a PySpark DataFrame and would like to split it into two DataFrames, train and test. Obviously, you would like…

Python

3 min read



Published in Towards Data Science

·Mar 8, 2021

Top-10 Research Papers in AI

The most cited AI works that influence our daily life today. Each year, scientists from around the world publish thousands of research papers in AI, but only a few of them reach wide audiences and make a global impact. Below are the top-10 most impactful research papers published in top AI conferences during the last 5 years. The…

AI

5 min read



Published in Cantor’s Paradise

·Feb 25, 2021

The Easiest Unsolved Problem in Graph Theory

This post was written together with Ekaterina Vorobyeva. When I was ten, I once told my friend how great it would be if our teacher presented us with unsolved math problems. It would be much more fun to approach these in class instead of the textbook exercises. It was the time when…

Graph

9 min read



Published in Criteo R&D Blog

·Jan 14, 2021

Top Applications of Graph Neural Networks 2021

A Chinese translation is available here. At the beginning of the year, I have a feeling that Graph Neural Nets (GNNs) have become a buzzword. As a researcher in this field, I feel a little bit proud (at least not ashamed) to say that I work on this. It was not always…

Graph

8 min read



Published in Criteo R&D Blog

·Oct 15, 2020

NeurIPS 2020. Comprehensive analysis of authors, organizations, and countries.

Welcome! This post analyzes which authors and organizations publish at NeurIPS 2020 this December, similar to the analysis I did for ICML 2020. Papers are available here. The code is available here. Disclaimer: As before, such analysis is prone to minor errors due to how people write their names and…

NeurIPS

7 min read



Published in Criteo R&D Blog

·Aug 23, 2020

KDD-2020 Highlights

This year KDD gathered 346 papers (for research and applied tracks), 34 workshops, and 45 tutorials (lecture and hands-on), making it one of the biggest applied research conferences in computer science. Let’s take a look at some of the highlights of this conference. Trends of this year: let’s have a look at the word cloud…

KDD

4 min read



Published in Criteo R&D Blog

·Jun 23, 2020

Criteo papers at ICML 2020. Online learning, Optimization, and Generative models.

Criteo AI Lab has 9 accepted papers at ICML 2020. This is a new record for us and we are proud of the research and engineering team that we have! Established in 2018, Criteo AI Lab drives the research agenda for Criteo with the focus on the topics of computational…

ICML

6 min read



Published in Criteo R&D Blog

·Jun 16, 2020

ICML 2020. Comprehensive analysis of authors, organizations, and countries.

ICML is one of the most important conferences in Machine Learning and therefore it’s interesting to see who publishes at this conference. So I looked at the accepted papers for ICML 2020 and analyzed authors, organizations, and countries that participated this year. …

ICML 2020

6 min read



Published in Towards Data Science

·May 31, 2020

Why `True is False is False` -> False?

Python is cool: even after so many years of using it, there are still little peculiarities that amaze me. I recently stumbled upon a very simple line of code that most of the experienced Python programmers I know could not explain without googling it.

Programming

3 min read
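
The teaser above does not say which line of code it means, so the following is only a minimal sketch based on the post's title. It assumes the puzzle is Python's comparison chaining, where `a is b is c` is evaluated as `(a is b) and (b is c)`. The variable names are illustrative and are not taken from the article.

```python
# Python chains comparison operators, and `is` is one of them,
# so `a is b is c` is evaluated as `(a is b) and (b is c)`.
chained = True is False is False
expanded = (True is False) and (False is False)

# Explicit parentheses turn the chain into two separate comparisons.
left_grouped = (True is False) is False    # evaluates `False is False`
right_grouped = True is (False is False)   # evaluates `True is True`

print(chained, expanded, left_grouped, right_grouped)
# prints: False False True True
```

Under that assumption, the chained form is `False` because its first link, `True is False`, already fails, while either parenthesized form yields `True`.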



Published in Towards Data Science

·Apr 21, 2020

A forgotten story of Soviet AI

The names of Turing, Minsky, and McCarthy, the founders of Computer Science and Artificial Intelligence in the West, are now familiar to everybody. However, little is known about the history of AI developments behind the Iron Curtain of the USSR, although sometimes the competition between the two systems was no less…

AI

8 min read



Machine Learning research scientist with a focus on Graph Machine Learning and recommendations. t.me/graphML

Following
  • Michael Galkin
  • Yoav Goldberg
  • Olivier Koch
  • Michael Bronstein
  • Igor Rukhovich
