Biography

I’m a statistician born in São Paulo, Brazil, and graduated from the University of São Paulo (USP) in 2008. Between 2009 and 2010, I was a Master’s student at USP (Master’s dissertation). From 2010 to 2014, I was a PhD student at Carnegie Mellon University (CMU) (PhD thesis), USA. Currently, I’m at the Department of Statistics of the Federal University of São Carlos (UFSCar).

I’m interested in theory, methodology, applications, and foundations of statististics and machine learning.

Research groups:

drawing

drawing

In case you are looking for Rafael Stern, his site is here.

Interests

  • Machine Learning
  • High-dimensional Inference
  • Nonparametric Statistics
  • Bayesian Inference
  • Foundations of Statistics
  • Astrostatistics

Education

  • PhD in Statistics, 2014

    Carnegie Mellon University

  • Master in Statistics, 2010

    University of São Paulo

  • BSc in Statistics, 2009

    University of São Paulo

Recent Publications

Quickly discover relevant content by filtering publications.
(2019). Pragmatic hypotheses in the evolution of Science. Entropy.

(2019). Quantification under prior probability shift: the ratio estimator and its extensions. Journal of Machine Learning Research.

Preprint PDF

(2019). Machine learning para análises preditivas em saúde: exemplo de aplicação para predizer óbito em idosos de São Paulo. Caderno de Saúde Pública.

(2019). Agnostic tests can control the type I and type II errors simultaneously. Brazilian Journal of Probability and Statistics.

Preprint

(2019). ABC-CDE: Toward Approximate Bayesian Computation with Complex High-Dimensional Data and Limited Simulations. Journal of Computational and Graphical Statistics.

Preprint PDF

Lecture Notes

Teaching

  • Bayesian Inference (2019_02)
  • Statistical Machine Learning (graduate level; 2019_02)
  • Introduction to Statistical Planning (2019_01)
  • Data Mining (2019_01)
  • Probability and Statistics (2018_02)
  • Statistical Machine Learning (graduate level; 2018_02)
  • Statistical Inference (graduate level; 2018_01)
  • Data Mining (2018_01)
  • Introduction to Probability (2017_02)
  • Statistical Machine Learning (graduate level; 2017_02)
  • Data Mining (2017_01)
  • Biostatistics (2017_01)
  • Computational Statistics (2016_02)
  • Decision Theory (graduate level; 2016_02)
  • Data Mining (2016_01)
  • Probability Theory (graduate level; 2016_01)
  • Computational Statistics (2015_02)
  • Decision Theory (graduate level; 2015_02)
  • Data Mining (2015_01)
  • Introduction to Statistical Planning (2015_01)
  • Computational Statistics (2014_02)
  • Data Mining (2014_02)

Students

PhD

  • Afonso Fernandes Vaz – (current student)
  • Gilson Shimizu – (current student)
  • Marco Henrique de Almeida Inacio – (current student)

Master

  • Felipe Hernandez Bisca – (current student)
  • Deborah Bassi Stern – (current student)
  • Victor Coscrato – (current student)
  • Rafael de Carvalho Ceregatti – A bayesian nonparametric approach for the two-sample problem (2016-2019, co-advisor)
  • Afonso Fernandes Vaz – Improved quantification under domain shift (2016-2018)
  • Marco Henrique de Almeida Inacio – Comparing two populations using Bayesian Fourier series density estimation (2016-2017)
  • Gretta Rossi Ferreira – Estimação de densidades condicionais com aplicações à astronomia (2015-2017)

Undergraduate

  • Mateus Borges Comito (CNPq) (current student)
  • Víctor Candido Reis (CNPq; FAPESP) (current student)
  • Macela Musetti - Combining photometric redshift estimators (2018)
  • Daniel Simionato (CNPq) – Inferência Via Métodos Preditivos (2017-2018)
  • Andressa de Jesus Dantas – Understanding Zika patients (2017-2018)
  • João Dantas – Optimal strategies in pocker (2017-2018)
  • Victor Coscrato – Word2Vec vs Bag-of-Words (2017)
  • Rafael Catoia – Collective posterior: can the updating time change it? (2017)
  • Mauricio Najjar Da Silveira (CNPq) – Comparação não-paramétrica de grupos com base em estimação de densidades (2016-2017; co-advisor)
  • Ana Molina – Comparação entre métodos de construção de árvores filogenéticas (2016-2017)
  • Victor Coscrato (CNPq) – Testes de Hipóteses Agnósticos (2016-2017)
  • Douglas Raul de Freitas – Alguns aspectos sobre o bigdata na estatística (2016-2017)
  • Letícia Octaviano da Cruz (CNPq) – Monitoramento Online da Dengue (2015-2016)
  • Paula Ianishi – Técnicas de predição para dados desbalanceados aplicadas ao problema de classificação morfológica de galáxias (2015-2016)
  • Felipe Henrique Mosquetta Oliveira – Tratamento e Classificação de Dados do Twitter sobre Política e Clima (2015)
  • Bruno Roberto Guimarães – Classificação automática de resenhas sobre jogos na Google Play Store (2015)

Recent Posts

Recomendações para meus orientandos

Algumas recomendações para quem está começando a trabalhar comigo