Statistics@Naples

28-30 giugno 2023

Department of Political Sciences

Europe/Rome timezone

Contact

maria.iannario@unina.it

Subdata selection for big data regression based on leverage scores

28 giu 2023, 15:00

20m

Aula Spinelli (Department of Political Sciences)

Aula Spinelli

Department of Political Sciences

Leopoldo Rodinò road, 22/a - 80138 - Napoli, Italy

First Session

Vasilis Chasiotis (Athens University of Economics and Business, Department of Statistics)

Data continues to become more abundant, and so the datasets that contain it. Even though big datasets can present insights and opportunities, they can pose significant challenges when it comes to statistical analysis. One of the biggest challenges, required to process and analyze large datasets, is the computational resources. Regression can be problematic in case of big datasets, due to the huge volumes of data. A standard approach is subsampling that aims at obtaining the most informative portion of the big data. We consider an approach based on leverages scores, already existing in the current literature for the selection of subdata for linear model discrimination. However, we highlight its importance on the selection of data points that are the most informative for estimating unknown parameters. We conclude that the approach based on leverage scores improves existing approaches, providing simulation experiments as well as a real data application.

Vasilis Chasiotis (Athens University of Economics and Business, Department of Statistics) Prof. Dimitris Karlis (Athens University of Economics and Business, Department of Statistics)

There are no materials yet.

Statistics@Naples

Contact

Subdata selection for big data regression based on leverage scores

Aula Spinelli

Department of Political Sciences

Speaker

Description

Primary authors

Presentation Materials

Your browser is out of date!