Sample Datasets

PyDaddy comes pre-packaged with several sample datasets. These can be loaded easily using the load_sample_dataset() function.

pydaddy.load_sample_dataset(<dataset-name>)

The following sample datasets are available:

  • fish-data-etroplus: Group polarization data from a fish schooling experiment [1].

  • model-data-scalar-pairwise and model-data-scalar-ternary: Scalar (1-D) simulated datasets generated from a stochastic Gillespie simulation, with pairwise and ternary interaction models respectively [1, 2].

  • model-data-vector-pairwise and model-data-vector-ternary: Vector (2-D) simulated datasets generated from a stochastic Gillespie simulation, with pairwise and ternary interaction models respectively [1, 2].
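As a small illustration of how the names above might be used, the sketch below wraps load_sample_dataset() in a helper that rejects unknown names before calling PyDaddy. The helper load_sample and the tuple SAMPLE_DATASETS are hypothetical names introduced here, not part of PyDaddy; the return value is passed through unchanged, since its structure is not described in this section.

```python
# Dataset names exactly as listed in this section.
SAMPLE_DATASETS = (
    "fish-data-etroplus",
    "model-data-scalar-pairwise",
    "model-data-scalar-ternary",
    "model-data-vector-pairwise",
    "model-data-vector-ternary",
)


def load_sample(name):
    """Hypothetical wrapper: check the name, then defer to PyDaddy."""
    if name not in SAMPLE_DATASETS:
        raise ValueError(
            f"unknown sample dataset {name!r}; expected one of {SAMPLE_DATASETS}"
        )
    # Imported lazily so the name check works even without PyDaddy installed.
    import pydaddy

    # Whatever load_sample_dataset() returns is handed back as-is.
    return pydaddy.load_sample_dataset(name)
```

With PyDaddy installed, load_sample("fish-data-etroplus") would simply forward to pydaddy.load_sample_dataset("fish-data-etroplus"); a misspelled name fails early with a message listing the valid options.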

The fish schooling dataset contains the time series of the 2-dimensional group polarization vector $\mathbf{m}$ for a group of 15 fish (Etroplus suratensis). The polarization time series is sampled at a uniform interval of 0.12 s. The dataset contains many missing data points [1].
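To make the quantity concrete, here is a minimal sketch of how a polarization vector could be computed from individual headings, assuming the usual definition of group polarization as the average of the individuals' unit heading vectors (this section does not spell the definition out). The function names polarization and sample_times are hypothetical, and the headings are made-up inputs rather than values from the dataset.

```python
import math


def polarization(headings):
    """Average of unit heading vectors; headings are angles in radians."""
    n = len(headings)
    mx = sum(math.cos(a) for a in headings) / n
    my = sum(math.sin(a) for a in headings) / n
    return mx, my


def sample_times(n_samples, dt=0.12):
    """Time stamps for a series sampled every dt seconds (0.12 s here)."""
    return [k * dt for k in range(n_samples)]


# 15 individuals all heading the same way: |m| is 1 (fully polarized).
aligned = polarization([0.3] * 15)

# Two individuals heading in opposite directions: |m| is 0 (disordered).
opposed = polarization([0.0, math.pi])
```

Under this definition the magnitude of the vector lies between 0 and 1, which is why it is a convenient order parameter for schooling.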

The simulated datasets were generated using a continuous-time stochastic simulation algorithm, with pairwise and ternary interaction models respectively. Each simulated time series was resampled to a suitable uniform sampling interval [1, 2]. Simulated datasets are provided for both the 1-D (scalar) and 2-D (vector) cases.
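The generate-then-resample procedure described above can be sketched as follows. This is an illustration only: it assumes a two-state model in which each individual switches direction spontaneously at rate s and copies a randomly chosen individual at rate c, and it resamples by holding the most recent value (zero-order hold). The reaction set, the parameter values, and the function names gillespie_pairwise and resample_uniform are assumptions made for this example, not the settings used to produce the packaged datasets.

```python
import random


def gillespie_pairwise(n=50, s=0.25, c=1.0, t_end=20.0, seed=0):
    """Gillespie simulation of an assumed two-state pairwise-copying model.

    Returns event times and the scalar polarization m = (n1 - n2) / n.
    """
    rng = random.Random(seed)
    n1 = n // 2  # individuals in state 1; the remaining n - n1 are in state 2
    t = 0.0
    times, pol = [t], [(2 * n1 - n) / n]
    while True:
        n2 = n - n1
        # Propensities: spontaneous switching plus pairwise copying.
        up = s * n2 + c * n1 * n2 / n    # a state-2 individual joins state 1
        down = s * n1 + c * n1 * n2 / n  # a state-1 individual joins state 2
        total = up + down
        t += rng.expovariate(total)      # exponential waiting time to next event
        if t > t_end:
            break
        if rng.random() * total < up:
            n1 += 1
        else:
            n1 -= 1
        times.append(t)
        pol.append((2 * n1 - n) / n)
    return times, pol


def resample_uniform(times, values, dt, n_samples):
    """Zero-order hold: at each k * dt, take the most recent recorded value."""
    out, i = [], 0
    for k in range(n_samples):
        tk = k * dt
        while i + 1 < len(times) and times[i + 1] <= tk:
            i += 1
        out.append(values[i])
    return out


times, pol = gillespie_pairwise()
uniform = resample_uniform(times, pol, dt=0.1, n_samples=200)
```

The event times produced by the simulation are irregular, so the resampling step is what yields a series with a fixed sampling interval. A ternary variant would add a propensity term involving three individuals.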

[1] Jhawar, J., Morris, R. G., Amith-Kumar, U. R., Danny Raj, M., Rogers, T., Rajendran, H., & Guttal, V. (2020). Noise-induced schooling of fish. Nature Physics, 16(4), 488-493.

[2] Jhawar, J., & Guttal, V. (2020). Noise-induced effects in collective dynamics and inferring local interactions from data. Philosophical Transactions of the Royal Society B, 375(1807), 20190381.