# Sample Datasets

PyDaddy comes pre-packaged with several sample datasets. These can be loaded using the `load_sample_dataset()` function.

```
pydaddy.load_sample_dataset(<dataset-name>)
```

The following sample datasets are available:

`fish-data-etroplus`

: Group polarization data from a fish schooling experiment [1].

`model-data-scalar-pairwise` and `model-data-scalar-ternary`

: Scalar (1-D) simulated datasets generated from a stochastic Gillespie simulation, with pairwise and ternary interaction models respectively [1, 2].

`model-data-vector-pairwise` and `model-data-vector-ternary`

: Vector (2-D) simulated datasets generated from a stochastic Gillespie simulation, with pairwise and ternary interaction models respectively [1, 2].
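As a minimal sketch, the names listed above can be checked before calling the loader, so that a typo fails with a readable message. The wrapper `load_checked` and the tuple `SAMPLE_DATASETS` are hypothetical helpers, not part of PyDaddy; only `pydaddy.load_sample_dataset()` comes from the library, and whatever it returns is passed through unchanged.

```python
# Dataset names copied from the list above.
SAMPLE_DATASETS = (
    "fish-data-etroplus",
    "model-data-scalar-pairwise",
    "model-data-scalar-ternary",
    "model-data-vector-pairwise",
    "model-data-vector-ternary",
)


def load_checked(name):
    """Hypothetical helper: validate `name`, then delegate to PyDaddy."""
    if name not in SAMPLE_DATASETS:
        raise ValueError(
            f"unknown sample dataset {name!r}; expected one of {SAMPLE_DATASETS}"
        )
    # Imported lazily so the name check itself works without PyDaddy installed.
    import pydaddy

    return pydaddy.load_sample_dataset(name)
```

Calling `load_checked("model-data-scalar-pairwise")` would then forward to the loader, while a misspelled name raises `ValueError` before PyDaddy is touched.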

The fish schooling dataset contains the time series of the 2-dimensional group polarization vector $\mathbf{m}$ for a group of 15 fish (*Etroplus suratensis*). The polarization time series is sampled at a uniform interval of 0.12 seconds. The dataset contains many missing data points [1].
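To make the sampling description concrete, the sketch below builds the time axis implied by the 0.12-second interval and filters out missing samples. It assumes, purely for illustration, that a missing data point is stored as NaN in either component of the polarization vector; the actual encoding in the dataset is not stated here. The function names and the toy values are hypothetical.

```python
import math

DT = 0.12  # sampling interval in seconds, as stated above


def time_axis(n_samples, dt=DT):
    """Timestamps of a uniformly sampled series starting at t = 0."""
    return [i * dt for i in range(n_samples)]


def valid_samples(t, m):
    """Keep (time, (mx, my)) pairs whose components are both non-NaN.

    Assumption: missing points are encoded as NaN.
    """
    return [
        (ti, mi)
        for ti, mi in zip(t, m)
        if not (math.isnan(mi[0]) or math.isnan(mi[1]))
    ]


# Toy polarization series (not real data) with one missing point.
m = [(0.9, 0.1), (math.nan, math.nan), (0.7, -0.2)]
kept = valid_samples(time_axis(len(m)), m)

# Magnitude |m| of each valid sample.
magnitudes = [math.hypot(mx, my) for _, (mx, my) in kept]
```

Dropping points leaves gaps in the series, so any quantity computed from consecutive differences should use only pairs of samples that were adjacent in the original indexing.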

The simulated datasets were generated using a continuous-time stochastic simulation algorithm, with pairwise and ternary interaction models respectively. Each simulated time series was resampled to a suitable uniform sampling interval [1, 2]. Simulated datasets are provided for both the 1-D (scalar) and 2-D (vector) cases.
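The resampling step can be illustrated with a zero-order hold. A continuous-time stochastic simulation changes state at irregular event times, and the sketch below reads off the most recent state at each point of a uniform grid. This is one common choice, not necessarily the procedure used to produce the packaged datasets; `resample_uniform` and the toy event list are hypothetical.

```python
from bisect import bisect_right


def resample_uniform(event_times, states, dt, t_end):
    """Zero-order-hold resampling of a piecewise-constant trajectory.

    event_times must be sorted and begin at or before t = 0; states[i] is
    the state held from event_times[i] until the next event.
    """
    n = int(t_end / dt + 1e-9) + 1  # number of grid points in [0, t_end]
    grid = [i * dt for i in range(n)]
    # For each grid time, pick the last event at or before it.
    values = [states[bisect_right(event_times, t) - 1] for t in grid]
    return grid, values


# Toy trajectory (not real data): the state jumps at t = 0.5 and t = 1.7.
grid, values = resample_uniform([0.0, 0.5, 1.7], [1, 2, 3], dt=0.5, t_end=2.0)
```

Computing grid times as `i * dt` rather than by repeated addition avoids accumulating floating-point error over long series.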

- [1] Jhawar, J., Morris, R. G., Amith-Kumar, U. R., Danny Raj, M., Rogers, T., Rajendran, H., & Guttal, V. (2020). Noise-induced schooling of fish. Nature Physics, 16(4), 488-493.

- [2] Jhawar, J., & Guttal, V. (2020). Noise-induced effects in collective dynamics and inferring local interactions from data. Philosophical Transactions of the Royal Society B, 375(1807), 20190381.