Sample Datasets

PyDaddy comes pre-packaged with several sample datasets. These can be loaded easily using the load_sample_dataset() function.

pydaddy.load_sample_dataset(<dataset-name>)

The following sample datasets are available:

  • fish-data-etroplus: Group polarization data from a fish schooling experiment [1].

  • model-data-scalar-pairwise and model-data-scalar-ternary: Scalar (1-D) simulated datasets generated from a stochastic Gillespie simulation, with pairwise and ternary interaction models respectively [1] [2].

  • model-data-vector-pairwise and model-data-vector-ternary: Vector (2-D) simulated datasets generated from a stochastic Gillespie simulation, with pairwise and ternary interaction models respectively [1] [2].

The fish schooling dataset contains the time series of the group polarization vector $mathbf m$ (2-dimensional), for a group of 15 fish (emph{Etroplus suratensis}). The polarization time series is available at a uniform interval of 0.12 second. The dataset contains many missing data points [1].

The simulated datasets were generated using a continuous-time stochastic simulation algorithm, with pairwise and ternary interaction models respectively. Each simulated time series was resampled to a suitable uniform sampling interval [1] [2]. Simulated datasets are provided for both 1-D and 2-D.