The NDDataset object

The NDDataset is the main object used by SpectroChemPy.

Like numpy ndarrays, NDDatasets have the capability to be sliced, sorted and subjected to mathematical operations.

But, in addition, NDDatasets may have units, can be masked and each dimension can have coordinates also with units. This makes NDDatasets aware of units compatibility, e.g., for binary operations such as addition or subtraction or during the application of mathematical operations. In addition to or in replacement of numerical data for coordinates, NDDatasets can also have labeled coordinates where labels can be different kinds of objects (strings, datetime objects, numpy ndarrays or other NDDatasets, etc.).

This offers a lot of flexibility in using NDDatasets that, we hope, will be useful for applications. See the Examples for additional information about such possible applications.

Table of Contents

Introduction
- 1D-Dataset (unidimensional dataset)
- nD-Dataset (multidimensional dataset)
Metadata and Attributes
- About dates and times
- About the history attribute
Units
Coordinates
Methods to create NDDataset

Introduction

Below (and in the next sections), we try to give an almost complete view of the NDDataset features.

As we will make some reference to the numpy library, we also import it here.

[1]:

import numpy as np

import spectrochempy as scp

SpectroChemPy's API - v.0.8.2.dev16
©Copyright 2014-2025 - A.Travert & C.Fernandez @ LCS

We additionally import the three main SpectroChemPy objects that we will use through this tutorial

[2]:

from spectrochempy import Coord
from spectrochempy import CoordSet
from spectrochempy import NDDataset

Running on GitHub Actions
MPL Configuration directory: /home/runner/.config/matplotlib
Stylelib directory: /home/runner/.config/matplotlib/stylelib

For a convenient usage of units, we will also directly import ur, the unit registry which contains all available units.

[3]:

from spectrochempy import ur

Multidimensional arrays are defined in Spectrochempy using the NDDataset object.

NDDataset objects mostly behave like numpy’s numpy.ndarray (see for instance numpy quickstart tutorial).

However, unlike raw numpy arrays, the presence of optional properties makes them (hopefully) more appropriate for handling spectroscopic information, which is one of the major objectives of the SpectroChemPy package:

mask: Data can be partially masked at will
units: Data can have units, allowing units-aware operations
CoordSet: Data can have a set of coordinates, one or several per dimension

Additional metadata can also be added to the instances of this class through the meta properties.

1D-Dataset (unidimensional dataset)

In the following example, a minimal 1D dataset is created from a simple list, to which we can add some metadata:

[4]:

d1D = NDDataset(
    [10.0, 20.0, 30.0],
    name="Dataset N1",
    author="Blake and Mortimer",
    description="A dataset from scratch",
    history="creation",
)
d1D

[4]:

NDDataset: [float64] unitless (size: 3)[Dataset N1]

Summary

name

:

Dataset N1

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

A dataset from scratch

history

:

2025-07-13 02:00:22+00:00> Creation

Data

title

:

values

:

...

[ 10 20 30]

size

:

3

[5]:

print(d1D)

NDDataset: [float64] unitless (size: 3)

[6]:

d1D.plot(figsize=(3, 2))

[6]:

../../../_images/userguide_objects_dataset_dataset_17_1.png

Except few additional metadata such author , created …, there is not much difference with respect to a conventional numpy.array. For example, one can apply numpy ufunc‘s directly to a NDDataset or make basic arithmetic operation with these objects:

[7]:

np.sqrt(d1D)

[7]:

NDDataset: [float64] unitless (size: 3)[Dataset N1]

Summary

name

:

Dataset N1

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

A dataset from scratch

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Ufunc sqrt applied.

Data

title

:

sqrt()

values

:

...

[ 3.162 4.472 5.477]

size

:

3

[8]:

d1D += d1D / 2.0
d1D

[8]:

NDDataset: [float64] unitless (size: 3)[Dataset N1]

Summary

name

:

Dataset N1

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

A dataset from scratch

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Inplace binary op: iadd with `Dataset N1`

Data

title

:

values

:

...

[ 15 30 45]

size

:

3

As seen above, there are some attributes that are automatically added to the dataset:

id: This is a unique identifier for the object.
name: A short and unique name for the dataset. It will be equal to the automatic id if it is not provided.
author: Author determined from the computer name if not provided.
created: Date and time of creation.
modified: Date and time of modification.

These attributes can be modified by the user, but the id, created and modified attributes are read only.

Some other attributes are defined to describe the data:

title: A long name that will be used in plots or in some other operations.
history: History of operations performed on the object since its creation.
description: A comment or a description of the object’s purpose or contents.
origin: An optional reference to the source of the data.

Here is an example of the use of the NDDataset attributes:

[9]:

d1D.title = "intensity"
d1D.name = "mydataset"
d1D.history = "created from scratch"
d1D.description = "Some experimental measurements"
d1D

[9]:

NDDataset: [float64] unitless (size: 3)[mydataset]

Summary

name

:

mydataset

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

Some experimental measurements

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Inplace binary op: iadd with `Dataset N1`
2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

intensity

values

:

...

[ 15 30 45]

size

:

3

d1D is a 1D (1-dimensional) dataset with only one dimension.

Some attributes are useful to check this kind of information:

[10]:

d1D.shape  # the shape of 1D contain only one dimension size

[10]:

(3,)

[11]:

d1D.ndim  # the number of dimensions

[11]:

[12]:

d1D.dims  # the name of the dimension (it has been automatically attributed)

[12]:

['x']

Note: The names of the dimensions are set automatically. But they can be changed, with the limitation that the name must be a single letter.

[13]:

d1D.dims = ["q"]  # change the list of dim names.

[14]:

d1D.dims

[14]:

['q']

nD-Dataset (multidimensional dataset)

To create a nD NDDataset, we can provide a nD-array like object to the NDDataset instance constructor

[15]:

a = np.random.rand(2, 4, 6)
a

[15]:

array([[[  0.1736,   0.8456, ...,   0.4469,   0.1069],
        [  0.2934,   0.6779, ...,   0.2674,    0.366],
        [ 0.05894,   0.8487, ...,   0.6593,    0.966],
        [  0.1632,   0.5269, ...,   0.7561,   0.6546]],

       [[  0.6393,   0.4299, ...,   0.3385,    0.868],
        [  0.2947,   0.4181, ...,    0.277,   0.9911],
        [  0.1561,   0.2507, ...,   0.6906,   0.2049],
        [  0.4501,   0.8858, ...,  0.08974,  0.09001]]], shape=(2, 4, 6))

[16]:

d3D = NDDataset(a)
d3D.title = "energy"
d3D.author = "Someone"
d3D.name = "3D dataset creation"
d3D.history = "created from scratch"
d3D.description = "Some example"
d3D.dims = ["u", "v", "t"]
d3D

[16]:

NDDataset: [float64] unitless (shape: (u:2, v:4, t:6))[3D dataset creation]

Summary

name

:

3D dataset creation

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

Some example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(u:2, v:4, t:6)

We can also add all information in a single statement

[17]:

d3D = NDDataset(
    a,
    dims=["u", "v", "t"],
    title="Energy",
    author="Someone",
    name="3D_dataset",
    history="created from scratch",
    description="a single statement creation example",
)
d3D

[17]:

NDDataset: [float64] unitless (shape: (u:2, v:4, t:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(u:2, v:4, t:6)

Three names are attributed at creation (if they are not provided with the dims attribute, then the names ‘z’,’y’,’x’ are automatically attributed)

[18]:

d3D.dims

[18]:

['u', 'v', 't']

[19]:

d3D.ndim

[19]:

[20]:

d3D.shape

[20]:

(2, 4, 6)

About dates and times

The dates and times are stored internally as UTC (Coordinated Universal Time). Timezone information is stored in the timezone attribute. If not set, the default is to use the local timezone, which is probably the most common case.

[21]:

nd = NDDataset()
nd.created

[21]:

'2025-07-13 02:00:22+00:00'

In this case our local timezone has been used by default for the conversion from UTC datetime.

[22]:

nd.local_timezone

[22]:

'Etc/UTC'

[23]:

nd.timezone = "EST"
nd.created

[23]:

'2025-07-12 21:00:22-05:00'

For a list of timezone code (TZ) you can have a look at List_of_tz_database_time_zones.

About the `history` attribute

The history is saved internally as a list, but it has a different behavior than the usual list. The first time a NDDataset is created, the list is empty.

[24]:

nd = NDDataset()
nd.history

[24]:

[]

Assigning a string to the history attribute has two effects. First, the string is appended automatically to the previous history list, and second, it is preceded by the time it was added.

[25]:

nd.history = "some history"
nd.history = "another history to append"
nd.history = "..."
nd.history

[25]:

['2025-07-13 02:00:22+00:00> Some history',
 '2025-07-13 02:00:22+00:00> Another history to append',
 '2025-07-13 02:00:22+00:00> ...']

If you want to erase the history, assign an empty list

[26]:

nd.history = []
nd.history

[26]:

[]

If you want to replace the full history, use brackets around your history line:

[27]:

nd.history = "Created form scratch"
nd.history = "a second ligne that will be erased"
nd.history = ["A more interesting message"]
nd.history

[27]:

['2025-07-13 02:00:22+00:00> A more interesting message']

Units

One interesting feature of NDDataset is the ability to define units for the internal data.

[28]:

d1D.units = ur.eV  # ur is a registry containing all available units

[29]:

d1D  # note the eV symbol of the units added to the values field below

[29]:

NDDataset: [float64] eV (size: 3)[mydataset]

Summary

name

:

mydataset

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

Some experimental measurements

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Inplace binary op: iadd with `Dataset N1`
2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

intensity

values

:

...

[ 15 30 45] eV

size

:

3

This allows to make units-aware calculations:

[30]:

d1D**2  # note the results in eV^2

[30]:

NDDataset: [float64] eV² (size: 3)[mydataset]

Summary

name

:

mydataset

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

Some experimental measurements

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Inplace binary op: iadd with `Dataset N1`
2025-07-13 02:00:22+00:00> Created from scratch
2025-07-13 02:00:22+00:00> Binary operation pow with `2` has been performed

Data

title

:

intensity

values

:

...

[ 225 900 2025] eV²

size

:

3

[31]:

np.sqrt(d1D)  # note the result in e^0.5

[31]:

NDDataset: [float64] eV⁰⋅⁵ (size: 3)[mydataset]

Summary

name

:

mydataset

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

Some experimental measurements

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Inplace binary op: iadd with `Dataset N1`
2025-07-13 02:00:22+00:00> Created from scratch
2025-07-13 02:00:22+00:00> Ufunc sqrt applied.

Data

title

:

sqrt(intensity)

values

:

...

[ 3.873 5.477 6.708] eV⁰⋅⁵

size

:

3

[32]:

time = 5.0 * ur.second
d1D / time  # here we get results in eV/s

[32]:

NDDataset: [float64] eV⋅s⁻¹ (size: 3)[mydataset]

Summary

name

:

mydataset

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

Some experimental measurements

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Inplace binary op: iadd with `Dataset N1`
2025-07-13 02:00:22+00:00> Created from scratch
2025-07-13 02:00:22+00:00> Binary operation truediv with `5.0 s` has been performed

Data

title

:

intensity

values

:

...

[ 3 6 9] eV⋅s⁻¹

size

:

3

Conversion can be done between different units transparently

[33]:

d1D.to("J")

[33]:

NDDataset: [float64] J (size: 3)[mydataset]

Summary

name

:

mydataset

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

Some experimental measurements

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Inplace binary op: iadd with `Dataset N1`
2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

intensity

values

:

...

[2.403e-18 4.807e-18 7.21e-18] J

size

:

3

[34]:

d1D.to("K")

[34]:

NDDataset: [float64] K (size: 3)[mydataset]

Summary

name

:

mydataset

author

:

Blake and Mortimer

created

:

2025-07-13 02:00:22+00:00

description

:

Some experimental measurements

history

:

2025-07-13 02:00:22+00:00> Creation
2025-07-13 02:00:22+00:00> Inplace binary op: iadd with `Dataset N1`
2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

intensity

values

:

...

[1.741e+05 3.481e+05 5.222e+05] K

size

:

3

For more examples on how to use units with NDDataset, see the gallery example

Coordinates

The above created d3D dataset has 3 dimensions, but no coordinates for these dimensions. Here arises a big difference with simple numpy arrays:

We can add coordinates to each dimension of a NDDataset.

To get the list of all defined coordinates, we can use the coords attribute:

[35]:

d3D.coordset  # no coordinates, so it returns nothing (None)

[36]:

d3D.t  # the same for coordinate  t, v, u which are not yet set

To add coordinates, one way is to set them one by one:

[37]:

d3D.t = (
    Coord.arange(6) * 0.1
)  # we need a sequence of 6 values for `t` dimension (see shape above)
d3D.t.title = "time"
d3D.t.units = ur.seconds
d3D.coordset  # now return a list of coordinates

[37]:

CoordSet: [t:time, u:, v:][CoordSet_22b89609]

Dimension `t`

size

:

6

title

:

time

coordinates

:

[ 0 0.1 0.2 0.3 0.4 0.5] s

[38]:

d3D.t

[38]:

Coord: [float64] s (size: 6)[t]

Summary

size

:

6

title

:

time

coordinates

:

[ 0 0.1 0.2 0.3 0.4 0.5] s

[39]:

d3D.coordset("t")  # Alternative way to get a given coordinates

[39]:

Coord: [float64] s (size: 6)[t]

Summary

size

:

6

title

:

time

coordinates

:

[ 0 0.1 0.2 0.3 0.4 0.5] s

[40]:

d3D["t"]  # another alternative way to get a given coordinates

[40]:

Coord: [float64] s (size: 6)[t]

Summary

size

:

6

title

:

time

coordinates

:

[ 0 0.1 0.2 0.3 0.4 0.5] s

The two other coordinates u and v are still undefined

[41]:

d3D.u, d3D.v

[41]:

(Coord: empty, Coord: empty)

When the dataset is printed, only the information for the existing coordinates is given.

[42]:

d3D

[42]:

NDDataset: [float64] unitless (shape: (u:2, v:4, t:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(u:2, v:4, t:6)

Dimension `t`

size

:

6

title

:

time

coordinates

:

[ 0 0.1 0.2 0.3 0.4 0.5] s

Programmatically, we can use the attribute is_empty or has_data to check this

[43]:

d3D.v.has_data, d3D.v.is_empty

[43]:

(False, True)

An error is raised when a coordinate doesn’t exist

[44]:

try:
    d3D.x
except Exception as e:
    scp.error_(Exception, e)

 ERROR | Exception:

In some case it can also be useful to get a coordinate from its title instead of its name (the limitation is that if several coordinates have the same title, then only the first ones that is found in the coordinate list, will be returned - this can be ambiguous)

[45]:

d3D["time"]

[45]:

Coord: [float64] s (size: 6)[t]

Summary

size

:

6

title

:

time

coordinates

:

[ 0 0.1 0.2 0.3 0.4 0.5] s

[46]:

d3D.time

[46]:

Coord: [float64] s (size: 6)[t]

Summary

size

:

6

title

:

time

coordinates

:

[ 0 0.1 0.2 0.3 0.4 0.5] s

Labels

It is possible to use labels instead of numerical coordinates. Labels are sequences of objects. The length of the sequence must be equal to the size of the dimension.

[47]:

tags = list("ab")
d3D.u.title = "some tags"
d3D.u.labels = tags
d3D

[47]:

NDDataset: [float64] unitless (shape: (u:2, v:4, t:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(u:2, v:4, t:6)

Dimension `t`

size

:

6

title

:

time

coordinates

:

[ 0 0.1 0.2 0.3 0.4 0.5] s

Dimension `u`

size

:

2

title

:

some tags

labels

:

[ a b]

or more complex objects.

For instance here we use datetime.timedelta objects:

[48]:

from datetime import timedelta

start = timedelta(0)
times = [start + timedelta(seconds=x * 60) for x in range(6)]
d3D.t = None
d3D.t.labels = times
d3D.t.title = "time"
d3D

[48]:

NDDataset: [float64] unitless (shape: (u:2, v:4, t:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(u:2, v:4, t:6)

Dimension `t`

size

:

6

title

:

time

labels

:

[ 0:00:00 0:01:00 0:02:00 0:03:00 0:04:00 0:05:00]

Dimension `u`

size

:

2

title

:

some tags

labels

:

[ a b]

In this case, getting a coordinate that doesn’t possess numerical data but labels, will return the labels

[49]:

d3D.time

[49]:

Coord: [labels] [ 0:00:00 0:01:00 0:02:00 0:03:00 0:04:00 0:05:00] (size: 6)[t]

Summary

size

:

6

title

:

time

labels

:

[ 0:00:00 0:01:00 0:02:00 0:03:00 0:04:00 0:05:00]

More insight on coordinates

Setting coordinates using `set_coordset`

Let’s create 3 Coord objects to be used as coordinates for the 3 dimensions of the previous d3D dataset.

[52]:

d3D.dims = ["t", "v", "u"]
s0, s1, s2 = d3D.shape
coord0 = Coord.linspace(10.0, 100.0, s0, units="m", title="distance")
coord1 = Coord.linspace(20.0, 25.0, s1, units="K", title="temperature")
coord2 = Coord.linspace(0.0, 1000.0, s2, units="hour", title="elapsed time")

Syntax 1

[53]:

d3D.set_coordset(u=coord2, v=coord1, t=coord0)
d3D

[53]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

Syntax 2

[54]:

d3D.set_coordset({"u": coord2, "v": coord1, "t": coord0})
d3D

[54]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

Adding several coordinates to a single dimension

We can add several coordinates to the same dimension

[55]:

coord1b = Coord([1, 2, 3, 4], units="millitesla", title="magnetic field")

[56]:

d3D.set_coordset(u=coord2, v=[coord1, coord1b], t=coord0)
d3D

[56]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

We can retrieve the various coordinates for a single dimension easily:

[57]:

d3D.v_1

[57]:

Coord: [float64] mT (size: 4)[_1]

Summary

size

:

4

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

Math operations on coordinates

Arithmetic operations can be performed on single coordinates:

[58]:

d3D.u = d3D.u * 2
d3D.u

[58]:

Coord: [float64] h (size: 6)[u]

Summary

size

:

6

title

:

elapsed time

coordinates

:

[ 0 400 800 1200 1600 2000] h

The ufunc numpy functions can also be applied, and will affect both the magnitude and the units of the coordinates:

[59]:

d3D.u = 1.5 + np.sqrt(d3D.u)
d3D.u

[59]:

Coord: [float64] h⁰⋅⁵ (size: 6)[u]

Summary

size

:

6

title

:

sqrt(elapsed time)

coordinates

:

[ 1.5 21.5 29.78 36.14 41.5 46.22] h⁰⋅⁵

A particularly frequent use case is to subtract the initial value from a coordinate. This can be done directly with the - operator:

[60]:

d3D.u = d3D.u - d3D.u[0]
d3D.u

[60]:

Coord: [float64] h⁰⋅⁵ (size: 6)[u]

Summary

size

:

6

title

:

sqrt(elapsed time)

coordinates

:

[ 0 20 28.28 34.64 40 44.72] h⁰⋅⁵

The operations above will generally not work on multiple coordinates, and will raise an error if attempted:

[61]:

try:
    d3D.v = d3D.v - 1.5
except NotImplementedError as e:
    scp.error_(NotImplementedError, e)

 ERROR | NotImplementedError: Subtraction f a CoordSet with an object of type <class 'float'> is not implemented yet

Only subtraction between multiple coordinates is allowed, and will return a new CoordSet where each coordinate has been subtracted:

[62]:

d3D.v = d3D.v - d3D.v[0]
d3D.v

[62]:

CoordSet: [_1:magnetic field, _2:temperature][v]

Dimension `_1`

size

:

4

title

:

magnetic field

coordinates

:

[ 0 1 2 3] mT

Dimension `_2`

size

:

4

title

:

temperature

coordinates

:

[ 0 1.667 3.333 5] K

It is always possible to carry out operations on a given coordinate of a CoordSet. This must be done by accessing the coordinate by its name, e.g. 'temperature' or '_2' for the second coordinate of the v dimension:

[63]:

d3D.v["_2"] = d3D.v["_2"] + 5.0
d3D.v

[63]:

CoordSet: [_1:magnetic field, _2:temperature][v]

Dimension `_1`

size

:

4

title

:

magnetic field

coordinates

:

[ 0 1 2 3] mT

Dimension `_2`

size

:

4

title

:

temperature

coordinates

:

[ 5 6.667 8.333 10] K

Summary of the coordinate setting syntax

Some additional information about coordinate setting syntax

A. First syntax (probably the safer because the name of the dimension is specified, so this is less prone to errors!)

[64]:

d3D.set_coordset(u=coord2, v=[coord1, coord1b], t=coord0)
# or equivalent
d3D.set_coordset(u=coord2, v=CoordSet(coord1, coord1b), t=coord0)
d3D

[64]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

B. Second syntax assuming the coordinates are given in the order of the dimensions.

Remember that we can check this order using the dims attribute of a NDDataset

[65]:

d3D.dims

[65]:

['t', 'v', 'u']

[66]:

d3D.set_coordset((coord0, [coord1, coord1b], coord2))
# or equivalent
d3D.set_coordset(coord0, CoordSet(coord1, coord1b), coord2)
d3D

[66]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

C. Third syntax (from a dictionary)

[67]:

d3D.set_coordset({"t": coord0, "u": coord2, "v": [coord1, coord1b]})
d3D

[67]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

D. It is also possible to use directly the CoordSet property

[68]:

d3D.coordset = coord0, [coord1, coord1b], coord2
d3D

[68]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

(_2)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

[69]:

d3D.coordset = {"t": coord0, "u": coord2, "v": [coord1, coord1b]}
d3D

[69]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

[70]:

d3D.coordset = CoordSet(t=coord0, u=coord2, v=[coord1, coord1b])
d3D

[70]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

WARNING

Do not use lists for setting multiple coordinates across different dimensions! Use tuples instead.

Lists have a special meaning in SpectroChemPy - they’re used to set multiple coordinates for the same dimension.

This raises an error (lists have another meaning: they’re used to set multiple coordinates for the same dimension, as shown in examples A and B above):

[71]:

try:
    d3D.coordset = [coord0, coord1, coord2]
except ValueError:
    scp.error_(
        ValueError,
        "Coordinates must be of the same size for a dimension with multiple coordinates",
    )

 ERROR | ValueError: Coordinates must be of the same size for a dimension with multiple coordinates

This works: it uses a tuple (), not a list []

[72]:

d3D.coordset = (
    coord0,
    coord1,
    coord2,
)  # equivalent to d3D.coordset = coord0, coord1, coord2
d3D

[72]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

E. Setting the coordinates individually

Either a single coordinate

[73]:

d3D.u = coord2
d3D

[73]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

or multiple coordinates for a single dimension

[74]:

d3D.v = [coord1, coord1b]
d3D

[74]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

(_2)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

or using a CoordSet object.

[75]:

d3D.v = CoordSet(coord1, coord1b)
d3D

[75]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

Someone

created

:

2025-07-13 02:00:22+00:00

description

:

a single statement creation example

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

Methods to create NDDataset

There are many ways to create NDDataset objects.

Let’s first create 2 coordinate objects, for which we can define labels and units. Note the use of the function linspace to generate the data.

[76]:

c0 = Coord.linspace(
    start=4000.0, stop=1000.0, num=5, labels=None, units="cm^-1", title="wavenumber"
)

[77]:

c1 = Coord.linspace(
    10.0, 40.0, 3, labels=["Cold", "RT", "Hot"], units="K", title="temperature"
)

The full coordset will be the following

[78]:

cs = CoordSet(c0, c1)
cs

[78]:

CoordSet: [x:temperature, y:wavenumber][CoordSet_231a8c12]

Dimension `x`

size

:

3

title

:

temperature

coordinates

:

[ 10 25 40] K

labels

:

[ Cold RT Hot]

Dimension `y`

size

:

5

title

:

wavenumber

coordinates

:

[ 4000 3250 2500 1750 1000] cm⁻¹

Now we will generate the full dataset using the fromfunction method. All needed information is passed as parameters to the NDDataset constructor.

Create a dataset from a function

[79]:

def func(x, y, extra):
    return x * y / extra

[80]:

ds = NDDataset.fromfunction(
    func,
    extra=100 * ur.cm**-1,  # extra arguments passed to the function
    coordset=cs,
    name="mydataset",
    title="absorbance",
    units=None,
)  # when None, units will be determined from the function results

ds.description = """Dataset example created for this tutorial.
It's a 2-D dataset"""

ds.author = "Blake & Mortimer"
ds

[80]:

NDDataset: [float64] K (shape: (y:5, x:3))[mydataset]

Summary

name

:

mydataset

author

:

Blake & Mortimer

created

:

2025-07-13 02:00:23+00:00

description

:

Dataset example created for this tutorial.
It's a 2-D dataset

history

:

2025-07-13 02:00:23+00:00> Created using method : fromfunction

Data

title

:

absorbance

values

:

...

[[ 400 1000 1600]
[ 325 812.5 1300]
...
[ 175 437.5 700]
[ 100 250 400]] K

shape

:

(y:5, x:3)

Dimension `x`

size

:

3

title

:

temperature

coordinates

:

[ 10 25 40] K

labels

:

[ Cold RT Hot]

Dimension `y`

size

:

5

title

:

wavenumber

coordinates

:

[ 4000 3250 2500 1750 1000] cm⁻¹

Using numpy-like constructors of NDDatasets

[81]:

dz = NDDataset.zeros(
    (5, 3), coordset=cs, units="meters", title="Datasets with only zeros"
)

[82]:

do = NDDataset.ones(
    (5, 3), coordset=cs, units="kilograms", title="Datasets with only ones"
)

[83]:

df = NDDataset.full(
    (5, 3), fill_value=1.25, coordset=cs, units="radians", title="with only float=1.25"
)
df

[83]:

NDDataset: [float64] rad (shape: (y:5, x:3))[NDDataset_231a8c26]

Summary

name

:

NDDataset_231a8c26

author

:

runner@pkrvmq0rgcvqdmg

created

:

2025-07-13 02:00:23+00:00

history

:

2025-07-13 02:00:23+00:00> Created using method : full

Data

title

:

with only float=1.25

values

:

...

[[ 1.25 1.25 1.25]
[ 1.25 1.25 1.25]
...
[ 1.25 1.25 1.25]
[ 1.25 1.25 1.25]] rad

shape

:

(y:5, x:3)

Dimension `x`

size

:

3

title

:

temperature

coordinates

:

[ 10 25 40] K

labels

:

[ Cold RT Hot]

Dimension `y`

size

:

5

title

:

wavenumber

coordinates

:

[ 4000 3250 2500 1750 1000] cm⁻¹

As with numpy, it is also possible to take another dataset as a template:

[84]:

df = NDDataset.full_like(d3D, dtype="int", fill_value=2)
df

[84]:

NDDataset: [float64] unitless (shape: (t:2, v:4, u:6))[3D_dataset]

Summary

name

:

3D_dataset

author

:

runner@pkrvmq0rgcvqdmg

created

:

2025-07-13 02:00:23+00:00

history

:

2025-07-13 02:00:22+00:00> Created from scratch
2025-07-13 02:00:23+00:00> Created using method : full_like

Data

title

:

Energy

values

:

...

[[[ 2 2 ... 2 2]
[ 2 2 ... 2 2]
[ 2 2 ... 2 2]
[ 2 2 ... 2 2]]

[[ 2 2 ... 2 2]
[ 2 2 ... 2 2]
[ 2 2 ... 2 2]
[ 2 2 ... 2 2]]]

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

[85]:

nd = NDDataset.diag((3, 3, 2.5))
nd

[85]:

NDDataset: [float64] unitless (shape: (y:3, x:3))[NDDataset_231a8c4a]

Summary

name

:

NDDataset_231a8c4a

author

:

runner@pkrvmq0rgcvqdmg

created

:

2025-07-13 02:00:23+00:00

history

:

2025-07-13 02:00:23+00:00> Created using method : diag

Data

title

:

values

:

...

[[ 3 0 0]
[ 0 3 0]
[ 0 0 2.5]]

shape

:

(y:3, x:3)

Copying existing NDDataset

To copy an existing dataset, this is as simple as:

[86]:

d3D_copy = d3D.copy()

or alternatively:

[87]:

d3D_copy = d3D[:]

Finally, it is also possible to initialize a dataset using an existing one:

[88]:

d3Dduplicate = NDDataset(d3D, name=f"duplicate of {d3D.name}", units="absorbance")
d3Dduplicate

[88]:

NDDataset: [float64] a.u. (shape: (t:2, v:4, u:6))[duplicate of 3D_dataset]

Summary

name

:

duplicate of 3D_dataset

author

:

runner@pkrvmq0rgcvqdmg

created

:

2025-07-13 02:00:23+00:00

history

:

2025-07-13 02:00:22+00:00> Created from scratch

Data

title

:

Energy

values

:

...

[[[ 0.1736 0.8456 ... 0.4469 0.1069]
[ 0.2934 0.6779 ... 0.2674 0.366]
[ 0.05894 0.8487 ... 0.6593 0.966]
[ 0.1632 0.5269 ... 0.7561 0.6546]]

[[ 0.6393 0.4299 ... 0.3385 0.868]
[ 0.2947 0.4181 ... 0.277 0.9911]
[ 0.1561 0.2507 ... 0.6906 0.2049]
[ 0.4501 0.8858 ... 0.08974 0.09001]]] a.u.

shape

:

(t:2, v:4, u:6)

Dimension `t`

size

:

2

title

:

distance

coordinates

:

[ 10 100] m

Dimension `u`

size

:

6

title

:

elapsed time

coordinates

:

[ 0 200 400 600 800 1000] h

Dimension `v`

size

:

4

(_1)

title

:

magnetic field

coordinates

:

[ 1 2 3 4] mT

(_2)

title

:

temperature

coordinates

:

[ 20 21.67 23.33 25] K

Importing from external datasets

NDDatasets can be created from the importation of external data.

A test data folder contains some sample data for experimenting with features of datasets.

[89]:

# let check if this directory exists and display its actual content:
datadir = scp.preferences.datadir
if datadir.exists():
    print(datadir.name)

testdata

Let’s load grouped IR spectra acquired using OMNIC:

[90]:

nd = scp.read_omnic(datadir / "irdata/nh4y-activation.spg")
scp.preferences.reset()
nd.plot()

[90]:

../../../_images/userguide_objects_dataset_dataset_175_1.png

Even if we do not specify the datadir, the application first looks in the default directory.

Now, lets load a NMR dataset (in the Bruker format).

[91]:

path = datadir / "nmrdata" / "bruker" / "tests" / "nmr" / "topspin_2d"

# load the data directly (no need to create the dataset first)
nd2 = scp.read_topspin(path, expno=1, remove_digital_filter=True)

# view it...
nd2.x.to("s")
nd2.y.to("ms")

ax = nd2.plot(method="map")

 WARNING | (UserWarning) (196608,)cannot be shaped into(147, 1024)

../../../_images/userguide_objects_dataset_dataset_178_1.png

The NDDataset object

Table of Contents

Introduction

1D-Dataset (unidimensional dataset)

nD-Dataset (multidimensional dataset)

About dates and times

About the history attribute

Units

Coordinates

Labels

More insight on coordinates

Sharing coordinates between dimensions

Setting coordinates using set_coordset

Syntax 1

Syntax 2

Adding several coordinates to a single dimension

Math operations on coordinates

Summary of the coordinate setting syntax

Methods to create NDDataset

Create a dataset from a function

Using numpy-like constructors of NDDatasets

Copying existing NDDataset

Importing from external datasets

About the `history` attribute

Setting coordinates using `set_coordset`