fairseq2.device

This module provides abstractions for managing PyTorch devices and handling CUDA contexts.

Interfaces

class fairseq2.device.CudaContext[source]

Bases: ABC

Represents an interface for interacting with the CUDA runtime and querying device information.

abstract is_available() → bool[source]
abstract device_count() → int[source]
abstract get_device_properties(device: device) → Any[source]
abstract memory_stats(device: device) → dict[str, Any][source]
abstract reset_peak_memory_stats() → None[source]
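
As an illustration, the interface above can be satisfied by a lightweight fake for unit tests that must not touch real CUDA hardware. The sketch below is hypothetical and not part of fairseq2; it assumes only the abstract method signatures listed above, with device referring to torch.device.

from typing import Any

import torch

from fairseq2.device import CudaContext


class FakeCudaContext(CudaContext):
    """A hypothetical CudaContext that reports canned values for tests."""

    def is_available(self) -> bool:
        return True

    def device_count(self) -> int:
        return 2

    def get_device_properties(self, device: torch.device) -> Any:
        return {"name": "FakeGPU", "total_memory": 16 * 1024**3}

    def memory_stats(self, device: torch.device) -> dict[str, Any]:
        return {"allocated_bytes.all.current": 0}

    def reset_peak_memory_stats(self) -> None:
        pass
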
class fairseq2.device.SupportsDeviceTransfer[source]

Bases: ABC

Represents an object that can be transferred between devices.

abstract to(device: device, *, non_blocking: bool = False) → None[source]

Transfers this object to the specified device.

If non_blocking is True, the transfer will be performed asynchronously if possible.
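
For example, a batch object holding several tensors can satisfy this interface by moving each tensor. The following is a minimal sketch assuming only the to() signature above; the Batch class itself is hypothetical.

import torch

from fairseq2.device import SupportsDeviceTransfer


class Batch(SupportsDeviceTransfer):
    """A hypothetical container that moves its tensors between devices."""

    def __init__(self, seqs: torch.Tensor, targets: torch.Tensor) -> None:
        self.seqs = seqs
        self.targets = targets

    def to(self, device: torch.device, *, non_blocking: bool = False) -> None:
        # Per the interface, the transfer mutates this object in place
        # instead of returning a moved copy.
        self.seqs = self.seqs.to(device, non_blocking=non_blocking)
        self.targets = self.targets.to(device, non_blocking=non_blocking)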

Classes

final class fairseq2.device.StandardCudaContext[source]

Bases: CudaContext

Represents the standard implementation of CudaContext.

is_available() → bool[source]
device_count() → int[source]
get_device_properties(device: device) → Any[source]
memory_stats(device: device) → dict[str, Any][source]
reset_peak_memory_stats() → None[source]
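
A brief usage sketch; it assumes StandardCudaContext can be constructed without arguments, which the signature above does not spell out.

import torch

from fairseq2.device import StandardCudaContext

ctx = StandardCudaContext()

if ctx.is_available():
    device = torch.device("cuda:0")

    print(ctx.device_count())
    print(ctx.get_device_properties(device))
    print(ctx.memory_stats(device))

    # Reset peak-memory counters between measurement windows.
    ctx.reset_peak_memory_stats()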

Functions

fairseq2.device.get_default_device() → device[source]

Returns the default device of this process.

The default device is determined by the following precedence:

  1. If the FAIRSEQ2_DEVICE environment variable is set, the specified device will be used.

  2. If CUDA is enabled and the CUDA_VISIBLE_DEVICES environment variable contains a single device, the specified device will be used.

  3. If CUDA is enabled and the LOCAL_RANK environment variable is set, the CUDA device at the specified index will be used.

  4. Otherwise, CPU will be used.

Raises:

LocalRankOutOfRangeError – If the value of the LOCAL_RANK environment variable is negative or exceeds the number of available devices.
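
For instance, the first rule in the precedence list can be exercised by setting FAIRSEQ2_DEVICE before querying the default device. A minimal sketch, assuming the variable is read at call time rather than at import time:

import os

import torch

from fairseq2.device import get_default_device

# Precedence rule 1: an explicit FAIRSEQ2_DEVICE wins over CUDA-based
# detection. Any device string accepted by torch.device would work here.
os.environ["FAIRSEQ2_DEVICE"] = "cpu"

assert get_default_device() == torch.device("cpu")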

fairseq2.device.get_current_device() → device[source]

Returns the current device of the calling thread.

The current device of a thread can be changed by using a torch.device as a context manager, as shown in the example below.

Note

PyTorch does not currently expose a public API to retrieve the current device of a thread. If such an API becomes available in the future, this function will act as an alias for it.

Warning

This function might impose a slight performance cost. Avoid calling it in hot code paths.

import torch

from fairseq2.device import get_current_device

# Default device used by PyTorch. Typically CPU.
default_device = torch.get_default_device()

assert get_current_device() == default_device

device = torch.device("cuda:0")

# Instruct PyTorch to use the specified device instead of the default
# device for tensor factory operations.
with device:
    assert get_current_device() == device

Exceptions

class fairseq2.device.LocalRankOutOfRangeError(local_rank: int, num_devices: int, device_type: str)[source]

Bases: Exception

Raised by get_default_device() when the value of the LOCAL_RANK environment variable is negative or exceeds the number of available devices.
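
A sketch of guarding against a misconfigured launcher, assuming a machine where CUDA is enabled so that an out-of-range LOCAL_RANK triggers the error as described above:

import os

from fairseq2.device import LocalRankOutOfRangeError, get_default_device

os.environ["LOCAL_RANK"] = "99"  # Deliberately larger than any real device index.

try:
    device = get_default_device()
except LocalRankOutOfRangeError as ex:
    print(f"Misconfigured LOCAL_RANK: {ex}")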