Enable GPU-supported RGF calculation by adding 'rgf_device' by AsymmetryChou · Pull Request #29 · deepmodeling/dpnegf

AsymmetryChou · 2026-06-14T06:47:22Z

This pull request introduces comprehensive improvements to device management and tensor placement throughout the NEGF codebase. The main focus is to allow explicit control over which device (CPU or CUDA) is used for the recursive Green's function (RGF) step, while ensuring all relevant tensors and operations are consistently placed on the correct device. This is achieved by propagating an rgf_device parameter and updating tensor allocations, conversions, and file I/O accordingly. Additionally, the pull request clarifies device usage in documentation and argument parsing.

The most important changes are:

Device management and propagation:

Added an rgf_device parameter (defaulting to CPU) to core classes and functions, including DeviceProperty, NEGF, and argument parsing, to control where the RGF step runs. All relevant tensor allocations and computations in the NEGF pipeline now use this device.

Consistent tensor placement:

Ensured all tensors used in RGF and related calculations are allocated on or moved to the correct device (rgf_device), including Hamiltonian blocks, self-energies, and intermediate matrices in both batched and non-batched scenarios.
Updated tensor operations and file I/O to handle device transfers, including moving tensors to CPU before saving with .numpy() and ensuring device consistency during block construction and property calculations.
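
The CPU round-trip before saving can be illustrated with a minimal PyTorch sketch. The helper name `to_numpy_on_cpu` and the sample tensor are hypothetical and not taken from the dpnegf code. The device falls back to CPU when CUDA is unavailable, so the snippet runs on any machine.

```python
import numpy as np
import torch


def to_numpy_on_cpu(t: torch.Tensor) -> np.ndarray:
    """Return `t` as a NumPy array, moving it to host memory first.

    Calling .numpy() directly on a CUDA tensor raises a TypeError, so the
    tensor is detached from autograd and moved to CPU beforehand.
    """
    return t.detach().cpu().numpy()


# Use CUDA for the stand-in RGF tensor only if it is actually available.
rgf_device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Hypothetical stand-in for a Green's function block living on rgf_device.
g_block = torch.eye(2, dtype=torch.complex128, device=rgf_device)

# Safe to hand to np.save or any other NumPy-based I/O from here on.
g_np = to_numpy_on_cpu(g_block)
```

For a tensor that already lives on the CPU, `.cpu()` is a no-op, so the same helper works regardless of which rgf_device was selected.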

Documentation and argument updates:

Clarified in comments and argument documentation that Hamiltonian initialization and self-energy calculations always run on CPU, while only the RGF step uses the specified device. Added a new rgf_device argument to the CLI and configuration.
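
As a rough illustration of an rgf_device option with a CPU default, here is a sketch using Python's standard argparse module. The parser, its description, and the `--rgf_device` flag spelling are assumptions made for this example; the actual dpnegf CLI and configuration schema are not shown in this description and may differ.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a hypothetical parser exposing only the RGF device choice."""
    parser = argparse.ArgumentParser(description="hypothetical NEGF runner")
    parser.add_argument(
        "--rgf_device",
        default="cpu",             # the PR states the default is CPU
        choices=["cpu", "cuda"],   # the two devices named in the PR
        help="device for the RGF step only; all other steps stay on CPU",
    )
    return parser


# Explicitly request the GPU for the RGF step.
args = build_parser().parse_args(["--rgf_device", "cuda"])

# With no flag given, the RGF step stays on CPU.
default_args = build_parser().parse_args([])
```

Restricting the option with `choices` rejects misspelled device names at parse time instead of failing later during tensor allocation.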

These changes enable GPU acceleration of the RGF step, improve code clarity, and make device handling consistent across the codebase.

TODO:
For now, RGF transfers all Hamiltonian and overlap sub-blocks to the GPU at once, so large systems can run out of memory (OOM). A more fine-grained transfer scheme should be implemented.
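
To get a feel for when the all-at-once transfer becomes a problem, the following back-of-the-envelope sketch estimates the memory needed to hold every diagonal sub-block simultaneously. The function name, the block sizes, and the complex128 assumption (16 bytes per element) are illustrative choices, not values from dpnegf. Off-diagonal blocks and RGF intermediates are ignored, so the result is only a lower bound.

```python
def blocks_memory_bytes(block_sizes, bytes_per_element=16, matrices_per_block=2):
    """Estimate bytes needed to hold dense square sub-blocks all at once.

    bytes_per_element=16 assumes complex128 storage.
    matrices_per_block=2 counts one Hamiltonian and one overlap matrix
    for every block size in `block_sizes`.
    """
    elements = sum(n * n for n in block_sizes)
    return elements * bytes_per_element * matrices_per_block


# Hypothetical system: 10 diagonal blocks of 4000 x 4000 each.
needed = blocks_memory_bytes([4000] * 10)
needed_gib = needed / 2**30  # convert bytes to GiB

print(f"lower bound: {needed_gib:.2f} GiB")
```

Comparing such an estimate with the free memory on the target GPU gives a quick indication of whether a given system is likely to hit the OOM case described above.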

AsymmetryChou added 13 commits June 13, 2026 16:05

add runner_device in NEGF.py

f6207dd

add runner_device in density.py

ce2ad8d

add runner_device in device_property.py

406b6f7

add runner_device in negf_hamilotnian initialization

ebc24ba

add runner_device in run.py entrypoints

9e6c0bb

transfer to cpu before hamiltonian saved

5a05ad3

add rgf_device; all steps except RGF is on CPU now.

e613bce

update examples

70579ff

move necessary green's function in cuda

670009d

rename DeviceProperty.device as rgf_device

5be1769

update docstring and temp var

1cffa9b

add TODO for OOM in RGF

d538abf

update example: add GPU vs. CPU benchmark

564da00

AsymmetryChou changed the title ~~Add 'rgf_device' support~~ Enable GPU-supported RGF calculation by adding 'rgf_device' Jun 14, 2026

AsymmetryChou merged commit b6663bd into deepmodeling:main Jun 14, 2026
1 check passed

AsymmetryChou merged 13 commits into deepmodeling:main from AsymmetryChou:rgf_acc
