Write MPS/MPO to disk or increase RAM?

How do I use this algorithm? What does that parameter do?
Post Reply
Sergi
Posts: 1
Joined: 01 Apr 2020, 10:11

Write MPS/MPO to disk or increase RAM?

Post by Sergi »

I am simulating a fermionic model on a cylinder. Everything is working well, except that I need to simulate a quite large system (20x4 unit cells, each unit cell composed of two sites).

In order to achieve convergence I need a bond dimension of ~3000-5000, which is not achievable with the resources I currently have (64GB of RAM shared among several cores).

Is it possible to store the MPS/MPO to disk in order to save memory? I have seen the related topic for the case of the MPO (viewtopic.php?f=7&t=55), but would it be much more work to do the same for the MPS?

Alternatively, any hints on whether increasing the RAM would suffice, or whether it is simply not feasible to simulate such a 2D system without a write-to-disk functionality?

Thanks a lot!
kevln
Posts: 18
Joined: 12 May 2022, 07:02

Re: Write MPS/MPO to disk or increase RAM?

Post by kevln »

Hi Sergi,
Do you know how to solve this problem now? Or are there other ways to store the MPS/MPO to disk in order to prevent running out of memory?
Thanks a lot!
User avatar
Johannes
Site Admin
Posts: 413
Joined: 21 Jul 2018, 12:52
Location: TU Munich

Re: Write MPS/MPO to disk or increase RAM?

Post by Johannes »

In terms of memory usage of DMRG, the limiting factor is usually the MPO environments rather than the MPS itself:
the number of parameters of the MPS scales as \(O(L\chi^2 d)\), where \(L\) is the MPS unit cell length, \(\chi\) is the bond dimension, and \(d\) is the local dimension.
In contrast, the MPO environments scale as \(O(L\chi^2 D)\), with an MPO bond dimension \( D \propto L_y \approx 20-40\) growing with the cylinder circumference (and the number of terms in H).
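
To get a feeling for the numbers, here is a rough back-of-envelope estimate. All values below are illustrative assumptions (not Sergi's actual model parameters), and the estimate treats the tensors as dense complex128 arrays; with charge conservation the tensors are block-sparse, so real memory usage is considerably smaller. The point is the ratio \(D/d\) by which the environments dominate:

```python
# Back-of-envelope memory estimate for DMRG on a cylinder.
# Dense upper bounds with complex128 entries (16 bytes); charge
# conservation makes the actual tensors block-sparse, hence smaller.

BYTES = 16     # bytes per complex128 entry
L = 160        # assumed: total sites (20x4 unit cells, 2 sites each)
chi = 4000     # assumed: MPS bond dimension
d = 2          # assumed: local dimension of one fermionic site
D = 40         # assumed: MPO bond dimension ~ circumference

mps_gb = L * chi**2 * d * BYTES / 1e9   # O(L chi^2 d)
env_gb = L * chi**2 * D * BYTES / 1e9   # O(L chi^2 D)

print(f"MPS:          ~{mps_gb:,.0f} GB (dense upper bound)")
print(f"environments: ~{env_gb:,.0f} GB (dense upper bound)")
print(f"environments are {D / d:.0f}x larger than the MPS")
```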

We fully support caching all the MPO environments to disk, using the simulation class with cache_params.
It can be as easy as adding those lines in your yaml simulation config:

Code: Select all

cache_params:
  storage_class: PickleStorage
  use_threading: True  # reduce the OMP_NUM_THREADS if you use this!
  # further specify `directory` or `tmpdir` on the cluster node's local file system
  tmpdir: /scratch/local/tmp 
cache_threshold_chi: 2000   # use cache for chi larger than that
kevln
Posts: 18
Joined: 12 May 2022, 07:02

Re: Write MPS/MPO to disk or increase RAM?

Post by kevln »

Johannes wrote: 24 Apr 2023, 16:46 In terms of memory usage of DMRG, the limiting factor is usually the MPO environments rather than the MPS itself:
the number of parameters of the MPS scales as \(O(L\chi^2 d)\), where \(L\) is the MPS unit cell length, \(\chi\) is the bond dimension, and \(d\) is the local dimension.
In contrast, the MPO environments scale as \(O(L\chi^2 D)\), with an MPO bond dimension \( D \propto L_y \approx 20-40\) growing with the cylinder circumference (and the number of terms in H).

We fully support caching all the MPO environments to disk, using the simulation class with cache_params.
It can be as easy as adding those lines in your yaml simulation config:

Code: Select all

cache_params:
  storage_class: PickleStorage
  use_threading: True  # reduce the OMP_NUM_THREADS if you use this!
  # further specify `directory` or `tmpdir` on the cluster node's local file system
  tmpdir: /scratch/local/tmp 
cache_threshold_chi: 2000   # use cache for chi larger than that
Hi Johannes,
Your suggestion above is correct and helpful. Following the instructions provided, I successfully managed to write some environments to disk.

However, based on my testing, it seems that TeNPy is unable to write the majority of the computation to disk. For example, a complex calculation I am currently attempting requires 120GB of RAM, but only 81GB is being written to disk. While my disk has sufficient capacity, the RAM usage is nearing its limit. This makes it challenging for me to perform more accurate computations with larger bond dimensions and obtain the desired results.

Therefore, is there a way for TeNPy to write more of the computation to disk and reduce the RAM consumption? The parameters I previously used were as follows:

Code: Select all

   
   'cache_params':{
        'storage_class': 'PickleStorage',
        'use_threading': True,  # reduce the OMP_NUM_THREADS if you use this!
        # further specify `directory` or `tmpdir` on the cluster node's local file system
        'tmpdir': './',
        ####max_queue_size=1 
        ####l do not know this param max_queue_size, but By reading the comments, I believe this can reduce the environments written to RAM and decrease RAM usage. Is that correct?	
    },
    'cache_threshold_chi':1000,   # use cache for chi larger than that
Post Reply