
pytorch GC #592

Closed
x66ccff opened this issue Feb 4, 2025 · 8 comments
x66ccff commented Feb 4, 2025

I'm using torch through PythonCall. When I create tensors repeatedly, I don't observe any decrease in GPU memory usage, even after reassigning the same variable or setting it to `nothing`. This persists even after running the GC.

```julia
using PythonCall
torch = pyimport("torch")
torch.cuda.is_available()
n = 20000

a = torch.randn((1, n*n), device=torch.device("cuda"))  # VRAM increases here
a = torch.randn((1, n*n), device=torch.device("cuda"))  # VRAM also increases here
a = torch.randn((1, n*n), device=torch.device("cuda"))  # VRAM also increases here
a = nothing  # useless

PythonCall.GC.gc()        # useless
torch.cuda.empty_cache()  # useless
```

Can anyone help?

Julia version: 1.11.3

```julia
julia> torch.__version__
Python: '2.6.0+cu126'
```


x66ccff commented Feb 4, 2025

@cjdoris @MilesCranmer Any thoughts on this issue? Thanks!


x66ccff commented Feb 4, 2025

Alright, `PythonCall.pydel!(a)` solves this:

```julia
using PythonCall
torch = pyimport("torch")
torch.cuda.is_available()
n = 20000

a = torch.randn((1, n*n), device=torch.device("cuda"))  # VRAM increases here
a = torch.randn((1, n*n), device=torch.device("cuda"))  # VRAM also increases here
a = torch.randn((1, n*n), device=torch.device("cuda"))  # VRAM also increases here

PythonCall.GC.gc()        # useless on its own
torch.cuda.empty_cache()  # useless on its own

PythonCall.pydel!(a)      # drop the Python reference explicitly

PythonCall.GC.gc()
torch.cuda.empty_cache()  # Released!
```

(Note that the `a = nothing` line from the previous snippet must be dropped here, since `pydel!` needs the live reference.)

x66ccff closed this as completed Feb 4, 2025

x66ccff commented Feb 22, 2025

x66ccff/SymbolicRegressionGPU.jl#22

There is still a problem here.

x66ccff reopened this Feb 22, 2025

x66ccff commented Feb 22, 2025

When using PythonCall and PyTorch together, any tensor created in Julia code (including temporary tensors) for which Julia keeps no handle to release via `pydel!()` cannot be freed through the GC or `torch.cuda.empty_cache()`. I wrote a specific example to demonstrate:

```julia
using PythonCall
torch = pyimport("torch")
torch.cuda.is_available()
n = 20000

a = torch.randn((1, n*n), device=torch.device("cuda"))  # VRAM increases here

f(x) = begin
    1 + 1
    3 * 1
    x + x         # ✅ can be released
end

g = f(a)

PythonCall.pydel!(a)
PythonCall.pydel!(g)

println(torch.cuda.memory_summary())
```

Compare with this version, where `f` creates intermediate tensors:

```julia
using PythonCall
torch = pyimport("torch")
torch.cuda.is_available()
n = 20000

a = torch.randn((1, n*n), device=torch.device("cuda"))  # VRAM increases here

f(x) = begin
    x + 1         # ❌ cannot be released anymore
    x * 1         # ❌ cannot be released anymore
    x + x         # ✅ can be released
end

g = f(a)

PythonCall.pydel!(a)
PythonCall.pydel!(g)

println(torch.cuda.memory_summary())
```


cjdoris commented Feb 22, 2025

After `f(x)` finishes, the result of `x + 1` is unreachable, so Julia will finalize it at some point in the future; that deletes the Python object and frees the memory backing the tensor. However, Julia provides no guarantees about when it will GC (which is what runs finalizers), and in general it waits until there is too much memory pressure on your system. Explicitly calling `GC.gc()` at some point after `f(x)` should free that memory.
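
To spell this out, here is a minimal sketch (assuming a CUDA device and the same setup as the snippets earlier in this thread; `f` is illustrative, not from the reports above):

```julia
using PythonCall
torch = pyimport("torch")

# t is an intermediate tensor: it becomes unreachable once f returns,
# but its finalizer only runs when Julia's GC actually sweeps.
f(x) = begin
    t = x + 1
    t * 1
end

a = torch.randn((1, 10^6), device=torch.device("cuda"))
g = f(a)

PythonCall.pydel!(a)
PythonCall.pydel!(g)

GC.gc()                   # a full Julia GC runs the finalizer for t ...
torch.cuda.empty_cache()  # ... after which PyTorch can return the cached blocks
```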


x66ccff commented Feb 22, 2025

Wow, thanks! That works! 😂 I forgot to try `GC.gc()`; I kept trying `PythonCall.pydel!`, `PythonCall.GC.gc()`, and `torch.cuda.empty_cache()`.


x66ccff commented Feb 22, 2025

@cjdoris However, calling `GC.gc()` frequently makes things too slow. Is there a method that garbage-collects only the PyTorch tensors?


cjdoris commented Feb 23, 2025

No, unless you pydel! every intermediate tensor.
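
For example, a sketch of that pattern (the function and variable names here are illustrative, not a library API): each intermediate gets a named handle and is deleted eagerly inside the function, so no Julia GC pass is needed.

```julia
using PythonCall
torch = pyimport("torch")

function f(x)
    t1 = x + 1                # intermediate tensor
    t2 = t1 * 1               # another intermediate
    out = t2 + t2
    PythonCall.pydel!(t1)     # free each intermediate immediately,
    PythonCall.pydel!(t2)     # without waiting for Julia's GC
    return out
end

a = torch.randn((1, 10^6), device=torch.device("cuda"))
g = f(a)

PythonCall.pydel!(a)
PythonCall.pydel!(g)
torch.cuda.empty_cache()      # VRAM returned without a full GC.gc()
```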
