This is commendable, but there's room for improvement. Up until now, SOTA-level "open-source" LLMs (LLaMA, Mistral, etc.) have usually only made their inference code and model architecture public. While these elements are not insignificant, they are somewhat trivial compared to the training code and training datasets, as those two factors largely determine the model's performance. That is not open at all. It goes without saying that sharing the training datasets and process with other AI researchers is crucial. This transparency would not only help improve the model (since others could contribute to it) but also benefit the whole community, as these releases usually advertise. Otherwise, it will be difficult for these efforts to truly advance the development of LLMs.
Maybe it’s just that my monitors are too dark, but when I try this everything just looks too washed out and I can’t really tell what I’m looking at. Switching to light mode was more a side effect of the increased lighting than anything.
I'm curious about what makes this project special, since there are a lot of similar implementations of diffusion models based on PyTorch/TF. Is it because it uses the CPU itself to run the diffusion process?
Yeah. For something like this, you'd ideally want a powerful GPU with 12-24 GB of VRAM. If you have something like an RTX 2070 at the bare minimum, you probably don't need this and could do a lot more steps a lot faster on a GPU, but it's great for those who don't have that option.
Yep, 8 GB works fine. The 2070 is where I started. I wouldn't consider it ideal, though. There will be cases where you'll wish you could increase the resolution a little more, or do just a few more images per batch, but you'll be hitting CUDA out-of-memory errors instead.
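Not from the thread above, but the usual workaround for those OOM errors is to retry with a smaller batch. A minimal Python sketch of that pattern, where `generate_batch` is a hypothetical stand-in for whatever function actually runs the model (in PyTorch you'd catch `torch.cuda.OutOfMemoryError`, modeled here with a plain `RuntimeError` so the sketch stays self-contained):

```python
def generate_with_fallback(generate_batch, batch_size):
    """Try the requested batch size, halving it until generation succeeds."""
    while batch_size >= 1:
        try:
            return generate_batch(batch_size)
        except RuntimeError:  # stands in for a CUDA out-of-memory error
            batch_size //= 2  # halve the batch and retry with less memory pressure
    raise RuntimeError("out of memory even at batch size 1")
```

Once you're already at batch size 1, the next levers are lowering the resolution or enabling library-specific tricks like fp16 weights or attention slicing, but those depend on the framework you're using.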