
Tan Shaojie's Wonderful Journey Through Computers

Artificial Intelligence

December 18, 2023
Category: Artificial Intelligence
4 min read

Deploy Stable Diffusion to A100

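Introduction

Image inference mostly goes through GUIs (ComfyUI, Stable Diffusion WebUI) ²
Training is based on kohya-trainer and a GUI; tagged anime-style image data can be crawled from danbooru.
Models and method implementations (the LyCORIS framework, for example?) can be downloaded for free from civitai.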

December 18, 2023
Category: Artificial Intelligence
1 min read

CV Model

Introduction

Related to AIGC image generation.

December 18, 2023
Category: Artificial Intelligence
1 min read

Inference Basic

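Introduction

RL involves inference, and the details of the inference pipeline are not very clear to me yet.

warmup: compute the KV cache
chunked prefill: reduce the GPU memory used by prefill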

December 18, 2023
Category: Artificial Intelligence
1 min read

Inference Optimization

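Introduction

Training has to compute and update gradients, so it is generally compute-bound. Inference, by contrast, is generally memory-bound.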
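
One way to see the compute-bound versus memory-bound split is arithmetic intensity (FLOPs per byte moved) compared against the hardware's ridge point. The sketch below does this for a single FP16 linear layer. The peak numbers are approximate A100 figures that I am assuming here (about 312 TFLOPS FP16 tensor-core throughput and about 1.555 TB/s of memory bandwidth), and both helper functions are hypothetical names made up for this note.

```python
def matmul_intensity(batch, d_in, d_out, bytes_per_elem=2):
    """FLOPs per byte for Y = X @ W, X: (batch, d_in), W: (d_in, d_out)."""
    flops = 2 * batch * d_in * d_out  # one multiply + one add per MAC
    # Read X and W once, write Y once (ideal caching, FP16 by default).
    bytes_moved = bytes_per_elem * (batch * d_in + d_in * d_out + batch * d_out)
    return flops / bytes_moved


def bound(intensity, peak_flops, peak_bandwidth):
    """Classify against the roofline ridge point (FLOPs per byte)."""
    ridge = peak_flops / peak_bandwidth
    return "compute-bound" if intensity >= ridge else "memory-bound"


PEAK_FLOPS = 312e12  # assumed: approximate A100 FP16 tensor-core peak
PEAK_BW = 1.555e12   # assumed: approximate A100 40GB memory bandwidth, bytes/s

for batch in (1, 4096):  # 1 ~ single-token decode step, 4096 ~ large training batch
    ai = matmul_intensity(batch, 4096, 4096)
    print(batch, round(ai, 1), bound(ai, PEAK_FLOPS, PEAK_BW))
```

With one token per step the weights are read for roughly one FLOP per byte, far below the ridge point of about 200, while a large batch reuses each weight thousands of times.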

December 18, 2023
Category: Artificial Intelligence
3 min read

AI Training Optimization

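Introduction

Training has to compute and update gradients, so it is generally compute-bound. Inference, by contrast, is generally memory-bound.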

December 18, 2023
Category: Artificial Intelligence
2 min read

[LLM]: DeepSeekV3

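Introduction

I was originally on the multimodal team, but got pulled in to optimize TX's dspv3 deployment, so I still need to get familiar with the relevant concepts and logic.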

December 18, 2023
Category: Artificial Intelligence
3 min read

LLM Model

Introduction

Foundation Models (One4All): general pre-trained models

LLM path, generative-ai-for-beginners

Leaderboards:

December 18, 2023
Category: Artificial Intelligence
2 min read

LLM Model Basic

Introduction

LLM concepts such as prefill, decode, and the KV cache.
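
A toy sketch of how these three concepts fit together, under heavy simplifying assumptions: a single attention head, identity projections (each token vector serves as its own query, key, and value), and a sequential loop for prefill where a real engine would process the prompt in parallel. `KVCache`, `prefill`, and `decode_step` are hypothetical names for illustration only.

```python
import math


def attend(q, keys, values):
    """Softmax attention of one query over the given keys/values."""
    scale = math.sqrt(len(q))
    scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) / total
            for d in range(dim)]


class KVCache:
    """Keys and values of all tokens seen so far."""

    def __init__(self):
        self.keys = []
        self.values = []

    def append(self, k, v):
        self.keys.append(k)
        self.values.append(v)


def prefill(prompt, cache):
    """Process the whole prompt and fill the cache (causal: token i sees 0..i)."""
    outputs = []
    for x in prompt:
        cache.append(x, x)  # identity projections: k = v = x (assumption)
        outputs.append(attend(x, cache.keys, cache.values))
    return outputs


def decode_step(x, cache):
    """Process one new token, reusing cached K/V instead of recomputing them."""
    cache.append(x, x)
    return attend(x, cache.keys, cache.values)


tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]
cache = KVCache()
prefill(tokens[:3], cache)             # prompt phase: 3 tokens at once
out = decode_step(tokens[3], cache)    # generation phase: 1 token per step
ref = attend(tokens[3], tokens, tokens)  # same result without any cache
print(all(math.isclose(a, b) for a, b in zip(out, ref)))
```

The decode step only computes attention for the new token, which is why the cache trades memory for not redoing the prompt's work at every generated token.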

December 18, 2023
Category: Artificial Intelligence
8 min read

Classical AI Models

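Introduction

Machine learning and AI model algorithms started out imitating neurons, and are now customized per task or built on simple ideas (for example adversarial training, receptive fields, attention mechanisms). Model design changes by the day, and the designs differ radically from one another. From a high-performance computing perspective, however, they still come down to a few things: differentiation, matrix operations, and activation function computation. What remains worth considering is finding the greatest common denominator of the compute operations that make up current or future models, so that specialized software and hardware can be designed to accelerate it. The alternative is simply to adapt and accelerate existing models.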

December 17, 2023
Category: Artificial Intelligence
1 min read

Deploy OpenLLM to one A100

Introduction

Practice is the best teacher in learning.