Measure Zero


  • About

  • Quotes

  • Notes

  • Sitemap

  • Search

Python 并发简要

2022-10-23 | ~ | Tech

并发 (concurrency) 和并行 (parallelism) 是两个概念. 这两个术语的使用还有争议, 下面依照书上 (Fluent Python, 2nd Edition) 的说法, 并行是并发的子集.

A modern laptop with 4 CPU cores is routinely running more than 200 processes at any given time under normal, casual use. To execute 200 tasks in parallel, you’d need 200 cores. So, in practice, most computing is concurrent and not parallel.

Read more »

深度学习性能瓶颈往往是内存带宽

2022-10-16 | ~ | Machine Learning

首先是这篇文章

  • Horace He. (2022). Making Deep Learning Go Brrrr From First Principles

内容不再赘述, 下面是纲要和一些对原文的注解. 虽然文章用 Torch 和 GPU 举例, 但基本原理依然普适.

注 (Show more »)

机器之心 编译 过这篇文章, 但有些错误. 比如只看第一段, 前几句话还行, 但光最后一句就有两个技术错误: (1) 把 in-place 说成内置; (2) 梯度设置为 None 说成 0. 关于第 2 点可以参考 torch 文档 Use parameter.grad = None instead of model.zero_grad() or optimizer.zero_grad().

简单介绍了模型耗时可以分为三部分

  • Compute: Time spent on your GPU computing actual floating point operations (FLOPS)
  • Bandwith: Time spent on moving the data from CPU to GPU, from one node to another, or even from CUDA global memory to CUDA shared memory
  • Overhead: Everything else
Read more »

Python 自动化杂录: Makefile 等

2022-10-03 | ~ 2022-10-18 | Tech

Makefile

直接看原文

  • Aniket Bhattacharyea. (2021). Makefile Tutorials and Examples to Build From
  • Aniket Bhattacharyea. (2021). Creating a Python Makefile
Read more »

Python setuptools 简要

2022-10-02 | ~ | Tech

用 setuptools

Read more »

Java 初步: 简化版

2022-10-01 | ~ | Language

主要参考 java4python “Learn Java: Programiz” app 上的小课. Just a quick review.

Read more »

C++ 初步: 简化版

2022-09-26 | ~ | Language

主要参考 cpp4python 和 “Learn C++: Programiz” app 上的小课, 内容是简化版中的简化版.

备用

  • 终极 C++避坑指南
  • cppreference.com

C++ 文件的扩展名常见的有 .cpp, .cc 等.

Read more »

Shell 脚本简要

2022-09-20 | ~ 2022-10-13 | Language

主要参考

  • Shotts, W. (2019). The Linux command line

学习目标: 能写简单脚本, 能看懂长一些的脚本. Shell 脚本不是开发语言, 难以 debug, 不适合写太长.

If you are writing a script that is more than 100 lines long, or that uses non-straightforward control flow logic, you should rewrite it in a more structured language now. Bear in mind that scripts grow. Rewrite your script early to avoid a more time-consuming rewrite at a later date. (Google Shell Style Guide)

Read more »

Python 装饰器杂录

2022-09-11 | ~ | Tech

装饰器基础, 以及常用装饰器略了, 比如 staticmethod, classmethod, property, functools.lru_cache.

Read more »

Python 的 web 相关库杂录

2022-09-04 | ~ 2023-12-25 | Tech

随手找的一些模型部署简单例子.

  • Flask+Gunicorn+Service Streamer
  • Flask+Gunicorn
  • Flask+gevent
  • Tornado
Read more »

Triton Inference Server 简要

2022-08-29 | ~ | Tech

NVIDIA 的 Triton Inference Server.

总体架构

把模型文件 (支持主流格式) 放在 model repo, 由 inference server 调度, 根据模型配置的推理引擎 (ONNX, TensorRT 等) 输出结果.

下面以在 CPU 上部署 ONNX 模型为例.

Read more »
1 … 4 5 6 … 18
Shiina

Shiina

知乎 豆瓣 bangumi Instagram Weibo
Creative Commons
RSS
© 2019 - 2025   Shiina   CC BY-NC-ND 4.0
RSS  
Powered by Jekyll
 
Theme NexT.Mist