perf: parallelize tune_kernel epsilon sweep #12

Open
malik672 wants to merge 2 commits into harnesslabs:main from malik672:expf
Conversation

@malik672

Cache epsilon table once and parallelize the epsilon sweep in tune_kernel to cut diffusion build time.
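
To make the idea concrete, here is a minimal sketch of the two changes the description names: building the epsilon table once and evaluating each epsilon concurrently. It is not the code in this PR. The names `epsilon_table`, `kernel_sum`, and `sweep_epsilons`, the power-of-two range, and the Gaussian kernel sum used as the per-epsilon work are all assumptions made for illustration.

```cpp
#include <cmath>
#include <cstddef>
#include <future>
#include <vector>

// Hypothetical epsilon table: powers of two from 2^-10 to 2^10.
// The function-local static is initialized once (thread-safe since C++11)
// and reused on every later call instead of being rebuilt.
inline const std::vector<float>& epsilon_table() {
    static const std::vector<float> table = [] {
        std::vector<float> t;
        for (int e = -10; e <= 10; ++e) {
            t.push_back(std::ldexp(1.0f, e));  // 1.0f * 2^e
        }
        return t;
    }();
    return table;
}

// Hypothetical per-epsilon work: sum of Gaussian weights exp(-d^2 / eps)
// over a list of squared distances.
inline double kernel_sum(const std::vector<float>& sq_dists, float eps) {
    double sum = 0.0;
    for (float d2 : sq_dists) {
        sum += std::exp(-d2 / eps);
    }
    return sum;
}

// Launches one asynchronous task per epsilon and collects the results in
// table order. Each task only reads sq_dists, so no locking is needed.
inline std::vector<double> sweep_epsilons(const std::vector<float>& sq_dists) {
    const std::vector<float>& eps = epsilon_table();

    std::vector<std::future<double>> tasks;
    tasks.reserve(eps.size());
    for (float e : eps) {
        tasks.push_back(std::async(std::launch::async, [&sq_dists, e] {
            return kernel_sum(sq_dists, e);
        }));
    }

    std::vector<double> sums;
    sums.reserve(tasks.size());
    for (std::future<double>& t : tasks) {
        sums.push_back(t.get());  // blocks until that task has finished
    }
    return sums;
}
```

One task per epsilon keeps the sketch short. With a larger table, a thread pool or chunked loop would likely be a better fit than spawning a thread per entry.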

Comment thread include/igneous/data/topology.hpp Outdated
Some Linux toolchains don't expose std::expf; use std::exp which is overloaded for float.
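
The suggested change might look like the sketch below. `gaussian_weight` is a hypothetical helper, not a function from `topology.hpp`. It only shows that calling `std::exp` with a `float` argument selects the `float` overload, so the result stays in single precision without relying on `std::expf`.

```cpp
#include <cmath>
#include <type_traits>

// Hypothetical helper for illustration; not taken from the PR.
inline float gaussian_weight(float sq_dist, float eps) {
    // Before: std::expf(-sq_dist / eps), which some toolchains reject.
    // std::exp is overloaded for float, so this call stays in float.
    return std::exp(-sq_dist / eps);
}

// Compile-time check that the float overload was chosen.
static_assert(std::is_same<decltype(std::exp(1.0f)), float>::value,
              "std::exp(float) should return float");
```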