try fix tgs agitate #751
Merged
mrdanielw
merged 23 commits into
DeepLink-org:main
from
DeepLink-org:zgc/dipu_fix_tgs_agitate2
Apr 10, 2024
Changes from 16 commits
2941910
try fix tgs agitate
zhaoguochun1995 d5f3696
async_mempool.length-0
zhaoguochun1995 a18bfd0
use new param
zhaoguochun1995 5a177b0
Merge remote-tracking branch 'origin/main' into zgc/dipu_fix_tgs_agit…
9156868
reduce max extend size
6139204
kMaxAsyncResourcePoolLength->8,kMaxExtendSize->256MB
cf57466
supports using the value of environment variables to initialize kMaxA…
zhaoguochun1995 949f83d
kMaxExtendSize->32MB
zhaoguochun1995 f87d57e
kMaxExtendSize->128MB
zhaoguochun1995 ff5d314
enable init kMaxExtendSize using env var
zhaoguochun1995 2f48b62
enable init kMaxExtendSize using env var
zhaoguochun1995 549e274
fix lint
zhaoguochun1995 ea2db53
modify default value
zhaoguochun1995 e49f4cf
back up
zhaoguochun1995 c36dadb
modify maxExtendSize 512->1024
zhaoguochun1995 4128475
fix lint
zhaoguochun1995 a580fbf
add get_env_or_default func
zhaoguochun1995 98dd050
fix no need change
zhaoguochun1995 3122100
fix typo error
zhaoguochun1995 f277255
fix lint
zhaoguochun1995 3a04678
minor change
zhaoguochun1995 a5edb80
optimize BF cached-allocator logic
zhaoguochun1995 cd605ff
minor change
zhaoguochun1995
When events is empty here, does the entry still need to be added to the list? I assumed it could be ignored.
Yes, it is needed; otherwise the memory would leak.
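I suggest reworking this logic as a whole in a follow-up: for tensors that are not being waited on by any stream, call restore() directly at destruction.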
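Not calling restore at destruction is actually intentional. The main reasons:
1. It makes tensor destruction faster.
2. Calling restore at destruction has little benefit; only at allocation time do we need as much memory as possible to have already been reclaimed.
3. If memory were reclaimed right at destruction, reads and writes on the stream might not have finished yet; deferring reduces the chance of contention.
4. restore may involve defragmentation and similar operations. Moving that potential cost to allocation time lets part of the waiting turn into useful CPU work.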
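The deferred-restore reasoning in this thread can be illustrated with a minimal sketch. It is not DIPU's actual allocator code: `DeferredPool`, `Block`, `Event`, `release` and `restore_ready` are hypothetical names chosen for illustration, and a device event is modeled as a plain callable that reports whether the stream work has finished. The sketch assumes the destructor only enqueues the block (including when the event list is empty, so the block is not lost) and that reclamation happens on the allocation path.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical stand-in for a device event: returns true once the stream
// work recorded before it has completed.
using Event = std::function<bool()>;

// Hypothetical description of a cached memory block.
struct Block {
  void* ptr;
  std::size_t size;
};

class DeferredPool {
 public:
  // Called from the tensor destructor: enqueue only, never wait or restore.
  // A block with an empty event list is still queued; skipping it would
  // mean nothing ever returns it to the free list, i.e. a leak.
  void release(Block block, std::vector<Event> events) {
    pending_.push_back({block, std::move(events)});
  }

  // Called on the allocation path: move every block whose events have all
  // completed to the free list and return how many were reclaimed.
  std::size_t restore_ready() {
    std::size_t restored = 0;
    std::deque<Pending> still_pending;
    for (auto& p : pending_) {
      if (all_ready(p.events)) {
        free_.push_back(p.block);
        ++restored;
      } else {
        still_pending.push_back(std::move(p));
      }
    }
    pending_.swap(still_pending);
    return restored;
  }

  std::size_t pending_count() const { return pending_.size(); }
  std::size_t free_count() const { return free_.size(); }

 private:
  struct Pending {
    Block block;
    std::vector<Event> events;
  };

  // An empty event list counts as ready.
  static bool all_ready(const std::vector<Event>& events) {
    for (const auto& e : events) {
      if (!e()) return false;
    }
    return true;
  }

  std::deque<Pending> pending_;
  std::vector<Block> free_;
};
```

In this sketch the destructor path is a single `push_back`, while any scanning (and, in a real allocator, possible defragmentation) is paid for when memory is requested, which is the trade-off the reply describes.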