/* holds the size of maximum between msg size and cycle buffer, * aligned to cache line, * it is multiply by 2 for send and receive * with reference to number of flows and number of QPs */ ctx->buff_size = INC(BUFF_SIZE(ctx->size, ctx->cycle_buffer), ctx->cache_line_size) * 2 * num_of_qps_factor * user_param->flows;
65536 = 64Kb
generally, 16 pages
root cause: ulimit -l is 16 (default) in container
Python 3.6.12 |Anaconda, Inc.| (default, Sep 8 2020, 17:50:39) [GCC Clang 10.0.0 ] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import torch >>> torch.__version__ '1.8.0a0+f1a8a82'
TcpStore
1 2 3 4 5 6 7
python test/distributed/test_c10d.py
Python 3.6.12 |Anaconda, Inc.| (default, Sep 8 2020, 17:50:39) [GCC Clang 10.0.0 ] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import torch.distributed as dist >>> server_store = dist.TCPStore("127.0.0.1", 18668, 1, True)