gigl.distributed.dist_sampling_producer#

Classes#

DistSamplingProducer

A subprocess group of distributed sampling workers.

Module Contents#

class gigl.distributed.dist_sampling_producer.DistSamplingProducer(data, sampler_input, sampling_config, worker_options, output_channel)[source]#

Bases: graphlearn_torch.distributed.DistMpSamplingProducer

A subprocess group of distributed sampling workers.

Note that this producer is only used for workload with separate sampling and training, all sampled results will be sent to the output channel.

Parameters:
  • data (graphlearn_torch.distributed.dist_dataset.DistDataset)

  • sampler_input (Union[graphlearn_torch.sampler.NodeSamplerInput, graphlearn_torch.sampler.EdgeSamplerInput])

  • sampling_config (graphlearn_torch.sampler.SamplingConfig)

  • worker_options (graphlearn_torch.distributed.dist_options._BasicDistSamplingWorkerOptions)

  • output_channel (graphlearn_torch.channel.ChannelBase)

init()[source]#

Create the subprocess pool. Init samplers and rpc server.