Summary: Combining tensors that live on different devices in get_rel_pos was preventing CUDA graphs from optimizing the function correctly.

[ghstack-poisoned]

gh/HDCharles/1/head
1 changed file with 2 additions and 2 deletions
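The diff itself is not reproduced on this page, so the following is only a minimal sketch of the kind of change the summary describes, assuming PyTorch. The helper names `rel_coords` and `lookup_rel_pos` are hypothetical and are not taken from the commit. The idea is to create the index tensors on the same device as the table they index, so that no host-side tensor is mixed into the computation.

```python
import torch


def rel_coords(q_size: int, k_size: int, device: torch.device) -> torch.Tensor:
    """Build relative-position indices directly on `device` (hypothetical helper)."""
    # Passing device= to torch.arange keeps both coordinate tensors on the
    # same device as the table they will later index.
    q_scale = max(k_size / q_size, 1.0)
    k_scale = max(q_size / k_size, 1.0)
    q_coords = torch.arange(q_size, device=device)[:, None] * q_scale
    k_coords = torch.arange(k_size, device=device)[None, :] * k_scale
    # Shift so the smallest index is 0.
    return (q_coords - k_coords) + (k_size - 1) * k_scale


def lookup_rel_pos(rel_pos: torch.Tensor, q_size: int, k_size: int) -> torch.Tensor:
    """Gather rows of `rel_pos` using indices that share its device."""
    idx = rel_coords(q_size, k_size, rel_pos.device).long()
    return rel_pos[idx]
```

With a table of shape `(2 * size - 1, channels)` and equal query and key sizes, the result has shape `(size, size, channels)` and stays on the device of `rel_pos`. Whether this matches the two lines actually changed in the commit is an assumption.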