@RyanUnderhill
Contributor

Make sure fenced tensor could not reuse other tensor. (#2561)
Add support for opset 11 in reshape fusion (#2592)
Support opset 11 subgraph of Squad model in Embed Layer Normalization (#2605)
Allow providers to be set for InferenceSession at construction (#2606)
EmbedLayerNormalization Fusion For Dynamic Squad Model Opset 10 (#2613)
Improve Embed Layer Norm Fusion for SQuAD with static input shape (#2621)
Improve cuda expand() operator's performance. (#2624)
Cuda pad optimize when no padding is needed. (#2625)
Improve performance of resize() in Nearest mode (#2626)
Optimize cuda scatter() on 2D compatible. (#2628)
fix float16 comparison in initializer (#2629)
epsilon attribute for layernormalization fusion (#2639)
Fix memory exception in Layer Norm Fusion (#2644)

askhade and others added 20 commits December 10, 2019 15:01
Support opset version 11 in reshape fusion
…#2605)

Support opset 11 Squad model that is exported from PyTorch nightly. The embed layer uses Range op which is missed in the transformer.
Support subgraph of SQuAD model exported from pytorch with dynamic input axes
* Shortcut cuda Pad() when no padding is needed.
Special treatment for 2D, check same size as input image.
And in 2d kernel, template use_expolation.
* Optimize cuda scatter() on 2D compatible.

* Add some comments.
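
For readers skimming the Pad() commit message, the "no padding needed" shortcut can be illustrated with a minimal CPU-side sketch in Python. This is not the CUDA kernel from the commit; the function name `pad_2d`, the `(top, left, bottom, right)` pad layout, and the list-of-lists image are assumptions made only for illustration.

```python
def pad_2d(image, pads, value=0.0):
    """Constant-pad a non-empty 2D list of lists.

    `pads` is (top, left, bottom, right); this layout is an assumption
    of the sketch, not the ONNX attribute layout.
    """
    top, left, bottom, right = pads
    if min(pads) < 0:
        raise ValueError("negative pads are not handled in this sketch")

    # Shortcut: every pad is zero, so the output has the same size as the
    # input and a plain copy is enough; no per-element padding logic runs.
    if not any(pads):
        return [list(row) for row in image]

    # General path: build rows of fill value above and below, and extend
    # each input row on the left and right.
    width = len(image[0]) + left + right
    out = [[value] * width for _ in range(top)]
    for row in image:
        out.append([value] * left + list(row) + [value] * right)
    out.extend([value] * width for _ in range(bottom))
    return out
```

The check happens before any output rows are built, which mirrors the idea in the commit message of detecting the same-size case up front rather than inside the per-element work.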
@RyanUnderhill RyanUnderhill requested a review from a team as a code owner December 13, 2019 21:59
@RyanUnderhill RyanUnderhill merged commit 82d35de into rel-1.1.0 Dec 14, 2019
@RyanUnderhill RyanUnderhill deleted the ryanunderhill/rel-1.1.0 branch September 4, 2020 00:00