Commit 4130d73
[PyTorch] Update cuBLASLt grouped gemm filter (#3119)
* update cublaslt grouped gemm filter
Signed-off-by: Xin Yao <xiny@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update nvfp4 filter and tests
Signed-off-by: Xin Yao <xiny@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* test correctness
Signed-off-by: Varun Thumbe <vthumbe@nvidia.com>
* better test
Signed-off-by: Varun Thumbe <vthumbe@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: Xin Yao <xiny@nvidia.com>
Signed-off-by: Varun Thumbe <vthumbe@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Varun Thumbe <vthumbe@nvidia.com>1 parent 547d284 commit 4130d73
4 files changed
Lines changed: 180 additions & 43 deletions
File tree
- tests/pytorch
- transformer_engine
- common/gemm
- pytorch
- module
- ops/basic
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1497 | 1497 | | |
1498 | 1498 | | |
1499 | 1499 | | |
| 1500 | + | |
1500 | 1501 | | |
1501 | 1502 | | |
1502 | 1503 | | |
| |||
1580 | 1581 | | |
1581 | 1582 | | |
1582 | 1583 | | |
| 1584 | + | |
| 1585 | + | |
| 1586 | + | |
| 1587 | + | |
1583 | 1588 | | |
1584 | | - | |
| 1589 | + | |
1585 | 1590 | | |
1586 | 1591 | | |
1587 | 1592 | | |
1588 | 1593 | | |
1589 | 1594 | | |
1590 | 1595 | | |
1591 | 1596 | | |
1592 | | - | |
1593 | | - | |
1594 | | - | |
1595 | 1597 | | |
| 1598 | + | |
| 1599 | + | |
| 1600 | + | |
| 1601 | + | |
| 1602 | + | |
| 1603 | + | |
| 1604 | + | |
| 1605 | + | |
| 1606 | + | |
| 1607 | + | |
| 1608 | + | |
| 1609 | + | |
| 1610 | + | |
1596 | 1611 | | |
1597 | 1612 | | |
1598 | 1613 | | |
1599 | 1614 | | |
1600 | 1615 | | |
1601 | | - | |
1602 | | - | |
| 1616 | + | |
| 1617 | + | |
1603 | 1618 | | |
1604 | 1619 | | |
1605 | 1620 | | |
| |||
1683 | 1698 | | |
1684 | 1699 | | |
1685 | 1700 | | |
| 1701 | + | |
| 1702 | + | |
| 1703 | + | |
| 1704 | + | |
| 1705 | + | |
| 1706 | + | |
| 1707 | + | |
| 1708 | + | |
| 1709 | + | |
| 1710 | + | |
| 1711 | + | |
| 1712 | + | |
| 1713 | + | |
| 1714 | + | |
| 1715 | + | |
| 1716 | + | |
| 1717 | + | |
| 1718 | + | |
| 1719 | + | |
| 1720 | + | |
| 1721 | + | |
| 1722 | + | |
| 1723 | + | |
| 1724 | + | |
| 1725 | + | |
| 1726 | + | |
| 1727 | + | |
| 1728 | + | |
| 1729 | + | |
| 1730 | + | |
| 1731 | + | |
| 1732 | + | |
| 1733 | + | |
| 1734 | + | |
| 1735 | + | |
| 1736 | + | |
| 1737 | + | |
| 1738 | + | |
| 1739 | + | |
| 1740 | + | |
| 1741 | + | |
| 1742 | + | |
| 1743 | + | |
| 1744 | + | |
| 1745 | + | |
| 1746 | + | |
| 1747 | + | |
| 1748 | + | |
| 1749 | + | |
| 1750 | + | |
| 1751 | + | |
| 1752 | + | |
| 1753 | + | |
| 1754 | + | |
| 1755 | + | |
| 1756 | + | |
| 1757 | + | |
| 1758 | + | |
| 1759 | + | |
| 1760 | + | |
| 1761 | + | |
| 1762 | + | |
| 1763 | + | |
| 1764 | + | |
| 1765 | + | |
| 1766 | + | |
| 1767 | + | |
| 1768 | + | |
| 1769 | + | |
| 1770 | + | |
| 1771 | + | |
| 1772 | + | |
| 1773 | + | |
| 1774 | + | |
| 1775 | + | |
| 1776 | + | |
| 1777 | + | |
| 1778 | + | |
| 1779 | + | |
| 1780 | + | |
| 1781 | + | |
| 1782 | + | |
| 1783 | + | |
| 1784 | + | |
1686 | 1785 | | |
1687 | 1786 | | |
1688 | 1787 | | |
| |||
1691 | 1790 | | |
1692 | 1791 | | |
1693 | 1792 | | |
| 1793 | + | |
| 1794 | + | |
| 1795 | + | |
| 1796 | + | |
1694 | 1797 | | |
1695 | | - | |
| 1798 | + | |
1696 | 1799 | | |
1697 | 1800 | | |
1698 | 1801 | | |
1699 | 1802 | | |
1700 | | - | |
1701 | | - | |
| 1803 | + | |
| 1804 | + | |
| 1805 | + | |
| 1806 | + | |
| 1807 | + | |
| 1808 | + | |
| 1809 | + | |
| 1810 | + | |
| 1811 | + | |
| 1812 | + | |
| 1813 | + | |
| 1814 | + | |
| 1815 | + | |
1702 | 1816 | | |
1703 | 1817 | | |
1704 | 1818 | | |
1705 | 1819 | | |
1706 | | - | |
1707 | 1820 | | |
1708 | 1821 | | |
1709 | 1822 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
228 | 228 | | |
229 | 229 | | |
230 | 230 | | |
231 | | - | |
| 231 | + | |
232 | 232 | | |
233 | 233 | | |
234 | 234 | | |
| |||
335 | 335 | | |
336 | 336 | | |
337 | 337 | | |
338 | | - | |
| 338 | + | |
| 339 | + | |
339 | 340 | | |
340 | 341 | | |
341 | 342 | | |
| |||
344 | 345 | | |
345 | 346 | | |
346 | 347 | | |
347 | | - | |
| 348 | + | |
348 | 349 | | |
349 | 350 | | |
350 | 351 | | |
| |||
400 | 401 | | |
401 | 402 | | |
402 | 403 | | |
403 | | - | |
| 404 | + | |
404 | 405 | | |
405 | 406 | | |
406 | 407 | | |
| |||
1567 | 1568 | | |
1568 | 1569 | | |
1569 | 1570 | | |
1570 | | - | |
| 1571 | + | |
1571 | 1572 | | |
1572 | 1573 | | |
1573 | 1574 | | |
| |||
1650 | 1651 | | |
1651 | 1652 | | |
1652 | 1653 | | |
1653 | | - | |
| 1654 | + | |
1654 | 1655 | | |
1655 | 1656 | | |
1656 | 1657 | | |
| |||
1801 | 1802 | | |
1802 | 1803 | | |
1803 | 1804 | | |
1804 | | - | |
| 1805 | + | |
1805 | 1806 | | |
1806 | 1807 | | |
1807 | 1808 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
55 | 55 | | |
56 | 56 | | |
57 | 57 | | |
58 | | - | |
59 | | - | |
| 58 | + | |
60 | 59 | | |
61 | 60 | | |
62 | 61 | | |
| |||
95 | 94 | | |
96 | 95 | | |
97 | 96 | | |
98 | | - | |
99 | 97 | | |
100 | | - | |
101 | 98 | | |
102 | | - | |
| 99 | + | |
103 | 100 | | |
104 | 101 | | |
105 | 102 | | |
106 | 103 | | |
107 | 104 | | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
108 | 115 | | |
| 116 | + | |
109 | 117 | | |
110 | 118 | | |
| 119 | + | |
111 | 120 | | |
112 | 121 | | |
113 | 122 | | |
| |||
116 | 125 | | |
117 | 126 | | |
118 | 127 | | |
119 | | - | |
| 128 | + | |
| 129 | + | |
120 | 130 | | |
| 131 | + | |
121 | 132 | | |
122 | 133 | | |
| 134 | + | |
123 | 135 | | |
124 | | - | |
125 | | - | |
126 | | - | |
127 | | - | |
128 | | - | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
129 | 140 | | |
130 | 141 | | |
131 | 142 | | |
| |||
234 | 245 | | |
235 | 246 | | |
236 | 247 | | |
237 | | - | |
| 248 | + | |
238 | 249 | | |
239 | 250 | | |
240 | 251 | | |
| |||
491 | 502 | | |
492 | 503 | | |
493 | 504 | | |
494 | | - | |
495 | 505 | | |
496 | | - | |
497 | 506 | | |
498 | 507 | | |
499 | 508 | | |
| |||
745 | 754 | | |
746 | 755 | | |
747 | 756 | | |
748 | | - | |
| 757 | + | |
749 | 758 | | |
750 | 759 | | |
751 | 760 | | |
| |||
0 commit comments