Which research conference will feature the first major presentation on the KAT model by Mar 31, 2025?
NeurIPS • 25%
ICML • 25%
CVPR • 25%
Other • 25%
Conference schedules and presentation lists
NUS Introduces KAT: New Transformer with KAN Layers Enhances Neural Network Performance
Sep 18, 2024, 08:27 PM
Researchers at the National University of Singapore have introduced the Kolmogorov-Arnold Transformer (KAT), a neural network architecture that replaces the traditional multi-layer perceptron (MLP) layers of a transformer with Kolmogorov-Arnold Network (KAN) layers. The substitution is intended to make the model more expressive: KAN layers can capture more complex relationships in data, and KAT is reported to show improved scalability and state-of-the-art results in various applications. Simple tricks were used to improve KAN's scalability, and the activation weights are initialized so that the activation variance is maintained. The advance is also described as a significant step toward solving multi-dimensional and fractional optimal control problems with higher accuracy and efficiency than traditional methods.
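To make the idea of swapping an MLP layer for a KAN-style layer more concrete, here is a minimal pure-Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the learnable activation is taken to be a rational function, one function is used per input channel, and the initialization rule is a simple Monte Carlo variant of variance preservation. The names `rational`, `kan_layer`, `second_moment`, and `init_weights` are hypothetical.

```python
import math
import random


def rational(x, a, b):
    """Learnable univariate function P(x) / (1 + |Q(x)|) (assumed form).

    a: numerator coefficients, a[0] + a[1]*x + a[2]*x**2 + ...
    b: denominator coefficients, b[0]*x + b[1]*x**2 + ...
    The 1 + |.| form keeps the denominator from reaching zero.
    """
    num = sum(a_i * x ** i for i, a_i in enumerate(a))
    den = 1.0 + abs(sum(b_j * x ** (j + 1) for j, b_j in enumerate(b)))
    return num / den


def kan_layer(x, weights, coeffs):
    """KAN-style layer: a learnable function per input channel, then a linear mix.

    x: list of d_in floats
    weights: d_out rows of d_in floats
    coeffs: d_in pairs (a, b), one rational function per input channel
    """
    phi = [rational(x_i, a, b) for x_i, (a, b) in zip(x, coeffs)]
    return [sum(w * p for w, p in zip(row, phi)) for row in weights]


def second_moment(a, b, n_samples=20000, seed=0):
    """Monte Carlo estimate of E[phi(x)^2] for x ~ N(0, 1)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        total += rational(rng.gauss(0.0, 1.0), a, b) ** 2
    return total / n_samples


def init_weights(d_in, d_out, a, b, seed=0):
    """Draw weights with variance 1 / (d_in * E[phi^2]).

    For unit-variance inputs this keeps each output at roughly unit
    variance at initialization (an assumed, simplified rule).
    """
    rng = random.Random(seed)
    std = 1.0 / math.sqrt(d_in * second_moment(a, b))
    return [[rng.gauss(0.0, std) for _ in range(d_in)] for _ in range(d_out)]


# Usage: with a = [0, 1] and no denominator terms the activation is the
# identity, so the layer reduces to an ordinary linear map.
identity = ([0.0, 1.0], [])
out = kan_layer([1.0, 2.0], [[1.0, 1.0], [0.5, -0.5]], [identity, identity])
```

Unlike an MLP layer, where a fixed nonlinearity follows the linear map, the nonlinearity here carries its own trainable coefficients and is applied before the mix. Sharing one set of coefficients across many channels would reduce the parameter count, which is one plausible way to make such layers scale.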