
Spatially Optimized Compact Deep Metric Learning Model for Similarity Search (2404.06593v1)

Published 9 Apr 2024 in cs.CV, cs.AI, and cs.LG

Abstract: Spatial optimization is often overlooked in computer vision tasks. Filters should be able to recognize the features of an object regardless of where it appears in the image. Similarity search is a crucial task in which spatial features strongly influence the output. Convolution has limited capacity to capture visual patterns across different locations. In contrast, the involution kernel is generated dynamically at each pixel from the pixel value and learned parameters. This study demonstrates that adding a single involution feature-extraction layer to a compact convolution model significantly enhances similarity-search performance. Additionally, we improve predictions by using the GELU activation function instead of ReLU. Because involution adds a negligible number of weight parameters, the resulting compact, better-performing model is well suited to real-world deployment. Our proposed model is below 1 megabyte in size. We evaluated our proposed method and other models on the CIFAR-10, FashionMNIST, and MNIST datasets. Our proposed method outperforms the other models on all three datasets.
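
To make the contrast with convolution concrete, the following is a minimal pure-Python sketch of the two building blocks the abstract names: a per-pixel involution kernel and the GELU activation. It is an illustration under stated assumptions, not the authors' implementation: the kernel is generated by a single linear map from the pixel's channel vector (the original involution formulation uses a small bottleneck network), one kernel is shared across all channels, borders are zero-padded, and the names `involution`, `gelu`, and `weight` are hypothetical.

```python
import math


def gelu(x):
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def involution(image, weight, k=3):
    """Simplified single-group involution (illustrative assumption).

    image:  H x W x C nested lists of floats.
    weight: (k*k) x C nested lists; the learned parameters that map a
            pixel's channel vector to its own k x k kernel.
    Returns an H x W x C nested list. Pixels outside the image count as zero.
    """
    h, w, c = len(image), len(image[0]), len(image[0][0])
    r = k // 2
    out = [[[0.0] * c for _ in range(w)] for _ in range(h)]
    for i in range(h):
        for j in range(w):
            pixel = image[i][j]
            # The kernel is generated from the pixel itself, so it differs
            # at every spatial location (unlike a convolution filter).
            kernel = [sum(wt * p for wt, p in zip(row, pixel)) for row in weight]
            for u in range(k):
                for v in range(k):
                    y, x = i + u - r, j + v - r
                    if 0 <= y < h and 0 <= x < w:
                        kv = kernel[u * k + v]
                        # The same kernel value is applied to every channel.
                        for ch in range(c):
                            out[i][j][ch] += kv * image[y][x][ch]
    return out


# Toy example: one channel, and only the centre tap of the kernel is active,
# so each pixel's centre weight equals its own value.
centre_only = [[0.0] for _ in range(9)]
centre_only[4] = [1.0]
result = involution([[[2.0], [3.0]]], centre_only)
```

In this toy setup each output equals the squared input pixel, which shows that the effective filter changes from location to location even though the learned `weight` is fixed. Note also that `weight` holds only k*k*C values regardless of how many positions the layer covers, which is consistent with the abstract's remark that involution adds few parameters. Unlike ReLU, `gelu` passes a small negative value for negative inputs rather than clamping them to zero.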

Authors (6)
  1. Md. Farhadul Islam
  2. Md. Tanzim Reza
  3. Meem Arafat Manab
  4. Mohammad Rakibul Hasan Mahin
  5. Sarah Zabeen
  6. Jannatun Noor
