fpt-thaituan / traffic-sign-classification-using-transfer-learning-with-resnet152v2


Utilizing ResNet152V2 for Traffic Sign Classification: Achieve High Accuracy in Identifying 52 Sign Types with 99% Precision

Home Page: https://www.kaggle.com/code/tuanai/traffic-sign-classification-resnet152v2-acc-99

classification cnn computer-vision resnet152v2 traffic-sign-classification transfer-learning

Traffic sign classification using transfer learning with ResNet152V2

1. Introduction

  • Artificial intelligence (AI) and computer vision are increasingly integrated into many sectors. One critical application is traffic sign classification, which plays a pivotal role in road safety and autonomous driving systems. Deep learning architectures such as ResNet152V2 have proven effective at achieving high accuracy and robustness in traffic sign recognition. This project explores the use of ResNet152V2 for traffic sign classification and aims to provide insight into its performance and potential real-world applications.

2. Abstract

  • This study investigates the use of ResNet152V2, a deep residual network architecture, for traffic sign classification. Using a dataset covering 52 types of traffic signs, the approach applies transfer learning from a pre-trained model to identify each sign category. In our experiments, ResNet152V2 recognizes traffic signs with a precision of 99%. The findings highlight the potential of advanced deep learning models for improving road safety and enabling robust autonomous driving systems.

3. ResNet152V2

  • ResNet152V2 is a deep neural network architecture developed by a team at Microsoft Research. It is a variant of ResNet (Residual Network), a powerful and widely used family of deep neural networks in computer vision.
  • The architecture consists of stacked residual blocks together with convolutional, pooling, and fully connected layers. The defining feature of ResNet is the skip connection, which adds a block's input to its output. This mitigates the vanishing gradient problem and allows much deeper networks to be trained. ResNet152V2 is 152 layers deep, which improves its representation learning ability.
  • ResNet152V2 also uses batch normalization and rectified linear units (ReLU) to speed up training and help reduce overfitting. In the V2 variant these are applied before each convolution (pre-activation). It also uses bottleneck blocks to reduce computational cost while maintaining high performance.
  • The structure of ResNet152V2 is shown below [1].

Screenshot 2024-04-11 214112
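The effect of skip connections on gradients can be illustrated with a toy calculation. This is a minimal sketch, not part of the project: it assumes every layer is a scalar function with the same small local derivative (the value 0.01 and the depth of 50 are arbitrary choices for illustration). A plain layer computes y = f(x), so the gradients multiply as f'(x); a residual layer computes y = x + f(x), so each factor becomes 1 + f'(x).

```python
def chain_gradient(local_grads, residual):
    """Gradient of the last layer's output with respect to the first input.

    Each layer is modelled as a scalar map:
      plain layer:    y = f(x)      -> local factor f'(x)
      residual layer: y = x + f(x)  -> local factor 1 + f'(x)
    local_grads[i] is the assumed value of f'(x) at layer i.
    """
    grad = 1.0
    for g in local_grads:
        grad *= (1.0 + g) if residual else g
    return grad


depth = 50                    # assumed depth, for illustration only
local_grads = [0.01] * depth  # assumed small per-layer derivative

plain = chain_gradient(local_grads, residual=False)
skip = chain_gradient(local_grads, residual=True)

print(f"plain stack:    {plain:.3e}")
print(f"residual stack: {skip:.3e}")
```

In the plain stack the product of small factors collapses toward zero, while the identity path in the residual stack keeps the gradient at a usable scale.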

4. Project

4.1. Data visualization

  • Number of images for each traffic sign class

Screenshot 2024-04-11 220637

  • A sample image of each traffic sign class

Screenshot 2024-04-11 220817

4.2. Data augmentation

  • TensorFlow's image generator is a useful tool for producing input data for machine learning models, especially in image processing. This project uses ImageDataGenerator to generate variations of the training images, which augments the data and exposes the model to more diverse examples.
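A possible augmentation setup is sketched below. The specific ranges, the directory layout (one sub-folder per class), the 224x224 target size, and the helper name `make_train_generator` are assumptions for illustration and are not taken from the notebook. Horizontal flipping is disabled here because mirroring a sign such as "turn left" would change its meaning. Note that ImageDataGenerator is marked as deprecated in recent TensorFlow releases, so this sketch assumes a version that still ships it.

```python
# Assumed augmentation settings (illustrative, not the notebook's values).
AUGMENTATION_KWARGS = {
    "rotation_range": 10,        # rotate by up to +/- 10 degrees
    "width_shift_range": 0.1,    # shift horizontally by up to 10% of width
    "height_shift_range": 0.1,   # shift vertically by up to 10% of height
    "zoom_range": 0.1,           # zoom in or out by up to 10%
    "horizontal_flip": False,    # mirrored signs can change meaning
    "fill_mode": "nearest",      # fill pixels exposed by a transform
}


def make_train_generator(directory, target_size=(224, 224), batch_size=32):
    """Build an augmented batch iterator from a folder with one sub-folder per class."""
    # Imported lazily so the settings above can be inspected without TensorFlow.
    from tensorflow.keras.applications.resnet_v2 import preprocess_input
    from tensorflow.keras.preprocessing.image import ImageDataGenerator

    datagen = ImageDataGenerator(
        preprocessing_function=preprocess_input,  # scaling expected by ResNet V2
        **AUGMENTATION_KWARGS,
    )
    return datagen.flow_from_directory(
        directory,
        target_size=target_size,
        batch_size=batch_size,
        class_mode="categorical",  # one-hot labels for multi-class training
    )
```

Validation and test images would normally go through a generator that applies only `preprocess_input`, so that the evaluation data is not distorted.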

4.3. Load and modify the pre-trained model

  • Load the pre-trained model
  • Replace the output layer to match the project, which has 52 traffic sign classes

Screenshot 2024-04-11 221948
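The two steps above can be sketched with the Keras Applications API. This is one common way to set up transfer learning, not necessarily the exact head used in the notebook: the frozen base, the global average pooling layer, the dropout rate of 0.3, the 224x224 input size, and the helper name `build_model` are all assumptions.

```python
NUM_CLASSES = 52  # number of traffic sign classes in this project


def build_model(num_classes=NUM_CLASSES, input_shape=(224, 224, 3), weights="imagenet"):
    """ResNet152V2 backbone with a new classification head (assumed layout)."""
    import tensorflow as tf  # imported lazily; requires TensorFlow 2.x

    # Pre-trained convolutional base without the original 1000-class output layer.
    base = tf.keras.applications.ResNet152V2(
        include_top=False, weights=weights, input_shape=input_shape
    )
    base.trainable = False  # freeze the pre-trained weights

    inputs = tf.keras.Input(shape=input_shape)
    x = base(inputs, training=False)                 # keep batch norm in inference mode
    x = tf.keras.layers.GlobalAveragePooling2D()(x)  # feature map -> feature vector
    x = tf.keras.layers.Dropout(0.3)(x)              # assumed regularization
    outputs = tf.keras.layers.Dense(num_classes, activation="softmax")(x)
    return tf.keras.Model(inputs, outputs)
```

Calling `build_model()` downloads the ImageNet weights on first use; passing `weights=None` builds the same architecture with random initialization.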

4.4. Build models

  • Model training process

Screenshot 2024-04-11 222120
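A training loop along these lines could look like the sketch below. The optimizer, learning rate, number of epochs, and callbacks are assumptions chosen for illustration; the notebook's actual settings are only visible in the screenshot above. The helper name `train` is hypothetical.

```python
# Assumed compile settings for one-hot encoded multi-class labels.
COMPILE_KWARGS = {
    "loss": "categorical_crossentropy",
    "metrics": ["accuracy"],
}


def train(model, train_data, val_data, epochs=20, learning_rate=1e-3):
    """Compile and fit a Keras model; returns the Keras History object."""
    import tensorflow as tf  # imported lazily; requires TensorFlow 2.x

    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=learning_rate),
        **COMPILE_KWARGS,
    )
    callbacks = [
        # Stop when validation loss stops improving and keep the best weights.
        tf.keras.callbacks.EarlyStopping(
            monitor="val_loss", patience=5, restore_best_weights=True
        ),
        # Halve the learning rate when validation loss plateaus.
        tf.keras.callbacks.ReduceLROnPlateau(
            monitor="val_loss", factor=0.5, patience=2
        ),
    ]
    return model.fit(
        train_data,
        validation_data=val_data,
        epochs=epochs,
        callbacks=callbacks,
    )
```

The returned object exposes per-epoch metrics in its `history` attribute, which is what the loss and accuracy plots are drawn from.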

4.5. Result

  • Training and validation loss

Screenshot 2024-04-11 222217

  • Training and validation accuracy

Screenshot 2024-04-11 222312
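Curves like these are typically drawn from the dictionary stored in the `history` attribute of the object returned by Keras `fit`. The sketch below assumes the standard keys `loss`, `val_loss`, `accuracy`, and `val_accuracy`. The helper names and the numbers in `toy_history` are made up for illustration and are not the project's results.

```python
def best_epoch(history, metric="val_loss"):
    """Return (1-based epoch, value) where `metric` is lowest."""
    values = history[metric]
    index = min(range(len(values)), key=values.__getitem__)
    return index + 1, values[index]


def plot_history(history):
    """Plot loss and accuracy curves side by side; returns the figure."""
    import matplotlib.pyplot as plt  # imported lazily; requires matplotlib

    fig, (ax_loss, ax_acc) = plt.subplots(1, 2, figsize=(10, 4))
    ax_loss.plot(history["loss"], label="training")
    ax_loss.plot(history["val_loss"], label="validation")
    ax_loss.set_xlabel("epoch")
    ax_loss.set_ylabel("loss")
    ax_loss.legend()

    ax_acc.plot(history["accuracy"], label="training")
    ax_acc.plot(history["val_accuracy"], label="validation")
    ax_acc.set_xlabel("epoch")
    ax_acc.set_ylabel("accuracy")
    ax_acc.legend()
    return fig


# Made-up values for illustration only (not the project's results).
toy_history = {
    "loss": [1.2, 0.6, 0.3, 0.2],
    "val_loss": [1.0, 0.5, 0.4, 0.45],
    "accuracy": [0.60, 0.80, 0.90, 0.94],
    "val_accuracy": [0.65, 0.82, 0.88, 0.87],
}

print(best_epoch(toy_history))  # (3, 0.4)
```

A validation loss that starts rising while the training loss keeps falling, as in the last toy epoch, is the usual sign of overfitting.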

4.6. Test the model with real data

  • Results on the test dataset

Screenshot 2024-04-11 222426
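Because this README reports both accuracy and precision, it may help to see how the two metrics differ. The sketch below computes them from integer class labels in plain Python. Macro averaging is an assumption, since the averaging method behind the 99% figure is not stated, and the labels are toy values rather than the project's predictions.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted class equals the true class."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_precision(y_true, y_pred):
    """Unweighted mean over classes of TP / (number of predictions for that class)."""
    predicted = Counter(y_pred)
    true_positive = Counter(p for t, p in zip(y_true, y_pred) if t == p)
    classes = sorted(set(y_true) | set(y_pred))
    per_class = [
        true_positive[c] / predicted[c] if predicted[c] else 0.0
        for c in classes
    ]
    return sum(per_class) / len(per_class)


# Toy labels for three classes (illustration only).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]

print(f"accuracy:        {accuracy(y_true, y_pred):.4f}")
print(f"macro precision: {macro_precision(y_true, y_pred):.4f}")
```

On imbalanced datasets the two numbers can diverge noticeably, so it is worth stating which one a headline figure refers to.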

5. References

[1] Multi-classification deep learning models for detection of ulcerative colitis, polyps, and dyed-lifted polyps using wireless capsule endoscopy images.

I hope this article helps you.

If you have any questions, please contact me for help!

Kaggle: Tuan_AI

Thanks everyone!
