Mihaela Grigore
  • 👋About
  • 👩‍🏭Personal projects
    • Computer Vision | Deep Learning with Tensorflow & Keras (ResNet50, GPU training)
    • Computer Vision | Convolutional Neural Networks with PyTorch
    • Computer Vision | Facial Recognition with Keras, FaceNet, Inception, Siamese Networks
    • NLP | Topic modeling on tweets
    • NLP | Sentiment analysis of tweets: TextBlob, VADER and Flair
    • Time series | Exploration on Crypto price dataset
    • Data scraping | Social Media Scraping: Twitter Developer API for Academics
    • Data Scraping | Collecting historical tweets without Twitter API
  • ✍️Notes
    • Machine Learning in Production
      • Feature transforms
      • Feature selection
      • Data journey
    • NLP
      • Information Retrieval
    • Computer Vision
    • Time series
      • Stationarity
    • Data
      • Labeling
    • Python
      • ndarray slicing with index out of bounds
  • 📚Readings & other media
    • Computer Vision
      • Selection of research articles
    • NLP
      • Handwriting Text
      • Information Retrieval
      • Mono- / multilingual
      • Topic Modeling
      • Language Models
    • Time Series
    • Generative Adversarial Netoworks (GAN)
    • Python
      • Python basics
Powered by GitBook
On this page
  • Handwriting text recognition (HTR)
  • Handwriting text generation (HTG)
  1. Readings & other media
  2. NLP

Handwriting Text

PreviousNLPNextInformation Retrieval

Last updated 3 years ago

Handwriting text recognition (HTR)

  • , 2016. -> using convolutional recurrent neural network (CRNN) architecture for scene text recognition

  • -> using an RCNN to estimate an n-gram profile of an image and match it to the profile of an existing word from a dictionary

  • IEEE, 2016. -> improves the architecture from Poznanski et al 2016

  • . -> use attention decoder instead of RCNN outputs

  • . -> they mix several strategies developed in previous years into one architecture

  • -> advances in text scene recognition

Handwriting text generation (HTG)

  • 2013. -> LSTM for generating text sequences

  • 2019 -> extends Graves 2013 with a Discriminator and converts it into a GAN paradigm

  • 2018. -> aka DeepWriting; introduces control of the styling for Graves 2013.

  • 2019 -> generate text images conditioned on input text; for this, they extend a BigGAN architecture (Brock et al. 2018); the generator can produce images of fied size but containing text of variable length

  • 2020 -> build upon Alonso et al 2019 to produce a network that generates images with both variable text length and variable image size.

📚
Baoguang Shi, Xiang Bai, and Cong Yao. An endto- end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE transactions on pattern analysis and machine intelligence, 39(11):2298–2304
Arik Poznanski and Lior Wolf. Cnn-n-gram for handwriting word recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2305–2314, 2016.
Sebastian Sudholt and Gernot A Fink. Phocnet: A deep convolutional neural network for word spotting in handwritten documents. In 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pages 277–282.
Jorge Sueiras, Victoria Ruiz, Angel Sanchez, and Jose F Velez. Offline continuous handwriting recognition using sequence to sequence neural networks. Neurocomputing, 289:119–128, 2018
Kartik Dutta, Praveen Krishnan, Minesh Mathew, and CV Jawahar. Improving cnn-rnn hybrid networks for handwriting recognition. In 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), pages 80–85. IEEE, 2018
Jeonghun Baek, Geewook Kim, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, and Hwalsuk Lee. What is wrong with scene text recognition model comparisons? dataset and model analysis, 2019.
Alex Graves. Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850
Bo Ji and Tianyi Chen. Generative adversarial network for handwritten text. arXiv:1907.11845,
Emre Aksan, Fabrizio Pece, and Otmar Hilliges. Deepwriting: Making digital ink editable via deep generative modeling. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, pages 1–14
Eloi Alonso, Bastien Moysset, and Ronaldo Messina. Adversarial generation of handwritten text images conditioned on sequences. arXiv preprint arXiv:1903.00277,
Sharon Fogel, Hadar Averbuch-Elor, Sarel Cohen, Shai Mazor, Roee Litman. ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation. arXiv:2003.10557,