# Handwriting Text

### Handwriting text recognition (HTR)

* [Baoguang Shi, Xiang Bai, and Cong Yao. An endto- end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE transactions on pattern analysis and machine intelligence, 39(11):2298–2304](https://ieeexplore.ieee.org/document/7801919), **2016**.\
  -> using convolutional recurrent neural network (CRNN) architecture for scene text recognition
* [Arik Poznanski and Lior Wolf. Cnn-n-gram for handwriting word recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2305–2314, **2016**.](https://www.cs.tau.ac.il/~wolf/papers/CNNNGram.pdf)\
  -> using an RCNN to estimate an n-gram profile of an image and match it to the profile of an existing word from a dictionary
* [Sebastian Sudholt and Gernot A Fink. Phocnet: A deep convolutional neural network for word spotting in handwritten documents. In 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pages 277–282.](https://arxiv.org/abs/1604.00187) IEEE, **2016**.\
  -> improves the architecture from Poznanski et al 2016
* [Jorge Sueiras, Victoria Ruiz, Angel Sanchez, and Jose F Velez. Offline continuous handwriting recognition using sequence to sequence neural networks. Neurocomputing, 289:119–128, **2018**](https://dl.acm.org/doi/abs/10.1016/j.neucom.2018.02.008).\
  -> use attention decoder instead of RCNN outputs
* [Kartik Dutta, Praveen Krishnan, Minesh Mathew, and CV Jawahar. Improving cnn-rnn hybrid networks for handwriting recognition. In 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), pages 80–85. IEEE, **2018**](https://cdn.iiit.ac.in/cdn/cvit.iiit.ac.in/images/ConferencePapers/2018/improving-cnn-rnn.pdf).\
  -> they mix several strategies developed in previous years into one architecture
* [Jeonghun Baek, Geewook Kim, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, and Hwalsuk Lee. What is wrong with scene text recognition model comparisons? dataset and model analysis, **2019**.](https://arxiv.org/abs/1904.01906)\
  -> advances in text scene recognition

### Handwriting text generation (HTG)

* [Alex Graves. Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850](https://arxiv.org/abs/1308.0850) **2013**.\
  -> LSTM for generating text sequences
* [Bo Ji and Tianyi Chen. Generative adversarial network for handwritten text. arXiv:1907.11845,](https://arxiv.org/abs/1907.11845) **2019**\
  -> extends Graves 2013 with a Discriminator and converts it into a GAN paradigm
* [Emre Aksan, Fabrizio Pece, and Otmar Hilliges. Deepwriting: Making digital ink editable via deep generative modeling. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, pages 1–14](https://dl.acm.org/doi/10.1145/3173574.3173779) **2018**.\
  -> aka DeepWriting; introduces control of the styling for Graves 2013.
* [Eloi Alonso, Bastien Moysset, and Ronaldo Messina. Adversarial generation of handwritten text images conditioned on sequences. arXiv preprint arXiv:1903.00277,](https://arxiv.org/abs/1903.00277) **2019**\
  -> generate text images conditioned on input text; for this, they extend a BigGAN architecture (Brock et al. 2018); the generator can produce images of fied size but containing text of variable length
* [Sharon Fogel, Hadar Averbuch-Elor, Sarel Cohen, Shai Mazor, Roee Litman. ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation. arXiv:2003.10557,](https://arxiv.org/abs/2003.10557) **2020**\
  -> build upon Alonso et al 2019 to produce a network that generates images with both variable text length and variable image size.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://grigore-mihaela.gitbook.io/machine-learning/readings-and-other-media/nlp/handwriting-text.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
