A deep learning based Android application that takes an image as input and automatically generates a caption for it. Caption generation is a challenging task in Artificial Intelligence, where a textual description of an image must be generated.
The model is based on an attention-based CNN-RNN network, which allows it to focus on selective regions of the image while generating the description, much like the way humans perceive the visual world.
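As a rough illustration of this idea, below is a minimal sketch of an additive (Bahdanau-style) attention layer of the kind used in such CNN-RNN captioning models; the layer names, sizes, and TensorFlow framing are assumptions, not necessarily the exact implementation used here.

```python
import tensorflow as tf

class BahdanauAttention(tf.keras.layers.Layer):
    """Additive attention over CNN feature vectors (sketch; sizes are assumptions)."""
    def __init__(self, units):
        super().__init__()
        self.W1 = tf.keras.layers.Dense(units)   # projects image features
        self.W2 = tf.keras.layers.Dense(units)   # projects the decoder hidden state
        self.V = tf.keras.layers.Dense(1)        # scores each image region

    def call(self, features, hidden):
        # features: (batch, num_regions, feature_dim) from the CNN encoder
        # hidden:   (batch, units) previous RNN decoder state
        hidden_with_time = tf.expand_dims(hidden, 1)
        score = tf.nn.tanh(self.W1(features) + self.W2(hidden_with_time))
        attention_weights = tf.nn.softmax(self.V(score), axis=1)  # (batch, num_regions, 1)
        context_vector = tf.reduce_sum(attention_weights * features, axis=1)
        return context_vector, attention_weights
```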
Dataset used: The model is trained on the MS-COCO dataset, a standard benchmark for object detection, segmentation, and image captioning.
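For context, MS-COCO distributes its captions as a JSON annotations file. A minimal sketch of pairing captions with their image files could look like the following; the file names, directory layout, and `<start>`/`<end>` tokens are assumptions.

```python
import json
import collections

# Sketch: group MS-COCO captions by image (paths/file names are assumptions).
with open('annotations/captions_train2017.json', 'r') as f:
    annotations = json.load(f)

image_to_captions = collections.defaultdict(list)
for ann in annotations['annotations']:
    # COCO 2017 images are named by zero-padded image id, e.g. 000000391895.jpg
    image_path = f"train2017/{ann['image_id']:012d}.jpg"
    image_to_captions[image_path].append('<start> ' + ann['caption'] + ' <end>')

print(len(image_to_captions), 'images with captions loaded')
```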
Flow of the process:
- Loading the dataset
- Preprocessing the images
- Preprocessing and tokenizing the captions and defining the vocabulary
- Choosing a pre-trained model for image feature extraction (see the sketch after this list)
- Splitting the data into training and testing sets
- Defining the model architecture
- Training the model
- Testing the model
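A minimal sketch of the image-preprocessing, pre-trained encoder, and caption-tokenization steps is shown below; InceptionV3, the vocabulary size, and the tokenizer settings are assumptions rather than the exact choices made in this project.

```python
import tensorflow as tf

# --- Image preprocessing for a pre-trained CNN encoder (InceptionV3 is an assumed choice) ---
def load_image(image_path):
    img = tf.io.read_file(image_path)
    img = tf.io.decode_jpeg(img, channels=3)
    img = tf.image.resize(img, (299, 299))  # InceptionV3 expects 299x299 input
    img = tf.keras.applications.inception_v3.preprocess_input(img)
    return img, image_path

# Reuse InceptionV3 without its classification head as a fixed feature extractor.
base = tf.keras.applications.InceptionV3(include_top=False, weights='imagenet')
feature_extractor = tf.keras.Model(base.input, base.layers[-1].output)

# --- Caption tokenization and vocabulary definition (sizes/special tokens are assumptions) ---
captions = ['<start> a man riding a wave on a surfboard <end>']  # placeholder caption list
tokenizer = tf.keras.preprocessing.text.Tokenizer(
    num_words=5000, oov_token='<unk>',
    filters='!"#$%&()*+.,-/:;=?@[\\]^_`{|}~ ')
tokenizer.fit_on_texts(captions)
sequences = tokenizer.texts_to_sequences(captions)
padded = tf.keras.preprocessing.sequence.pad_sequences(sequences, padding='post')
```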
The results on the validation dataset were:
The unseen image given was:
The results obtained from the model on the unseen image are shown below: