Image Captioning App

About: Used Merge model developed by Tanti et al. in 2017 as a refereance for the base model for image captioning

Dataset: Acquired dataset from kaggle : Flicker8k

Here i used Tensorflow tokenizers to get a word dictionary for encoding and de-coding for sequence processing.
Used Tensorflow library for preprocessing the text data.
For giving the image input to the model, I used various different architecture's for transfer learning.
Got the best loss by using VGG16. Rather then using cool architecture focused to solve the business problem with the best solution possible.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
results		results
.gitignore		.gitignore
Readme.md		Readme.md
app.py		app.py
flicker3-iamge-cap (1).ipynb		flicker3-iamge-cap (1).ipynb
img_cap.ipynb		img_cap.ipynb
model_vgg16.h5		model_vgg16.h5
process_img.p		process_img.p

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Captioning App

About: Used Merge model developed by Tanti et al. in 2017 as a refereance for the base model for image captioning

Dataset: Acquired dataset from kaggle : Flicker8k

Here i used Tensorflow tokenizers to get a word dictionary for encoding and de-coding for sequence processing.

Used Tensorflow library for preprocessing the text data.

For giving the image input to the model, I used various different architecture's for transfer learning.

Got the best loss by using VGG16. Rather then using cool architecture focused to solve the business problem with the best solution possible.

The model used for input of the processed data as:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Image Captioning App

About: Used Merge model developed by Tanti et al. in 2017 as a refereance for the base model for image captioning

Dataset: Acquired dataset from kaggle : Flicker8k

Here i used Tensorflow tokenizers to get a word dictionary for encoding and de-coding for sequence processing.

Used Tensorflow library for preprocessing the text data.

For giving the image input to the model, I used various different architecture's for transfer learning.

Got the best loss by using VGG16. Rather then using cool architecture focused to solve the business problem with the best solution possible.

The model used for input of the processed data as:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages