SeqGAN VAE Paraphrasing

About The Project

This project, Paraphrasing, is an implementation of "An End-to-End Generative Architecture for Paraphrase Generation".

Getting Started

To get started, you should have prior knowledge of Python and PyTorch.

Installation and Run

Clone the repo

git clone https://github.com/phkhanhtrinh23/seqgan_vae_paraphrasing.git

Use any code editor to open the folder seqgan_vae_paraphrasing.

Step-by-step

Read and run data.py to convert data/train.csv to a compatible format. The dataset originates from Quora Question Pairs (QQP).
Read and run train.py to train the SeqGAN VAE model. The model architecture originates from "An End-to-End Generative Architecture for Paraphrase Generation".
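
As a rough illustration of the conversion step, the sketch below keeps only the QQP rows labelled as duplicates and yields them as (source, target) pairs. It assumes the standard QQP train.csv columns (id, qid1, qid2, question1, question2, is_duplicate). The function name extract_paraphrase_pairs and the sample rows are hypothetical, and the actual logic and output format of data.py may differ.

```python
import csv
import io


def extract_paraphrase_pairs(csv_file):
    """Yield (question1, question2) for rows labelled as duplicates.

    Hypothetical helper for illustration; not taken from data.py.
    """
    reader = csv.DictReader(csv_file)
    for row in reader:
        # In QQP, is_duplicate == "1" marks a pair judged to be paraphrases.
        if row.get("is_duplicate") == "1":
            q1 = (row.get("question1") or "").strip()
            q2 = (row.get("question2") or "").strip()
            # Skip rows where either question is missing or empty.
            if q1 and q2:
                yield q1, q2


# Tiny in-memory sample using the QQP column layout (the rows are made up).
sample = io.StringIO(
    "id,qid1,qid2,question1,question2,is_duplicate\n"
    "0,1,2,How do I learn Python?,What is the best way to learn Python?,1\n"
    "1,3,4,What is a GAN?,How tall is Mount Everest?,0\n"
)
pairs = list(extract_paraphrase_pairs(sample))
print(pairs)
```

To try this on the real file, replace the in-memory sample with open("data/train.csv", newline="", encoding="utf-8").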

Results

Description:

Inp: the input data.
Pre: the prediction from the model.
Tar: the target/label data (ground truth).

Note 1: <eos> is just the end-of-sentence token.

Note 2: QQP covers only question paraphrasing, so this model may not work well on normal (non-question) sentences. Moreover, some QQP examples are poorly suited to training because their inputs and labels are low quality. Sometimes the model produces much better paraphrases than the QQP labels.
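
When reading the Inp, Pre, and Tar lines, it can help to cut each sentence at the <eos> token from Note 1. The sketch below is one possible way to do that. The helper name strip_eos and the example sentence are hypothetical and are not taken from inference.py.

```python
EOS = "<eos>"


def strip_eos(sentence, eos_token=EOS):
    """Return the text before the first end-of-sentence token."""
    # partition() returns the whole string as `head` when the token is absent.
    head, _, _ = sentence.partition(eos_token)
    # Collapse any leftover whitespace around the tokens.
    return " ".join(head.split())


# Made-up model output, for illustration only.
print(strip_eos("how can i learn python ? <eos> <eos>"))
```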

Contribution

Contributions are what make GitHub such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

Fork the project
Create your Contribute branch: git checkout -b contribute/Contribute
Commit your changes: git commit -m 'add your messages'
Push to the branch: git push origin contribute/Contribute
Open a pull request

Contact

Email: phkhanhtrinh23@gmail.com

Project Link: https://github.com/phkhanhtrinh23/seqgan_vae_paraphrasing.git
