Final Project
A grid of Machine Generated Leaves by Aarati
You and your team will create a training dataset of images together throughout the rest of the semester. At the end of the semester, we will train a Generative Adversarial Network on your dataset. You will not be assessed on the results of the GAN, but rather the thoughtfulness, care, and consideration of your dataset creation process. You will need to have at least 500 images within your dataset so please take that into consideration when developing your project idea.
Technical Specifications:
- Each image must be square. (You can crop your images down or add extra space on either side to make them square.)
- You don't need to label your images with categories or descriptions. We will be using StyleGAN which just takes a dataset of images and generates images based on the training data.
- You can be use .jpg or .png file format but make sure all your files are the same file type (so all jpegs or all pngs)
- If you are downloading or scraping images, you must have the legal right to use these images (i.e. they are public domain)
- You will upload your dataset to a google drive folder that I will create for your group.
Tips:
- Try to keep your images a maximum of 1024px by 1024px and/or under 1mb so that you don't have to keep transferring super large files to your drive.
- You can also use video stills. This is a great way to generate thousands of images easily without taking individual photos. I have a script for automating the process of splitting video into image frames. Please reach out if you need this script.
- Upload as you go. Don't wait until the last minute to transfer your images to the google drive (you don't want a situation where your external harddrive breaks, or phone gets stolen and you lose all your work!)
- The more homogenous your images are, the "better" your GAN will work. The more diverse your images are, the more images you will need to get an output that resembles the input images.
- In regards to the above point, you can edit your images to create homogeneity. For example, I have done this with a dataset of faces where I used facial recognition to align the images so that the eyes are in the same position across the images. Reach out to me if you have alignment questions and I will be able to tell you whether alignment can be automated or needs to be manually done.
- If you are downloading or scraping images and need to crop them to be square all at once, I have a script for automating this. Please reach out to me.
Submission schedule:
Inspo: