ImageSquish is a standalone service that resizes your images on the fly. It is easy to setup, scales linearly, is cheap to operate because it is very efficient, requires no integration and provides the best experience possible for your users.
#Setup Instructions
##Environment Variables
- AWS_ACCESS_KEY_ID (required)
- AWS_SECRET_ACCESS_KEY (required)
- IMAGESQUISH_PORT (optional) - The port ImageSquish will listen on. Default is 3000.
- MAX_CONCURRENT_PROXY_STREAMS (optional) - The maximum number of images that ImageSquish will stream at once. If traffic exceeds this limit, it will be queued up. Default is 20. If your server has excellent I/O performance, I suggest increasing this.
- MAX_CONCURRENT_MANIPULATIONS (optional) - The maximum number of images that will be manipulated (e.g. resized) in parallel. The default is the number of CPUs available. Most of the time it makes sense to leave this alone.
- CONFIG_SOURCE (optional) - Set to "etcd" to provide your bucket configurations via etcd. Probably only interesting if you are using CoreOS.
- LOG_LEVEL (optional) - Set the logging level to debug to get more info or none to avoid any (default is info)
- ETCD_HOST (optional) - Provide a non-standard etcd host (default is 127.0.0.1)
- ETCD_PORT (optional) - Provide a non-standard etcd port (default is 4001)
- NEWRELIC_ENABLED (optional) - Enable NewRelic tracking
- NEWRELIC_LICENSE_KEY _(required if NEWRELIC_ENABLED is 1)
- NEWRELIC_LOG_LEVEL (optional) - Default: info
- NEWRELIC_APP_NAME (optional) - Default: ImageSquish
##Bucket Configuration
The bucket configuration is what tells ImageSquish where to find your original images (your origin server), and what manipulations are available (plus some more goodies). For example, you might define "small" to mean that the image should be resized to 75x75.
The bucket configuration is always provided in JSON format, but can be stored in two ways:
- Map the /data/config volume to a local directory and put your configuration in config.json in that directory.
- Tell ImageSquish via env variables to use etcd for the bucket config (see CONFIG_SOURCE env variable above) and then store this config in the /imagesquish key.
Here is an example bucket config.
##Run Natively
- Install graphics magick (image magik might work too, but i haven't tested it)
npm install .
(sorry it is not available on npm registry yet)- Set required configuration parameters
- Start with
forever start app.js
##Run On Docker
docker run -e AWS_ACCESS_KEY_ID=XXX \
-e AWS_SECRET_ACCESS_KEY=XXX \
-p 3000:3000 \
-v /data/config:/path/to/my/configs \
dmcquay/imagesquish`
The /data/config volume mapping can be omitted if you are using etcd for configuration.
#Deploying using CoreOS
I like deploying on CoreOS, so I've provided an example fleet Unit File to make things easy. Just replace your AWS credentials at a minimum, set your bucket config in etcd, and then spin up 3 instances like this:
fleetctl start imagesquish@1.service
fleetctl start imagesquish@2.service
fleetctl start imagesquish@3.service
Setting your config in etcd is slightly weird. I suggest keeping your config in a flat file somewhere to make maintenance and change tracking easier. To apply to etcd, copy the contents of the file, open a JavaScript console (like Chrome developer tools), type a=[paste your code here]
and then JSON.stringify(a)
. Copy the results of that and set in etcd like this: etcdctl set /imagesquish '[paste here]'
.
You have this image on your site
http://www.mysite.com/media/profile_images/12345.jpg
Configure ImageSquish to tell it the following:
- Your images are found at
www.mysite.com
- Give it an S3 bucket to store sizes in
- Configure your desired sizes (for example, we might configure "small" to be 100x100)
Start the ImageSquish service
forever start app.js
Assuming you are running ImageSquish on images.mysite.com
, request the small size with this URL:
http://images.mysite.com/default/small/media/profile_images/12345.jpg
- Users shouldn't have to wait for sizes to be generated upon upload or see placeholder images while sizes are generated.
- Your app server shouldn't be resizing images. It is CPU and memory hungry and scales differently.
- Images should be generated in parallel when needed, and fast.
- When you need a new image size, you shouldn't have to make some script to reprocess all existing images.
- It should be really easy and scale effortlessly so you can worry about more important things.
- ImageSquish is a standalone service. You simply edit the configuration file and then start the service.
- Image sizes are stored in S3. When you request an image in a certain size from ImageSquish, it makes a request to S3 where that image should be found and streams it back to the user. It acts very much like a reverse proxy.
- If the image is not found on S3, the 404 response is detected and ImageSquish then generates the size and stores it in S3. It then repeats step 2.
- Though ImageSquish is robust enough to be used without a CDN, I still highly recommend using a CDN.
- Scale easily by adding more servers and putting them behind a load balancer or round robin DNS. It scales linearly. The nodes do not need to know about each other.
- ImageSquish has been tested at high loads. If you manage to exceed what it can handle, it will not crumble. The requests will simply queue up very efficiently with low memory consumption and wait until your server can catch up.
- Install node. Good instructions here: https://github.com/joyent/node/wiki/Installing-Node.js-via-package-manager#ubuntu-mint
- git-clone this repo or download the latest tarball. Just put it in your home directory.
- Install graphicsmagick.
sudo apt-get install graphicsmagick
or whatever, depending on your OS. - Run
npm install
to install all dependencies. - Create config/aws.json with your AWS credentials. (see example below)
- Create config/config.json. (see example below)
- Run using Forever.
npm install forever -g
and thenforever start app.js
. Learn more about forever at http://blog.nodejitsu.com/keep-a-nodejs-server-up-with-forever.
Note about running on port 80. Only root can do that, but you should not run this as root. Instead, I suggest one of these options:
- Use iptables to redirect port 80 to another port.
sudo iptables -t nat -A PREROUTING -i eth0 -p tcp --dport 80 -j REDIRECT --to-port 3000
- Just run it on port 3000. You should be putting a CDN in front of this anyway, which will hide your weird port from your users.
Each operation and parameters are passed directly to graphics magick.
See docs here: http://aheckmann.github.io/gm/
Some custom operations are also provided. Feel free to fork and add to this.
https://github.com/dmcquay/imagesquish/blob/master/operations.js
Once you have ImageSquish server configured, all that is left is to form your URLs correctly.
http://{imagesquish_host}/{bucket}/{manipulation}/{image_path}
In the configuration example above, let's say I have the following original image on my server.
http://www.mysite.com/profile_images/12345.jpg
And let's say I have ImageSquish running on the host images.mysite.com and I want to the small
size (aka "manipulation") that we defined above.
http://images.mysite.com/test/small/profile_images/12345.jpg
Where "test" is the bucket, "small" is the manipulation and "profile_images/12345.jpg" is the image path.
If you add "allowOTFManipulations": true
to your bucket config, you could also request it like this:
http://images.mysite.com/test/otf:resize(100,100)/profile_images/12345.jpg
The important config options have already been mentioned, but here's a detailed list including some we haven't covered yet.
port
- The port ImageSquish should listen on. Defaults to 3000.maxConcurrentProxyStreams
- The maximum number of images that can be streamed from S3 or the origin server to the user. 20-40 is recommended. Defaults to 20.maxConcurrentManipulations
- The maximum number of uncached manipulations that can be executed concurrently. As you surpass the number of CPUs on your server, you'll soon start to see performance degredation. It defaults to the number of CPUs available. I recommend you use this default.
originHost
- Required. The host where your original images can be found.originPathPrefix
- Optional. Will be prefixed to all your image paths. Useful for making your ImageSquish URLs shorter. So instead ofhttp://images.mysite.com/default/small/profile_images/12345.jpg
, you can set"originPathPrefix": "profile_images/"
and then your URL becomeshttp://images.mysite.com/default/small/12345.jpg
.manipulationsS3Bucket
- Required. The Amazon S3 bucket where your cached images should be storedmanipulationKeyFormat
- The S3 key for storing your manipulations. Defaults toimagesquish/{bucket}/{manipulation}/{imgId}
. There should be no need to change this.manipulations
- See the config example aboveallowOTFManipulations
- Defeault isfalse
. If this istrue
, then you don't have to pre-configure your manipulations. Instead you can just form URLs like this:http://images.mysite.com/default/otf:resize(100)/12345.jpg
. This makes you vulnerable to DOS attacks, so if you want to be extra careful, leave this disabled.
You can get some status information at /status which looks something like this. It should give you a pretty good idea of how well your instance is handling current load.
{
"manipulations": {
"currentCount": 1,
"limit": 1,
"currentQueueSize": 1,
"averageQueueSizes": {
"lastMinute": 0.16666666666666666,
"lastHour": 0.01680672268907563,
"lastDay": 0.0033277870216306157
},
"activeManipulations": [
"imagesquish/test/otf:resize(163,121)/IMG_1296.JPG",
"imagesquish/test/otf:resize(152,168)/IMG_1296.JPG"
]
},
"proxyStreams": {
"currentCount": 1,
"limit": 1,
"currentQueueSize": 217,
"averageQueueSizes": {
"lastMinute": 145.25,
"lastHour": 14.647058823529411,
"lastDay": 7.063227953410982
}
}
}
#DEPLOYMENT SUGGESTIONS
##USE A CDN
ImageSquish is totally fast. It only generates a version of an image once. It is cached in S3 and ImageSquish just acts as a streaming proxy for S3 the next time the image is accessed. That being said, I suggest putting S3 behind a CDN. It is super easy and makes the service just that much more bullet proof.
##USE A LOAD BALANCER
To scale horizontally, simply add another node. If you're using fleet, it is as simple as another fleet start imagesquish@2.service
call. The nodes don't need to know anything about each other. Put them behind a load balancer and point the CDN to the load balancer and voila, you are ready to scale like crazy.
Here's a simple way to make ImageSquish really easy to use with Django. Just use this field in your model definition instead of models.ImageField.
import re
from django.conf import settings
from django.db import models
from django.db.models.fields.files import ImageFieldFile
from south.modelsinspector import add_introspection_rules
class ImageSquishFieldFile(ImageFieldFile):
def __getattribute__(self, name):
match = re.match('^url_(.*)$', name)
if not match:
return object.__getattribute__(self, name)
else:
manipulation = match.group(1)
return object.__getattribute__(
self, 'get_manipulation_url')(manipulation)
def get_manipulation_url(self, manipulation):
return self.field.imagesquish_url_pattern.format(
base_url=settings.IMAGESQUISH_BASE_URL,
bucket=self.field.imagesquish_bucket,
manipulation=manipulation,
img_path=str(self))
class ImageSquishField(models.ImageField):
attr_class = ImageSquishFieldFile
imagesquish_bucket = 'default'
imagesquish_url_pattern = '{base_url}/{bucket}/{manipulation}/{img_path}'
add_introspection_rules([], ["^athlete\.db\.fields\.ImageSquishField"])
Then use in your model.
class UserProfile:
image = ImageSquishField(upload_to='profile_images')
And then you can get the "small" size like this:
profile.image.url_small
ImageSquish is being used in a few places right now:
Let us know if you're using it too!
ImageSquish is awesome and is easy to use, but if you get stuck, just submit an issue on GitHub and add the "question" label. I'm happy to help. You can also create a question in StackOverflow and use the imagesquish tag.
The MIT License (MIT)
Copyright (c) 2015 Dustin McQuay
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.