Notes for distributed cronjobs in AWS Scaling Architecture

ByConan December 25, 2013July 17, 2026

When designing the scalable systems which can run on multiple nodes, one common problem to face is how to deal with scheduled tasks which must be run on one instance, not in multiple instances. When checking the AWS architecture for a team working on AWS, I found a problem of multiple cronjobs running on multiple nodes which cause duplicated work. The key to solving distributed cronjobs in AWS Scaling Architecture is to have a locking method to guarantee that if a node is performing cron, no other nodes can be. Another approach is to have a centralized task handling system to deal with this. I note here some references which might be useful for your reference when dealing with this issue.

Implemented on Scalr system: http://highscalability.com/blog/2010/3/22/7-secrets-to-successfully-scaling-with-scalr-on-amazon-by-se.html

Answers from AWS Staff:

I did a quick poll of some of my colleagues and came up empty on the cron, but after sleeping on it I realised the important step may be limited to locking. So I looked for “distributed cron job locking” and found a reference to Zookeeper, an Apache project.
http://zookeeper.apache.org/doc/r3.2.2/recipes.html
http://highscalability.com/blog/2010/3/22/7-secrets-to-successfully-scaling-with-scalr-on-amazon-by-se.html
Also I have seen reference to using memcached or a similar caching mechanism as a way to create locks with a TTL. In this way you set a flag, with a TTL of 300 seconds and no other cron worker will execute the job. The lock will automatically be released after the TTL has expired. This is conceptually very similar to the SQS option we discussed yesterday.
Also see; Google’s chubby http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//archive/chubby-osdi06.pdf
Let me know if this helps, and feel free to ask questions, we are very aware that our services can be complex and daunting to both beginners and seasoned developers alike. We are always happy to offer architecture and best practice advice.

Performance & Scaling
Disable AMDs CPU-scaling (AMD cool and quiet)
ByConan December 13, 2012July 17, 2026
Our centos system with AMD Cool and Quite activated seems to be slower under load than without cpu scaling. Another point is running a vmware server on such a host – will this feature work probably? Follow these steps to turn it off, just to make sure 😉 To immediate turn off on running system:…
Read More Disable AMDs CPU-scaling (AMD cool and quiet)
Performance & Scaling
Some Linux performance testing commands
ByConan September 17, 2013July 17, 2026
This short tutorial mentions some Linux performance testing commands to measure server/VPS performance. This will be added gradually as soon as I have more tools to test 🙂 1. Test disk IO: $ dd if=/dev/zero of=test bs=64k count=16k conv=fdatasync 16384+0 records in 16384+0 records out 1073741824 bytes (1.1 GB) copied, 15.1643 s, 70.8 MB/s 2. Test disk…
Read More Some Linux performance testing commands
Performance & Scaling
Install HAProxy with SSL Termination
ByConan June 22, 2018July 17, 2026
These days I have been working with scaling solutions for a PHP framework. Previously I came with Nginx as load balancers, however, with the requirement of health check and failover, I need to come to HAProxy this time. So I write this entry as a note for installing HAProxy with SSL Termination. Most of my machines…
Read More Install HAProxy with SSL Termination
Performance & Scaling
Some concerns when building SaaS application [Beginner]
ByConan March 4, 2015July 17, 2026
First, we must keep in mind that the SaaS application architecture must support multi-tenant from beginning, otherwise we will struggle with scaling the application in the future. Application architecture level We must consider careful which is needed to separate: application, database, or what other else? It should use the same application level (source code) to…
Read More Some concerns when building SaaS application [Beginner]
Performance & Scaling
Load Testing with Locust
ByConan June 11, 2018July 17, 2026
You might be familiar with load testing tools such as Apache Benchmark (ab), siege, Apache JMeter and cloud services such as BlazeMeter, LoadImpact, Loader.io, etc. I tried many other tools, and found that there is no tool that completely satisfies me: ab and siege are too plain and simple without scenario, JMeter needs time for recording and defining cases…
Read More Load Testing with Locust
Performance & Scaling
Load Sharing with DNS
ByConan November 21, 2012July 17, 2026
This article discusses on how to do Load sharing with DNS (Domain Name System). Introduction: A DNS based approach is a classical approach to sharing the load between multiple servers. DNS responds to domain name look-up requests issued by clients and returns the corresponding IP address. DNS is an Internet service that translates domain names…
Read More Load Sharing with DNS

Similar Posts

Leave a Reply Cancel reply