Challenge: Open source tools failed to identify root cause of problems
Jacob Marcus, Senior Director of Engineering, and Praveen Subbarao, Senior Quality Assurance Manager, at Care.com were responsible for keeping the site's critical applications responsive. They primarily relied on open source tools to monitor performance, which could no longer meet their needs. “We were traversing through logs to find errors,” Subbarao said, which was a slow and tedious process. In addition, they noticed intermittent memory problems that couldn't be properly identified with the tools deployed. “We had on-and-off memory-related issues with the garbage collection. It wasn't completely understood what was holding onto memory at any point in time,” said Marcus.