What’s the overhead of running in Docker vs. bare metal?
I can't say. It's definitely not the same as running on bare metal. Then again, any benchmark in a test environment is just an approximation of the real thing. To see how anything will act in the real environment, you have to run it there.
In our use case, we wanted a quick and easy way to set up benchmarks that let us compare our software to our competitors' under the same conditions. Because Benchi can run the same benchmark scenario for different tools, the results are comparable with each other. We also run all benchmarks on an empty AWS EC2 instance to minimize other factors. But does that mean the collected results show the absolute limit of what the tools can handle? Probably not. Under different conditions the results can change, but that's just the nature of benchmarks.
Add to that the number of people who've applied for jobs using AI and then, when they turn up, turn out to have zero knowledge of what we hired them for.
There are a ton of employer-of-record companies that can hire for you. I've worked via Deel and Remote.com, both without any issues.