This is an area where FLOSS has an opportunity to shine. I think many of these algorithms are described in scientific papers and considering FLOSS is much more collaboration-prone, I'd really expect the best algorithms (except for the ones that require much training data) to soon be implemented. An example of a success case: AV1.
The problem is only one: PROPRIETARY APPLICATIONS
Could you write a custom and simplified Facebook Messenger client that would allow clear and complete navigation through hardware buttons or vocal commands? Abso-fucking-lutely!
Can you do it without Facebook's approval which will never come? Abso-fucking-lutely not!