I experimented with changing the system and user prompts for a while (see the
guide I used), as well as other parameters. If a hard length limit is very important to your
use case, the best answer is to limit by `max_tokens`. Be aware that the cutoff is not graceful:
even with the best system prompts, I saw responses end abruptly.
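
One way to soften the abrupt endings is to post-process a truncated response and drop the trailing partial sentence. The following is only a minimal sketch, not something from the guide: the helper names `trim_to_last_sentence` and `finalize` are made up for illustration, and it assumes your API reports a finish reason of `"length"` when the `max_tokens` limit was hit (as OpenAI-style chat APIs do; check your provider's docs).

```python
import re


def trim_to_last_sentence(text: str) -> str:
    """Drop a trailing partial sentence, if any.

    Returns the text unchanged when no sentence-ending punctuation is found.
    """
    # A sentence end here means '.', '!' or '?' followed by whitespace or end of text.
    matches = list(re.finditer(r"[.!?](?=\s|$)", text))
    if not matches:
        return text
    return text[: matches[-1].end()]


def finalize(text: str, finish_reason: str) -> str:
    """Trim only when the response was cut off by the token limit.

    Assumption: finish_reason == "length" signals a max_tokens cutoff.
    """
    if finish_reason == "length":
        return trim_to_last_sentence(text)
    return text


if __name__ == "__main__":
    cut_off = "Limit the output first. Then check whether the reply was cut"
    print(finalize(cut_off, "length"))
```

This loses the unfinished sentence rather than completing it, so it only helps if a slightly shorter answer is acceptable.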