just for reference, the current ops are
primop PrefetchByteArrayOp2 "prefetchByteArray2#" GenPrimOp
ByteArray# -> Int# -> ByteArray#
with llvm_only = True
primop PrefetchMutableByteArrayOp2 "prefetchMutableByteArray2#" GenPrimOp
MutableByteArray# s -> Int# -> State# s -> State# s
with has_side_effects = True
llvm_only = True
primop PrefetchAddrOp2 "prefetchAddr2#" GenPrimOp
Addr# -> Int# -> Addr#
with llvm_only = True
one of the things my patch (for 7.8) will do is add preliminary native code gen support for prefetch, admittedly in the form of Nops. (later work will fix that to be a bit more actionable)
any opinions that help us suss this out would be appreciated