Implement #[bytecode_instruction] #187

brson · 2023-06-07T00:39:23Z

#[bytecode_instruction] is a built-in attribute for native functions. It is applied only to the natives in 0x1::vector as an optimization. Calls to these functions do not emit bytecode for a native call but instead emit dedicated bytecode for these specific native functions.

These don't seem to be represented consistently in stackless bytecode:

While calls to these instructions have dedicated bytecode in normal move, in stackless bytecode they do not - they are just regular call instructions; but also the move model does not tell us these functions are called when we are making declarations - they do not show up in get_called_functions etc.

So it looks like our step of declaring called functions needs to have a special case of looking through the bytecode for calls to these functions in order to declare them.

The text was updated successfully, but these errors were encountered:

brson · 2023-06-07T00:40:52Z

move_compiler::naming::fake_natives is the code that deals with these special functions.

nvjle · 2023-06-07T01:57:47Z

move_compiler::naming::fake_natives is the code that deals with these special functions.

Yikes-- what an ugly wart. For the SBC path, it is pointless to lower them to specialized bytecode operations in the first place. What makes sense to me is to simply have the front-end ignore the directive for cfg Solana. If the SBC wants to turn them back into the calls they originally were, it should do so consistently. I'd actually call this a defect-- worse than a mere wart. It's all fixable in their code, but we're trying to avoid touching and/or fixing their problems.

nvjle · 2023-06-07T13:31:42Z

move_compiler::naming::fake_natives is the code that deals with these special functions.

Yikes-- what an ugly wart. For the SBC path, it is pointless to lower them to specialized bytecode operations in the first place. What makes sense to me is to simply have the front-end ignore the directive for cfg Solana. If the SBC wants to turn them back into the calls they originally were, it should do so consistently. I'd actually call this a defect-- worse than a mere wart. It's all fixable in their code, but we're trying to avoid touching and/or fixing their problems.

Following myself up-- it appears that the EVM translator also ignores them (move_compiler::to_bytecode::translate::module_call).

fn module_call(
    context: &mut Context,
    loc: Loc,
    code: &mut IR::BytecodeBlock,
    mident: ModuleIdent,
    fname: FunctionName,
    tys: Vec<H::BaseType>,
) {
    use IR::Bytecode_ as B;
    match fake_natives::resolve_builtin(&mident, &fname) {
        // TODO full evm support for vector bytecode instructions
        Some(mk_bytecode) if !cfg!(feature = "evm-backend") => {
            code.push(sp(loc, mk_bytecode(base_types(context, tys))))
        }
        _ => {
            let (m, n) = context.qualified_function_name(&mident, fname);
            code.push(sp(loc, B::Call(m, n, base_types(context, tys))))
        }
    }
}

brson · 2023-06-07T22:11:25Z

It would be easier to just turn this feature off in the compiler, though we would have to modify the compiler to do it, which I am reluctant to do - it would be more desirable to operate on standard move bytecode and not require any compiler modifications.

A flag to disable this feature seems like a reasonable thing to push upstream since standard move can work without these special vector bytecodes; though I am not confident upstream is engaged enough to consider such a patch.

The hacks needed to work around this in the llvm code generator are not too onerous. And maybe there are still apis I'm not seeing in the move model to figure out which of these fake natives have been used and declare them.

nvjle · 2023-06-07T22:39:56Z

It would be easier to just turn this feature off in the compiler, though we would have to modify the compiler to do it, which I am reluctant to do - it would be more desirable to operate on standard move bytecode and not require any compiler modifications.

A flag to disable this feature seems like a reasonable thing to push upstream since standard move can work without these special vector bytecodes; though I am not confident upstream is engaged enough to consider such a patch.

The hacks needed to work around this in the llvm code generator are not too onerous. And maybe there are still apis I'm not seeing in the move model to figure out which of these fake natives have been used and declare them.

Did you see my other comment above? Turning it off is simple and is a one-line change. The EVM compiler does the same thing-- guarding the builtin conversion with !cfg!(feature = "evm-backend"). See the code block in the comment above for resolve_builtin.

nvjle · 2023-06-16T17:38:30Z

Closing issue-- this has been fixed by #199.

brson added the enhancement New feature or request label Jun 7, 2023

brson self-assigned this Jun 7, 2023

nvjle closed this as completed Jun 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement #[bytecode_instruction] #187

Implement #[bytecode_instruction] #187

brson commented Jun 7, 2023 •

edited

brson commented Jun 7, 2023 •

edited

nvjle commented Jun 7, 2023 •

edited

nvjle commented Jun 7, 2023

brson commented Jun 7, 2023

nvjle commented Jun 7, 2023

nvjle commented Jun 16, 2023

Implement #[bytecode_instruction] #187

Implement #[bytecode_instruction] #187

Comments

brson commented Jun 7, 2023 • edited

brson commented Jun 7, 2023 • edited

nvjle commented Jun 7, 2023 • edited

nvjle commented Jun 7, 2023

brson commented Jun 7, 2023

nvjle commented Jun 7, 2023

nvjle commented Jun 16, 2023

brson commented Jun 7, 2023 •

edited

brson commented Jun 7, 2023 •

edited

nvjle commented Jun 7, 2023 •

edited