Fix MasterPodMoveTimeout field that cannot be unmarshalled #816

ghost · 2020-02-05T14:40:45Z

I tried using the default CRD-based operator configuration from here, but it failed:

time="2020-02-05T13:48:02Z" level=fatal msg="unable to read operator configuration: could not get operator configuration object \"postgres-operator-configuration\": v1.OperatorConfiguration.Configuration: v1.OperatorConfigurationData.Kubernetes: v1.KubernetesMetaConfiguration.MasterPodMoveTimeout: readUint64: unexpected character: \xff, error found in #10 byte of ...|timeout\":\"20m\",\"oaut|..., bigger context ...|\"enable_sidecars\":true,\"master_pod_move_timeout\":\"20m\",\"oauth_token_secret_name\":\"postgresql-operato|..." pkg=controller

From what I can tell, attempting to unmarshal the MasterPodMoveTimeout field from a CRD-based operator configuration fails because the time.Duration type is not unmarshallable.

The solution to this seems to be to use the Duration type instead:

postgres-operator/pkg/apis/acid.zalan.do/v1/operator_configuration_type.go

Lines 192 to 193 in 8794e4f

    //Duration shortens this frequently used name
    type Duration time.Duration

That type seems to be what's consistently used for other fields, and it is unmarshallable:

postgres-operator/pkg/apis/acid.zalan.do/v1/marshal.go

Lines 128 to 152 in 8794e4f

    // UnmarshalJSON convert to Duration from byte slice of json
    func (d *Duration) UnmarshalJSON(b []byte) error {
    	var (
    		v   interface{}
    		err error
    	)
    	if err = json.Unmarshal(b, &v); err != nil {
    		return err
    	}
    	switch val := v.(type) {
    	case string:
    		t, err := time.ParseDuration(val)
    		if err != nil {
    			return err
    		}
    		*d = Duration(t)
    		return nil
    	case float64:
    		t := time.Duration(val)
    		*d = Duration(t)
    		return nil
    	default:
    		return fmt.Errorf("could not recognize type %T as a valid type to unmarshal to Duration", val)
    	}
    }

I've tested locally and with this change the error disappeared.

erthalion · 2020-02-11T16:09:04Z

Good catch! I've checked, and looks like you're right. I'm afraid the same problem is with the configuration via configmap, although it's sort of deprecated by now.

erthalion · 2020-02-11T16:09:18Z

👍

Jan-M · 2020-02-11T16:12:20Z

👍

pcornelissen · 2020-02-12T05:17:49Z

FYI: this is not fixed. I just tried it and the "... readUint64: unexpected character:" error still happens with the postgresql-operator-default-configuration.yaml from a few minutes ago:

time="2020-02-12T05:12:31Z" level=fatal msg="unable to read operator configuration: could not get operator configuration object \"postgresql-operator-default-configuration\": v1.OperatorConfiguration.Configuration: v1.OperatorConfigurationData.Kubernetes: v1.KubernetesMetaConfiguration.MasterPodMoveTimeout: readUint64: unexpected character: \xff, error found in #10 byte of ...|timeout\":\"20m\",\"oaut|..., bigger context ...|\"enable_sidecars\":true,\"master_pod_move_timeout\":\"20m\",\"oauth_token_secret_name\":\"postgresql-operato|..." pkg=controller

tclass · 2020-02-14T14:58:33Z

is there some quick fix for this?

pcornelissen · 2020-02-14T15:27:51Z

I just removed the value, but I don't know which value is used in that case ;)
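
As a rough illustration of what happens at the decoding step when the key is removed, here is a small sketch. It assumes the standard encoding/json package and a hypothetical, cut-down kubernetesConfig struct, not the operator's real type. Whether the operator substitutes a default of its own afterwards is not established in this thread.

```go
package main

import (
	"encoding/json"
	"fmt"
	"time"
)

// kubernetesConfig is a hypothetical stand-in for the operator's
// configuration struct, reduced to two fields for the demo.
type kubernetesConfig struct {
	EnableSidecars       bool          `json:"enable_sidecars"`
	MasterPodMoveTimeout time.Duration `json:"master_pod_move_timeout,omitempty"`
}

// decode parses raw JSON into the stand-in struct.
func decode(raw string) (kubernetesConfig, error) {
	var cfg kubernetesConfig
	err := json.Unmarshal([]byte(raw), &cfg)
	return cfg, err
}

func main() {
	// master_pod_move_timeout is absent, so the field is never written to.
	cfg, err := decode(`{"enable_sidecars":true}`)
	fmt.Println("timeout:", cfg.MasterPodMoveTimeout, "error:", err)
}
```

With the key absent, decoding succeeds and the field keeps Go's zero value, a duration of zero, unless later code fills in something else.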

erthalion · 2020-02-17T10:23:11Z

> FYI: this is not fixed. I just tried it and the "... readUint64: unexpected character:" error still happens with the postgresql-operator-default-configuration.yaml from a few minutes ago:

Strange, I'll check. @pcornelissen just to make sure, did you build the operator from 00f00af or later?

pcornelissen · 2020-02-17T11:00:11Z

I checked out the repo on the 12th and used the files in Master, so the commit should be included.
The image used is: registry.opensource.zalan.do/acid/postgres-operator:v1.3.1

erthalion · 2020-02-17T11:57:26Z

> I checked out the repo on the 12th and used the files in Master, so the commit should be included.
> The image used is: registry.opensource.zalan.do/acid/postgres-operator:v1.3.1

I'm confused: do you use the prebuilt image postgres-operator:v1.3.1 (which doesn't include this fix yet), or did you build your own from the master branch?

pcornelissen · 2020-02-17T12:06:51Z

I checked out master and used the config files from there, which use the container image above. I don't know when the code is built, so I assumed that once this was fixed, the corresponding image would also be updated.
If that is not the case, could you trigger an image rebuild (and update the YAMLs), so other people don't fall into this trap as well?

FxKu · 2020-02-25T13:06:52Z

@pcornelissen included within the new v1.4.0 release

frxstrem added 2 commits February 5, 2020 15:22

Update operator_configuration_type.go

b1bd45a

Update operator_config.go

49080f4

ghost requested review from avaczi, CyberDem0n, erthalion, FxKu, Jan-M, RafiaSabih and sdudoladov as code owners February 5, 2020 14:40

erthalion merged commit 00f00af into zalando:master Feb 11, 2020

FxKu mentioned this pull request Feb 19, 2020

Operator cannot move pod, in case of using node_readiness_label #792

Open

FxKu added this to the 1.4 milestone Feb 20, 2020
