Amino acid dipeptide frequency for Schlumbergera virus X

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
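As a minimal sketch of how such frequencies could be computed (not the Proteome-pI implementation, which is not described here), the following Python function counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the choice not to count dipeptides across protein boundaries, and the toy sequences are assumptions made for illustration only.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: dipeptides are counted inside each protein only,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map every residue outside the standard set to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (hypothetical sequences, not the Schlumbergera virus X proteome).
freqs = dipeptide_per_mille(["MALLP", "ALB"])
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %
```

In the toy input, "AL" occurs twice among six dipeptides, so its frequency is far above the uniform expectation of 2.5‰; real proteomes show smaller but systematic deviations, as in the table.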

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
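The unit conversion can be illustrated with a short Python sketch. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and comparing against the uniform expectation of 1/400 is an assumption about how "more than expected" is decided; the page does not state the exact threshold.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 2.5 per mille (0.25 %)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the assumed expectation."""
    if value > expected:
        return "over"      # would be marked in red
    if value < expected:
        return "under"     # would be marked in blue
    return "expected"


# Two values taken from the table: LeuPro (10.609) and CysCys (0.461).
leu_pro_fraction = per_mille_to_fraction(10.609)  # about 0.0106, i.e. about 1.06 %
leu_pro_label = classify(10.609)
cys_cys_label = classify(0.461)
```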

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.151 ± 3.162
AlaCys: 0.461 ± 0.245
AlaAsp: 2.306 ± 0.82
AlaGlu: 2.768 ± 0.5
AlaPhe: 4.613 ± 1.498
AlaGly: 3.69 ± 1.428
AlaHis: 2.306 ± 1.226
AlaIle: 2.768 ± 1.35
AlaLys: 3.229 ± 0.94
AlaLeu: 8.303 ± 2.187
AlaMet: 1.845 ± 0.825
AlaAsn: 3.69 ± 1.159
AlaPro: 4.151 ± 1.678
AlaGln: 5.535 ± 3.384
AlaArg: 3.69 ± 1.962
AlaSer: 4.151 ± 1.422
AlaThr: 2.768 ± 0.5
AlaVal: 5.535 ± 1.586
AlaTrp: 2.306 ± 1.525
AlaTyr: 2.768 ± 0.874
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.384 ± 0.683
CysCys: 0.461 ± 0.245
CysAsp: 0.0 ± 0.0
CysGlu: 0.923 ± 0.49
CysPhe: 0.461 ± 0.977
CysGly: 0.923 ± 0.85
CysHis: 0.0 ± 0.0
CysIle: 0.461 ± 0.245
CysLys: 0.461 ± 0.245
CysLeu: 0.461 ± 0.245
CysMet: 0.0 ± 0.0
CysAsn: 0.923 ± 0.85
CysPro: 0.923 ± 1.722
CysGln: 0.923 ± 0.737
CysArg: 0.461 ± 0.977
CysSer: 1.845 ± 0.981
CysThr: 1.384 ± 2.902
CysVal: 0.923 ± 0.49
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.845 ± 1.475
AspCys: 0.461 ± 0.245
AspAsp: 2.768 ± 1.471
AspGlu: 3.229 ± 1.167
AspPhe: 4.613 ± 0.867
AspGly: 4.151 ± 1.491
AspHis: 0.923 ± 0.49
AspIle: 1.845 ± 0.981
AspLys: 1.845 ± 0.714
AspLeu: 4.151 ± 1.872
AspMet: 0.461 ± 0.468
AspAsn: 1.384 ± 0.719
AspPro: 4.151 ± 1.591
AspGln: 2.768 ± 0.944
AspArg: 1.384 ± 0.736
AspSer: 3.229 ± 1.536
AspThr: 1.845 ± 0.981
AspVal: 1.845 ± 0.981
AspTrp: 0.461 ± 0.245
AspTyr: 1.384 ± 0.736
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.458 ± 2.309
GluCys: 0.461 ± 0.245
GluAsp: 3.229 ± 0.442
GluGlu: 4.613 ± 2.452
GluPhe: 1.845 ± 0.714
GluGly: 2.768 ± 0.978
GluHis: 0.923 ± 0.49
GluIle: 3.229 ± 1.717
GluLys: 6.458 ± 2.591
GluLeu: 5.074 ± 1.65
GluMet: 1.384 ± 0.573
GluAsn: 5.535 ± 2.277
GluPro: 2.306 ± 0.82
GluGln: 1.384 ± 0.719
GluArg: 1.384 ± 0.719
GluSer: 2.306 ± 1.699
GluThr: 5.535 ± 1.313
GluVal: 3.69 ± 1.962
GluTrp: 0.923 ± 0.49
GluTyr: 0.461 ± 0.861
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.69 ± 2.715
PheCys: 0.923 ± 0.822
PheAsp: 4.613 ± 2.103
PheGlu: 3.69 ± 0.51
PhePhe: 2.306 ± 0.749
PheGly: 1.384 ± 0.736
PheHis: 1.384 ± 0.736
PheIle: 2.306 ± 0.652
PheLys: 3.229 ± 1.042
PheLeu: 5.074 ± 1.581
PheMet: 1.384 ± 0.736
PheAsn: 2.306 ± 0.82
PhePro: 1.384 ± 0.813
PheGln: 1.384 ± 0.736
PheArg: 0.923 ± 0.49
PheSer: 4.151 ± 1.678
PheThr: 0.923 ± 0.49
PheVal: 1.384 ± 0.813
PheTrp: 1.845 ± 0.981
PheTyr: 0.461 ± 0.245
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.535 ± 2.513
GlyCys: 0.923 ± 1.396
GlyAsp: 4.151 ± 1.678
GlyGlu: 3.229 ± 1.167
GlyPhe: 1.384 ± 0.736
GlyGly: 1.384 ± 0.813
GlyHis: 2.768 ± 1.333
GlyIle: 2.306 ± 1.462
GlyLys: 2.768 ± 0.978
GlyLeu: 4.151 ± 3.14
GlyMet: 0.461 ± 0.245
GlyAsn: 0.461 ± 0.977
GlyPro: 1.384 ± 0.719
GlyGln: 2.306 ± 0.82
GlyArg: 2.768 ± 1.094
GlySer: 5.535 ± 1.709
GlyThr: 4.151 ± 2.054
GlyVal: 4.613 ± 2.314
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.923 ± 0.737
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.151 ± 1.174
HisCys: 1.384 ± 1.786
HisAsp: 0.461 ± 0.245
HisGlu: 0.923 ± 0.49
HisPhe: 3.69 ± 1.428
HisGly: 2.768 ± 0.944
HisHis: 1.845 ± 0.981
HisIle: 0.923 ± 0.49
HisLys: 2.306 ± 1.226
HisLeu: 4.151 ± 1.628
HisMet: 0.461 ± 0.245
HisAsn: 0.461 ± 0.245
HisPro: 3.229 ± 1.539
HisGln: 1.845 ± 0.981
HisArg: 1.845 ± 1.58
HisSer: 3.229 ± 1.536
HisThr: 1.384 ± 0.683
HisVal: 0.923 ± 1.388
HisTrp: 0.0 ± 0.0
HisTyr: 0.461 ± 0.245
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.613 ± 1.692
IleCys: 0.0 ± 0.0
IleAsp: 1.384 ± 0.736
IleGlu: 5.074 ± 2.067
IlePhe: 2.768 ± 1.625
IleGly: 4.151 ± 1.247
IleHis: 1.845 ± 0.848
IleIle: 3.229 ± 4.092
IleLys: 4.151 ± 1.591
IleLeu: 5.535 ± 1.718
IleMet: 1.384 ± 0.736
IleAsn: 3.229 ± 1.167
IlePro: 2.768 ± 0.874
IleGln: 3.229 ± 1.384
IleArg: 1.384 ± 2.582
IleSer: 3.69 ± 2.018
IleThr: 3.69 ± 1.237
IleVal: 2.768 ± 1.35
IleTrp: 0.461 ± 0.977
IleTyr: 1.384 ± 1.22
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.535 ± 1.957
LysCys: 0.923 ± 0.49
LysAsp: 3.69 ± 1.962
LysGlu: 4.151 ± 1.591
LysPhe: 1.384 ± 0.719
LysGly: 3.69 ± 1.699
LysHis: 1.845 ± 0.848
LysIle: 4.613 ± 1.666
LysLys: 2.768 ± 1.471
LysLeu: 6.919 ± 2.021
LysMet: 1.384 ± 0.683
LysAsn: 2.306 ± 0.749
LysPro: 3.229 ± 1.912
LysGln: 2.768 ± 1.471
LysArg: 2.768 ± 1.471
LysSer: 3.69 ± 0.982
LysThr: 5.535 ± 1.957
LysVal: 3.69 ± 0.982
LysTrp: 0.0 ± 0.0
LysTyr: 0.461 ± 0.245
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.458 ± 1.782
LeuCys: 0.461 ± 0.861
LeuAsp: 2.768 ± 2.033
LeuGlu: 6.458 ± 1.226
LeuPhe: 3.69 ± 0.982
LeuGly: 5.996 ± 2.817
LeuHis: 3.229 ± 1.376
LeuIle: 6.458 ± 3.732
LeuLys: 8.303 ± 3.181
LeuLeu: 4.613 ± 1.801
LeuMet: 0.461 ± 0.245
LeuAsn: 4.613 ± 6.961
LeuPro: 10.609 ± 2.182
LeuGln: 3.69 ± 0.51
LeuArg: 5.074 ± 1.892
LeuSer: 6.458 ± 3.067
LeuThr: 9.686 ± 3.454
LeuVal: 5.996 ± 2.964
LeuTrp: 0.461 ± 0.245
LeuTyr: 4.151 ± 1.732
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.384 ± 0.736
MetCys: 0.461 ± 0.245
MetAsp: 1.384 ± 1.067
MetGlu: 0.461 ± 0.245
MetPhe: 1.384 ± 1.314
MetGly: 1.384 ± 0.736
MetHis: 0.0 ± 0.0
MetIle: 1.384 ± 0.683
MetLys: 1.384 ± 0.736
MetLeu: 1.845 ± 0.981
MetMet: 0.0 ± 0.0
MetAsn: 0.923 ± 0.49
MetPro: 1.845 ± 0.714
MetGln: 0.923 ± 0.49
MetArg: 1.384 ± 0.736
MetSer: 1.384 ± 1.328
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 2.306 ± 1.226
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.229 ± 1.318
AsnCys: 1.384 ± 0.736
AsnAsp: 2.306 ± 1.226
AsnGlu: 2.768 ± 1.439
AsnPhe: 1.845 ± 0.981
AsnGly: 2.306 ± 1.905
AsnHis: 2.306 ± 2.677
AsnIle: 2.768 ± 0.978
AsnLys: 1.845 ± 0.692
AsnLeu: 2.306 ± 1.905
AsnMet: 1.384 ± 0.736
AsnAsn: 0.923 ± 0.49
AsnPro: 4.613 ± 0.867
AsnGln: 1.384 ± 0.719
AsnArg: 1.845 ± 1.168
AsnSer: 4.613 ± 1.801
AsnThr: 1.845 ± 1.274
AsnVal: 1.845 ± 1.644
AsnTrp: 0.461 ± 0.245
AsnTyr: 3.229 ± 1.691
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.306 ± 0.749
ProCys: 0.461 ± 0.245
ProAsp: 5.074 ± 1.492
ProGlu: 5.535 ± 2.877
ProPhe: 1.384 ± 0.719
ProGly: 1.845 ± 0.714
ProHis: 2.768 ± 2.44
ProIle: 5.074 ± 1.518
ProLys: 5.535 ± 1.03
ProLeu: 7.841 ± 2.869
ProMet: 1.384 ± 0.719
ProAsn: 3.69 ± 1.053
ProPro: 4.151 ± 1.676
ProGln: 0.923 ± 0.49
ProArg: 1.384 ± 0.736
ProSer: 5.074 ± 3.059
ProThr: 4.613 ± 1.666
ProVal: 3.229 ± 1.539
ProTrp: 0.923 ± 0.49
ProTyr: 1.384 ± 0.736
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.69 ± 1.237
GlnCys: 0.0 ± 0.0
GlnAsp: 1.845 ± 0.981
GlnGlu: 2.306 ± 0.82
GlnPhe: 1.845 ± 0.692
GlnGly: 2.768 ± 2.273
GlnHis: 2.306 ± 0.82
GlnIle: 1.845 ± 0.981
GlnLys: 1.384 ± 1.789
GlnLeu: 5.535 ± 1.499
GlnMet: 0.923 ± 0.49
GlnAsn: 0.0 ± 0.0
GlnPro: 4.613 ± 1.443
GlnGln: 1.845 ± 0.714
GlnArg: 1.845 ± 0.714
GlnSer: 4.151 ± 2.207
GlnThr: 4.613 ± 1.829
GlnVal: 1.845 ± 0.692
GlnTrp: 1.384 ± 0.736
GlnTyr: 1.384 ± 1.968
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.384 ± 0.719
ArgCys: 0.461 ± 0.861
ArgAsp: 2.768 ± 0.944
ArgGlu: 2.306 ± 0.82
ArgPhe: 0.923 ± 0.49
ArgGly: 2.306 ± 0.948
ArgHis: 2.768 ± 1.644
ArgIle: 3.229 ± 1.042
ArgLys: 2.306 ± 1.4
ArgLeu: 5.996 ± 1.39
ArgMet: 0.0 ± 0.0
ArgAsn: 2.768 ± 0.874
ArgPro: 0.461 ± 0.245
ArgGln: 3.229 ± 0.442
ArgArg: 2.306 ± 1.226
ArgSer: 1.384 ± 1.786
ArgThr: 3.69 ± 1.373
ArgVal: 1.845 ± 0.692
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.845 ± 0.692
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.768 ± 2.467
SerCys: 0.461 ± 1.514
SerAsp: 1.845 ± 0.714
SerGlu: 3.229 ± 1.042
SerPhe: 3.229 ± 1.717
SerGly: 2.306 ± 0.948
SerHis: 2.768 ± 0.944
SerIle: 3.229 ± 0.442
SerLys: 3.69 ± 0.51
SerLeu: 8.764 ± 2.43
SerMet: 1.845 ± 1.256
SerAsn: 2.768 ± 3.489
SerPro: 5.535 ± 3.423
SerGln: 4.151 ± 1.422
SerArg: 5.535 ± 2.23
SerSer: 7.841 ± 2.182
SerThr: 5.996 ± 3.201
SerVal: 5.074 ± 1.225
SerTrp: 0.923 ± 0.737
SerTyr: 2.306 ± 0.82
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.229 ± 1.042
ThrCys: 0.923 ± 0.737
ThrAsp: 1.384 ± 0.683
ThrGlu: 3.229 ± 1.167
ThrPhe: 3.229 ± 1.717
ThrGly: 2.306 ± 1.226
ThrHis: 3.69 ± 1.469
ThrIle: 4.613 ± 2.726
ThrLys: 3.69 ± 1.677
ThrLeu: 9.686 ± 3.472
ThrMet: 2.306 ± 0.823
ThrAsn: 3.69 ± 1.159
ThrPro: 5.535 ± 0.783
ThrGln: 3.69 ± 2.172
ThrArg: 1.384 ± 1.22
ThrSer: 4.613 ± 1.946
ThrThr: 5.996 ± 1.737
ThrVal: 4.151 ± 1.34
ThrTrp: 0.923 ± 0.49
ThrTyr: 3.229 ± 1.272
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.229 ± 0.946
ValCys: 0.923 ± 0.49
ValAsp: 1.384 ± 0.736
ValGlu: 3.69 ± 1.428
ValPhe: 2.768 ± 1.362
ValGly: 3.229 ± 1.952
ValHis: 2.306 ± 0.652
ValIle: 4.613 ± 1.279
ValLys: 2.306 ± 0.82
ValLeu: 5.074 ± 2.019
ValMet: 1.384 ± 0.736
ValAsn: 2.768 ± 0.961
ValPro: 1.845 ± 1.644
ValGln: 1.384 ± 0.736
ValArg: 3.69 ± 0.51
ValSer: 3.69 ± 1.83
ValThr: 4.613 ± 3.06
ValVal: 3.69 ± 5.037
ValTrp: 0.461 ± 0.245
ValTyr: 1.845 ± 0.848
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.768 ± 0.874
TrpCys: 0.0 ± 0.0
TrpAsp: 1.384 ± 0.683
TrpGlu: 0.923 ± 0.822
TrpPhe: 0.461 ± 0.245
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.461 ± 0.245
TrpLeu: 1.384 ± 0.736
TrpMet: 0.0 ± 0.0
TrpAsn: 1.384 ± 0.719
TrpPro: 0.461 ± 0.245
TrpGln: 1.384 ± 0.736
TrpArg: 0.0 ± 0.0
TrpSer: 0.461 ± 0.245
TrpThr: 0.0 ± 0.0
TrpVal: 0.461 ± 0.245
TrpTrp: 0.461 ± 0.245
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.229 ± 1.717
TyrCys: 1.384 ± 1.067
TyrAsp: 0.461 ± 0.245
TyrGlu: 0.461 ± 0.245
TyrPhe: 0.923 ± 0.822
TyrGly: 1.845 ± 0.714
TyrHis: 0.461 ± 0.245
TyrIle: 1.845 ± 1.475
TyrLys: 3.229 ± 1.042
TyrLeu: 2.768 ± 1.471
TyrMet: 0.923 ± 0.49
TyrAsn: 1.384 ± 0.736
TyrPro: 1.384 ± 1.584
TyrGln: 1.384 ± 0.813
TyrArg: 0.923 ± 1.512
TyrSer: 2.768 ± 2.446
TyrThr: 3.229 ± 1.042
TyrVal: 1.384 ± 0.813
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.923 ± 0.85
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2169 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
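One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and take the spread of those estimates as the error. The sketch below follows that reading; it is an assumption, not the documented Proteome-pI procedure, and the function names, the use of the sample standard deviation, and the fixed seed are all illustrative choices.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency(sequences, dipeptide):
    """Frequency of one dipeptide, in per mille, over a list of proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_frequency(resample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
proteome = ["MALLPALT", "MSSLPT", "MKLLE", "MALT", "MTLPS"]
error_lp = bootstrap_error(proteome, "LP")
```

With only five proteins, as in this proteome, each resample can differ considerably from the original set, which is consistent with errors in the table that are sometimes as large as the frequencies themselves.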

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski