Amino acid dipeptide frequency for Actinidia seed borne latent virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
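The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used by the database: the function names are hypothetical, dipeptides are assumed to be counted as overlapping windows within each protein (never across protein boundaries), and any non-standard letter is assumed to map to Xaa (written X in one-letter code).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter code
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: overlapping windows inside each protein; letters outside
    the 20 standard amino acids are replaced by X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input, not the real proteome of the virus.
toy = ["MKKLAE", "MKE"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With the uniform expectation of 2.5‰, a table value such as LysLys (10.97‰) would be classified as overrepresented and AlaAla (1.415‰) as underrepresented.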

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
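Because the expected frequency is quoted in percent while the table is in per mille, a short conversion sketch may help. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0


ala_ala = 1.415                              # AlaAla from the table, in per mille
fraction = permille_to_fraction(ala_ala)     # about 0.001415
percent = permille_to_percent(ala_ala)       # about 0.1415 %
uniform_permille = 0.25 * 10.0               # 0.25 % expressed in per mille
print(fraction, percent, uniform_permille)
```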

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.415 ± 0.385
AlaCys: 1.062 ± 0.572
AlaAsp: 1.415 ± 1.309
AlaGlu: 1.769 ± 2.529
AlaPhe: 2.477 ± 1.334
AlaGly: 1.769 ± 0.954
AlaHis: 1.062 ± 0.572
AlaIle: 2.477 ± 0.898
AlaLys: 5.662 ± 0.913
AlaLeu: 4.954 ± 1.796
AlaMet: 1.415 ± 0.754
AlaAsn: 3.539 ± 1.619
AlaPro: 1.415 ± 0.93
AlaGln: 0.708 ± 0.381
AlaArg: 1.769 ± 1.409
AlaSer: 2.477 ± 0.768
AlaThr: 1.415 ± 1.309
AlaVal: 2.477 ± 1.155
AlaTrp: 0.354 ± 0.758
AlaTyr: 2.123 ± 0.609
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.708 ± 0.465
CysCys: 0.0 ± 0.0
CysAsp: 1.415 ± 0.93
CysGlu: 3.185 ± 2.508
CysPhe: 1.415 ± 0.762
CysGly: 1.415 ± 1.546
CysHis: 0.708 ± 0.381
CysIle: 1.062 ± 0.572
CysLys: 1.769 ± 0.814
CysLeu: 1.769 ± 0.472
CysMet: 0.708 ± 0.381
CysAsn: 1.062 ± 0.913
CysPro: 0.708 ± 0.381
CysGln: 0.0 ± 0.0
CysArg: 1.769 ± 0.558
CysSer: 2.123 ± 0.609
CysThr: 1.062 ± 0.572
CysVal: 1.769 ± 0.472
CysTrp: 0.0 ± 0.0
CysTyr: 0.708 ± 0.773
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.539 ± 1.299
AspCys: 0.708 ± 0.381
AspAsp: 3.539 ± 1.37
AspGlu: 6.723 ± 1.268
AspPhe: 2.831 ± 1.191
AspGly: 4.954 ± 0.856
AspHis: 0.354 ± 0.191
AspIle: 5.308 ± 1.846
AspLys: 2.123 ± 0.452
AspLeu: 6.016 ± 1.679
AspMet: 1.769 ± 0.472
AspAsn: 2.831 ± 1.204
AspPro: 1.062 ± 0.572
AspGln: 2.477 ± 1.459
AspArg: 2.831 ± 2.243
AspSer: 1.415 ± 0.596
AspThr: 2.477 ± 0.742
AspVal: 2.123 ± 1.143
AspTrp: 1.769 ± 0.953
AspTyr: 2.831 ± 0.698
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.539 ± 0.709
GluCys: 1.769 ± 0.829
GluAsp: 4.954 ± 1.674
GluGlu: 8.846 ± 2.36
GluPhe: 3.185 ± 0.84
GluGly: 3.185 ± 0.84
GluHis: 1.769 ± 1.5
GluIle: 4.954 ± 2.041
GluLys: 9.2 ± 3.305
GluLeu: 8.139 ± 1.01
GluMet: 1.415 ± 0.754
GluAsn: 3.539 ± 1.37
GluPro: 2.123 ± 0.609
GluGln: 1.415 ± 0.701
GluArg: 4.954 ± 2.383
GluSer: 6.369 ± 0.858
GluThr: 2.831 ± 0.698
GluVal: 8.846 ± 1.252
GluTrp: 1.062 ± 0.862
GluTyr: 3.539 ± 1.28
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.539 ± 1.191
PheCys: 2.477 ± 0.768
PheAsp: 4.246 ± 1.512
PheGlu: 6.723 ± 2.508
PhePhe: 3.539 ± 1.906
PheGly: 2.831 ± 1.94
PheHis: 0.0 ± 0.0
PheIle: 4.6 ± 2.477
PheLys: 3.539 ± 0.646
PheLeu: 4.6 ± 1.497
PheMet: 1.062 ± 0.739
PheAsn: 4.6 ± 1.827
PhePro: 1.415 ± 0.596
PheGln: 1.769 ± 0.472
PheArg: 2.477 ± 1.033
PheSer: 3.539 ± 1.37
PheThr: 2.123 ± 0.609
PheVal: 1.769 ± 0.472
PheTrp: 1.062 ± 0.382
PheTyr: 0.354 ± 0.191
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.062 ± 0.382
GlyCys: 1.062 ± 0.572
GlyAsp: 1.415 ± 0.762
GlyGlu: 3.539 ± 1.906
GlyPhe: 2.123 ± 1.02
GlyGly: 3.892 ± 2.215
GlyHis: 1.769 ± 0.814
GlyIle: 3.185 ± 1.253
GlyLys: 6.016 ± 1.848
GlyLeu: 3.892 ± 2.215
GlyMet: 0.708 ± 0.381
GlyAsn: 4.6 ± 1.827
GlyPro: 3.185 ± 0.912
GlyGln: 2.831 ± 0.763
GlyArg: 2.477 ± 0.88
GlySer: 4.954 ± 3.436
GlyThr: 2.123 ± 1.752
GlyVal: 2.477 ± 1.334
GlyTrp: 1.062 ± 0.739
GlyTyr: 1.062 ± 0.572
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.354 ± 0.191
HisCys: 0.708 ± 1.699
HisAsp: 3.185 ± 0.734
HisGlu: 1.769 ± 0.814
HisPhe: 2.123 ± 1.143
HisGly: 0.708 ± 0.381
HisHis: 0.708 ± 0.381
HisIle: 0.708 ± 0.381
HisLys: 1.062 ± 0.572
HisLeu: 1.769 ± 0.558
HisMet: 1.769 ± 0.953
HisAsn: 0.354 ± 0.191
HisPro: 1.415 ± 0.596
HisGln: 1.415 ± 0.596
HisArg: 1.062 ± 1.055
HisSer: 2.477 ± 0.768
HisThr: 0.708 ± 0.381
HisVal: 0.0 ± 0.0
HisTrp: 0.708 ± 0.381
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.308 ± 1.279
IleCys: 3.185 ± 0.84
IleAsp: 4.954 ± 2.364
IleGlu: 4.954 ± 2.085
IlePhe: 4.954 ± 1.72
IleGly: 2.477 ± 1.033
IleHis: 1.062 ± 0.572
IleIle: 3.185 ± 1.961
IleLys: 6.016 ± 2.699
IleLeu: 3.539 ± 2.324
IleMet: 1.769 ± 0.558
IleAsn: 4.954 ± 1.961
IlePro: 1.415 ± 0.762
IleGln: 1.769 ± 1.16
IleArg: 4.6 ± 0.882
IleSer: 4.6 ± 1.061
IleThr: 2.123 ± 1.143
IleVal: 6.369 ± 0.869
IleTrp: 0.354 ± 0.758
IleTyr: 2.123 ± 1.143
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.246 ± 3.41
LysCys: 1.415 ± 2.459
LysAsp: 5.662 ± 0.986
LysGlu: 9.2 ± 2.094
LysPhe: 4.6 ± 1.382
LysGly: 4.6 ± 1.373
LysHis: 1.769 ± 0.472
LysIle: 4.246 ± 1.172
LysLys: 10.97 ± 3.952
LysLeu: 7.077 ± 0.638
LysMet: 1.769 ± 1.124
LysAsn: 6.016 ± 1.85
LysPro: 1.769 ± 0.472
LysGln: 1.415 ± 0.762
LysArg: 5.308 ± 1.064
LysSer: 8.493 ± 0.764
LysThr: 2.831 ± 0.679
LysVal: 7.077 ± 1.541
LysTrp: 2.123 ± 0.609
LysTyr: 2.123 ± 0.763
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.539 ± 2.474
LeuCys: 1.062 ± 0.572
LeuAsp: 2.123 ± 0.452
LeuGlu: 4.954 ± 0.878
LeuPhe: 4.6 ± 1.22
LeuGly: 5.308 ± 2.487
LeuHis: 2.477 ± 1.289
LeuIle: 3.892 ± 0.55
LeuLys: 8.493 ± 2.364
LeuLeu: 5.662 ± 0.896
LeuMet: 1.415 ± 0.596
LeuAsn: 5.662 ± 1.518
LeuPro: 1.415 ± 0.385
LeuGln: 3.185 ± 1.233
LeuArg: 6.369 ± 1.469
LeuSer: 7.785 ± 1.822
LeuThr: 3.539 ± 1.096
LeuVal: 4.6 ± 1.494
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.477 ± 0.768
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.415 ± 0.596
MetCys: 1.062 ± 0.572
MetAsp: 1.415 ± 0.762
MetGlu: 2.831 ± 1.098
MetPhe: 1.769 ± 0.954
MetGly: 1.062 ± 0.572
MetHis: 1.415 ± 0.762
MetIle: 2.831 ± 0.638
MetLys: 2.831 ± 0.929
MetLeu: 2.123 ± 1.274
MetMet: 1.415 ± 0.754
MetAsn: 1.769 ± 0.653
MetPro: 0.708 ± 0.654
MetGln: 0.0 ± 0.0
MetArg: 1.062 ± 0.572
MetSer: 2.831 ± 2.475
MetThr: 1.769 ± 0.653
MetVal: 1.769 ± 1.5
MetTrp: 0.0 ± 0.0
MetTyr: 0.354 ± 0.849
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.769 ± 0.829
AsnCys: 2.831 ± 1.204
AsnAsp: 2.477 ± 1.334
AsnGlu: 4.246 ± 1.527
AsnPhe: 3.185 ± 0.987
AsnGly: 3.892 ± 0.998
AsnHis: 1.415 ± 0.754
AsnIle: 3.892 ± 1.932
AsnLys: 6.016 ± 1.865
AsnLeu: 6.016 ± 2.104
AsnMet: 1.415 ± 0.754
AsnAsn: 2.123 ± 0.609
AsnPro: 1.769 ± 0.829
AsnGln: 1.769 ± 0.814
AsnArg: 2.831 ± 0.698
AsnSer: 6.723 ± 3.407
AsnThr: 1.415 ± 1.309
AsnVal: 4.954 ± 2.34
AsnTrp: 0.708 ± 0.773
AsnTyr: 2.123 ± 1.549
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.354 ± 0.758
ProCys: 0.0 ± 0.0
ProAsp: 1.415 ± 0.762
ProGlu: 1.769 ± 0.472
ProPhe: 2.477 ± 0.768
ProGly: 1.769 ± 0.829
ProHis: 0.354 ± 0.191
ProIle: 2.477 ± 1.534
ProLys: 2.477 ± 0.742
ProLeu: 2.123 ± 0.609
ProMet: 1.769 ± 0.534
ProAsn: 2.123 ± 0.756
ProPro: 0.708 ± 0.381
ProGln: 0.708 ± 0.654
ProArg: 1.062 ± 0.572
ProSer: 1.769 ± 1.895
ProThr: 1.062 ± 0.382
ProVal: 1.769 ± 0.829
ProTrp: 0.0 ± 0.0
ProTyr: 0.708 ± 0.381
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.062 ± 0.382
GlnCys: 1.062 ± 0.382
GlnAsp: 1.769 ± 0.558
GlnGlu: 2.477 ± 1.155
GlnPhe: 0.354 ± 0.191
GlnGly: 0.708 ± 0.773
GlnHis: 0.354 ± 0.191
GlnIle: 2.831 ± 0.638
GlnLys: 2.477 ± 0.412
GlnLeu: 1.769 ± 0.653
GlnMet: 1.062 ± 0.913
GlnAsn: 1.769 ± 0.814
GlnPro: 1.062 ± 0.572
GlnGln: 0.354 ± 0.599
GlnArg: 1.415 ± 0.596
GlnSer: 3.185 ± 1.253
GlnThr: 1.062 ± 0.739
GlnVal: 0.708 ± 0.465
GlnTrp: 0.354 ± 0.599
GlnTyr: 0.354 ± 0.191
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.123 ± 1.478
ArgCys: 1.415 ± 0.776
ArgAsp: 1.769 ± 0.814
ArgGlu: 5.662 ± 0.903
ArgPhe: 3.892 ± 0.53
ArgGly: 3.185 ± 1.233
ArgHis: 1.062 ± 0.596
ArgIle: 3.892 ± 1.4
ArgLys: 4.246 ± 0.736
ArgLeu: 5.662 ± 0.913
ArgMet: 2.477 ± 0.412
ArgAsn: 1.769 ± 1.16
ArgPro: 0.354 ± 0.599
ArgGln: 1.062 ± 0.862
ArgArg: 3.185 ± 0.681
ArgSer: 3.892 ± 2.152
ArgThr: 3.185 ± 0.987
ArgVal: 3.539 ± 0.646
ArgTrp: 1.062 ± 0.382
ArgTyr: 2.831 ± 1.175
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.185 ± 2.166
SerCys: 1.062 ± 0.913
SerAsp: 5.308 ± 1.012
SerGlu: 5.308 ± 3.292
SerPhe: 5.662 ± 1.859
SerGly: 4.246 ± 2.223
SerHis: 2.123 ± 0.756
SerIle: 8.493 ± 2.382
SerLys: 6.016 ± 1.434
SerLeu: 4.954 ± 1.961
SerMet: 1.415 ± 1.035
SerAsn: 6.369 ± 0.586
SerPro: 1.415 ± 0.596
SerGln: 2.123 ± 0.609
SerArg: 3.892 ± 0.998
SerSer: 4.954 ± 0.825
SerThr: 2.477 ± 1.155
SerVal: 4.246 ± 1.154
SerTrp: 0.354 ± 0.191
SerTyr: 3.185 ± 1.715
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.415 ± 0.754
ThrCys: 0.354 ± 0.191
ThrAsp: 1.415 ± 0.596
ThrGlu: 2.477 ± 0.742
ThrPhe: 4.246 ± 2.287
ThrGly: 2.477 ± 0.837
ThrHis: 0.708 ± 0.654
ThrIle: 3.185 ± 0.681
ThrLys: 3.539 ± 1.656
ThrLeu: 2.831 ± 1.039
ThrMet: 1.415 ± 1.309
ThrAsn: 2.831 ± 1.449
ThrPro: 1.415 ± 0.385
ThrGln: 0.708 ± 0.381
ThrArg: 2.831 ± 1.525
ThrSer: 1.769 ± 0.558
ThrThr: 1.062 ± 0.862
ThrVal: 2.123 ± 1.725
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.354 ± 0.758
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.769 ± 0.814
ValCys: 1.062 ± 0.596
ValAsp: 5.662 ± 0.903
ValGlu: 5.308 ± 1.608
ValPhe: 2.123 ± 0.993
ValGly: 2.831 ± 0.94
ValHis: 2.831 ± 0.94
ValIle: 3.892 ± 0.53
ValLys: 7.431 ± 2.012
ValLeu: 2.477 ± 0.768
ValMet: 3.892 ± 1.576
ValAsn: 3.892 ± 1.859
ValPro: 2.123 ± 2.849
ValGln: 1.415 ± 0.754
ValArg: 3.185 ± 0.588
ValSer: 3.892 ± 0.976
ValThr: 2.831 ± 0.94
ValVal: 3.539 ± 1.299
ValTrp: 0.0 ± 0.0
ValTyr: 1.415 ± 0.762
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.708 ± 0.465
TrpGlu: 1.415 ± 0.93
TrpPhe: 0.354 ± 0.191
TrpGly: 0.354 ± 0.599
TrpHis: 0.354 ± 0.191
TrpIle: 1.062 ± 0.596
TrpLys: 0.354 ± 0.758
TrpLeu: 1.062 ± 0.572
TrpMet: 1.062 ± 0.739
TrpAsn: 0.354 ± 0.191
TrpPro: 0.708 ± 0.381
TrpGln: 0.0 ± 0.0
TrpArg: 1.415 ± 0.596
TrpSer: 0.708 ± 0.465
TrpThr: 0.0 ± 0.0
TrpVal: 0.708 ± 0.773
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.354 ± 0.191
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.415 ± 0.385
TyrCys: 1.062 ± 0.382
TyrAsp: 2.831 ± 0.94
TyrGlu: 2.477 ± 1.334
TyrPhe: 0.354 ± 0.191
TyrGly: 1.769 ± 0.667
TyrHis: 0.708 ± 0.381
TyrIle: 3.539 ± 1.307
TyrLys: 2.477 ± 2.316
TyrLeu: 2.123 ± 0.96
TyrMet: 0.354 ± 0.191
TyrAsn: 1.415 ± 0.762
TyrPro: 0.708 ± 0.381
TyrGln: 1.062 ± 0.739
TyrArg: 1.769 ± 0.953
TyrSer: 2.831 ± 0.456
TyrThr: 1.062 ± 0.572
TyrVal: 1.062 ± 0.572
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.062 ± 0.596
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2827 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
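A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: the function names are hypothetical, and it assumes that each replicate resamples whole proteins with replacement, recomputes the dipeptide frequency, and reports the standard deviation across replicates as the error.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per mille frequency of each dipeptide (overlapping windows, assumed)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of one dipeptide frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(replicates):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(dipeptide_permille(resampled).get(pair, 0.0))
    return statistics.pstdev(samples)


# Toy sequences, not the real proteome of the virus.
toy_proteome = ["MKKLAEELLK", "MSEKKVLA", "MKELLSKK", "MAKKE"]
kk_error = bootstrap_error(toy_proteome, "KK")
print(round(kk_error, 3))
```

With only four proteins, as in this proteome, each replicate can differ substantially from the original set, which is consistent with errors in the table that are sometimes as large as the values themselves.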

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski