Amino acid dipeptide frequency for Oropouche virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5 ‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
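The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_per_mille`, the one-letter alphabet, the toy sequences, and the choice to count overlapping pairs within each protein and normalise by the total number of pairs are all assumptions made here.

```python
from collections import Counter
from itertools import product

# The 20 standard residues (one-letter codes); anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before counting.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins (assumption).
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input for illustration only, not Oropouche virus sequences.
toy = ["MKKLA", "AKKB"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25 %).
uniform = 1000.0 / 400
overrepresented = sorted(dp for dp, f in freqs.items() if f > uniform)
```

Comparing each value with `uniform` mirrors the red/blue marking described above; the exact normalisation and the treatment of protein boundaries used by the database are not stated here and may differ.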

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
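As a worked example of the conversion, take the SerLys entry from the table (8.264 per mille) and compare it with the uniform expectation over the 400 standard dipeptides:

```latex
\[
8.264 \text{ per mille} = 8.264 \times 10^{-3} = 0.008264 = 0.8264\,\%
\]
\[
\frac{1}{400} = 0.0025 = 2.5 \text{ per mille} = 0.25\,\%
\]
```

A table value above 2.5 therefore corresponds to a dipeptide that is more frequent than the uniform expectation, and a value below 2.5 to one that is less frequent.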

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue; second residue in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.005 ± 2.242
AlaCys: 1.002 ± 0.305
AlaAsp: 3.005 ± 0.914
AlaGlu: 4.508 ± 1.946
AlaPhe: 3.256 ± 1.464
AlaGly: 2.004 ± 1.12
AlaHis: 1.753 ± 0.451
AlaIle: 3.506 ± 1.067
AlaLys: 3.005 ± 0.853
AlaLeu: 4.508 ± 1.194
AlaMet: 2.004 ± 0.993
AlaAsn: 1.753 ± 0.734
AlaPro: 0.0 ± 0.0
AlaGln: 1.503 ± 0.457
AlaArg: 2.755 ± 2.353
AlaSer: 1.503 ± 0.457
AlaThr: 1.753 ± 0.361
AlaVal: 1.753 ± 1.106
AlaTrp: 0.751 ± 0.545
AlaTyr: 2.254 ± 0.728
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.503 ± 0.457
CysCys: 0.0 ± 0.0
CysAsp: 0.501 ± 0.152
CysGlu: 1.252 ± 0.829
CysPhe: 1.002 ± 0.939
CysGly: 3.005 ± 1.788
CysHis: 0.501 ± 0.47
CysIle: 2.755 ± 1.24
CysLys: 2.504 ± 1.327
CysLeu: 2.755 ± 1.24
CysMet: 1.002 ± 0.307
CysAsn: 1.753 ± 0.657
CysPro: 1.252 ± 0.51
CysGln: 1.002 ± 0.975
CysArg: 1.002 ± 0.941
CysSer: 2.504 ± 0.872
CysThr: 1.753 ± 1.452
CysVal: 0.751 ± 0.704
CysTrp: 0.0 ± 0.0
CysTyr: 0.751 ± 0.191
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.252 ± 0.455
AspCys: 1.002 ± 0.596
AspAsp: 3.506 ± 1.194
AspGlu: 3.506 ± 0.868
AspPhe: 3.506 ± 1.486
AspGly: 1.503 ± 1.118
AspHis: 0.25 ± 0.15
AspIle: 7.764 ± 1.27
AspLys: 5.76 ± 1.132
AspLeu: 6.762 ± 1.011
AspMet: 2.504 ± 0.676
AspAsn: 2.755 ± 0.466
AspPro: 1.503 ± 0.82
AspGln: 1.252 ± 0.445
AspArg: 1.753 ± 0.734
AspSer: 2.755 ± 0.746
AspThr: 3.005 ± 1.686
AspVal: 3.506 ± 1.368
AspTrp: 0.501 ± 0.47
AspTyr: 2.755 ± 0.794
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.757 ± 1.466
GluCys: 1.753 ± 0.657
GluAsp: 4.508 ± 1.499
GluGlu: 3.757 ± 0.929
GluPhe: 5.51 ± 1.1
GluGly: 1.753 ± 0.657
GluHis: 2.004 ± 0.874
GluIle: 7.764 ± 1.26
GluLys: 4.758 ± 1.638
GluLeu: 5.009 ± 1.351
GluMet: 2.755 ± 0.846
GluAsn: 2.004 ± 0.493
GluPro: 2.254 ± 0.75
GluGln: 2.755 ± 0.231
GluArg: 2.755 ± 0.679
GluSer: 4.007 ± 0.582
GluThr: 2.004 ± 0.609
GluVal: 2.254 ± 0.409
GluTrp: 0.0 ± 0.0
GluTyr: 2.504 ± 0.622
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.755 ± 1.476
PheCys: 2.254 ± 1.03
PheAsp: 3.005 ± 0.252
PheGlu: 4.007 ± 0.799
PhePhe: 2.004 ± 1.115
PheGly: 2.755 ± 1.253
PheHis: 1.002 ± 0.307
PheIle: 3.506 ± 0.292
PheLys: 4.758 ± 1.348
PheLeu: 5.009 ± 2.019
PheMet: 1.002 ± 0.512
PheAsn: 2.755 ± 0.391
PhePro: 1.252 ± 1.149
PheGln: 1.503 ± 0.527
PheArg: 2.254 ± 0.409
PheSer: 4.758 ± 0.938
PheThr: 3.757 ± 0.703
PheVal: 1.753 ± 0.734
PheTrp: 0.25 ± 0.15
PheTyr: 1.753 ± 0.657
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.751 ± 0.627
GlyCys: 2.004 ± 0.869
GlyAsp: 2.755 ± 0.803
GlyGlu: 3.256 ± 1.056
GlyPhe: 2.004 ± 0.993
GlyGly: 1.002 ± 0.56
GlyHis: 1.252 ± 0.51
GlyIle: 3.005 ± 0.676
GlyLys: 2.504 ± 0.622
GlyLeu: 4.508 ± 0.911
GlyMet: 0.751 ± 0.627
GlyAsn: 2.755 ± 0.679
GlyPro: 2.254 ± 0.721
GlyGln: 2.504 ± 0.911
GlyArg: 2.254 ± 0.597
GlySer: 3.757 ± 1.433
GlyThr: 2.254 ± 1.076
GlyVal: 2.004 ± 0.993
GlyTrp: 0.751 ± 0.366
GlyTyr: 1.753 ± 0.361
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.252 ± 0.51
HisCys: 1.252 ± 0.829
HisAsp: 0.501 ± 0.3
HisGlu: 1.252 ± 0.51
HisPhe: 2.254 ± 0.484
HisGly: 1.002 ± 0.6
HisHis: 0.751 ± 0.366
HisIle: 0.501 ± 0.47
HisLys: 1.753 ± 0.734
HisLeu: 2.254 ± 2.967
HisMet: 0.501 ± 1.052
HisAsn: 2.755 ± 0.654
HisPro: 1.002 ± 0.305
HisGln: 0.0 ± 0.0
HisArg: 1.503 ± 2.052
HisSer: 2.755 ± 0.728
HisThr: 1.002 ± 0.596
HisVal: 1.503 ± 0.457
HisTrp: 0.501 ± 0.3
HisTyr: 1.252 ± 0.311
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.256 ± 1.322
IleCys: 3.506 ± 1.922
IleAsp: 4.508 ± 1.701
IleGlu: 7.764 ± 1.378
IlePhe: 4.758 ± 0.938
IleGly: 4.758 ± 1.823
IleHis: 2.504 ± 0.889
IleIle: 7.012 ± 2.365
IleLys: 6.511 ± 0.969
IleLeu: 7.764 ± 1.969
IleMet: 2.254 ± 0.399
IleAsn: 5.76 ± 1.865
IlePro: 2.755 ± 0.746
IleGln: 2.755 ± 0.728
IleArg: 3.005 ± 0.681
IleSer: 6.762 ± 1.677
IleThr: 5.259 ± 0.665
IleVal: 4.257 ± 0.803
IleTrp: 1.002 ± 0.6
IleTyr: 2.504 ± 0.676
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.508 ± 1.026
LysCys: 2.004 ± 1.113
LysAsp: 5.259 ± 0.745
LysGlu: 5.009 ± 1.524
LysPhe: 4.508 ± 1.144
LysGly: 4.007 ± 0.391
LysHis: 1.252 ± 0.75
LysIle: 7.764 ± 2.473
LysLys: 6.011 ± 0.501
LysLeu: 7.513 ± 2.105
LysMet: 2.504 ± 0.758
LysAsn: 4.007 ± 1.219
LysPro: 2.004 ± 0.493
LysGln: 1.252 ± 0.489
LysArg: 3.506 ± 1.245
LysSer: 6.511 ± 1.66
LysThr: 5.76 ± 0.615
LysVal: 4.007 ± 0.391
LysTrp: 1.002 ± 0.6
LysTyr: 2.254 ± 0.572
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.007 ± 1.49
LeuCys: 1.252 ± 0.829
LeuAsp: 5.76 ± 1.755
LeuGlu: 6.511 ± 0.865
LeuPhe: 4.758 ± 1.348
LeuGly: 4.257 ± 0.803
LeuHis: 2.504 ± 2.348
LeuIle: 6.762 ± 1.288
LeuLys: 6.511 ± 0.442
LeuLeu: 7.764 ± 2.02
LeuMet: 2.504 ± 0.889
LeuAsn: 5.76 ± 1.227
LeuPro: 3.506 ± 0.589
LeuGln: 3.256 ± 0.941
LeuArg: 2.755 ± 1.946
LeuSer: 7.263 ± 1.281
LeuThr: 7.012 ± 0.688
LeuVal: 3.256 ± 2.067
LeuTrp: 0.751 ± 0.45
LeuTyr: 2.504 ± 0.622
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.751 ± 0.191
MetCys: 1.753 ± 0.657
MetAsp: 1.252 ± 0.455
MetGlu: 0.751 ± 0.704
MetPhe: 0.501 ± 0.627
MetGly: 1.252 ± 0.51
MetHis: 0.751 ± 0.545
MetIle: 2.004 ± 0.874
MetLys: 2.254 ± 1.165
MetLeu: 1.753 ± 0.944
MetMet: 0.0 ± 0.0
MetAsn: 1.002 ± 0.307
MetPro: 1.753 ± 0.734
MetGln: 1.252 ± 0.445
MetArg: 1.503 ± 1.09
MetSer: 4.007 ± 1.188
MetThr: 2.254 ± 0.572
MetVal: 1.753 ± 1.031
MetTrp: 0.501 ± 1.052
MetTyr: 1.503 ± 1.343
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.504 ± 0.297
AsnCys: 1.503 ± 1.062
AsnAsp: 5.76 ± 0.9
AsnGlu: 3.005 ± 1.176
AsnPhe: 2.004 ± 0.4
AsnGly: 2.004 ± 1.235
AsnHis: 1.503 ± 0.823
AsnIle: 4.508 ± 1.371
AsnLys: 3.005 ± 0.914
AsnLeu: 5.259 ± 0.816
AsnMet: 2.254 ± 0.75
AsnAsn: 4.007 ± 1.074
AsnPro: 2.004 ± 0.4
AsnGln: 2.755 ± 0.391
AsnArg: 2.254 ± 0.75
AsnSer: 2.504 ± 0.889
AsnThr: 3.757 ± 1.046
AsnVal: 1.753 ± 0.734
AsnTrp: 1.503 ± 0.381
AsnTyr: 4.007 ± 1.463
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.005 ± 0.358
ProCys: 0.25 ± 1.089
ProAsp: 1.753 ± 0.601
ProGlu: 3.506 ± 0.468
ProPhe: 1.503 ± 0.381
ProGly: 1.503 ± 0.588
ProHis: 1.002 ± 0.596
ProIle: 3.757 ± 0.362
ProLys: 2.004 ± 1.113
ProLeu: 1.503 ± 1.09
ProMet: 0.25 ± 0.658
ProAsn: 0.501 ± 0.152
ProPro: 0.25 ± 0.15
ProGln: 0.751 ± 1.264
ProArg: 0.751 ± 0.366
ProSer: 2.254 ± 1.03
ProThr: 1.252 ± 0.311
ProVal: 2.254 ± 0.484
ProTrp: 1.002 ± 1.041
ProTyr: 2.004 ± 0.609
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.503 ± 0.527
GlnCys: 1.252 ± 1.019
GlnAsp: 2.254 ± 0.721
GlnGlu: 0.751 ± 0.191
GlnPhe: 1.503 ± 1.806
GlnGly: 1.002 ± 0.307
GlnHis: 0.501 ± 1.052
GlnIle: 3.506 ± 0.978
GlnLys: 3.757 ± 1.219
GlnLeu: 1.753 ± 0.657
GlnMet: 1.503 ± 0.823
GlnAsn: 1.503 ± 0.381
GlnPro: 0.501 ± 0.152
GlnGln: 1.002 ± 0.307
GlnArg: 2.254 ± 1.143
GlnSer: 2.254 ± 0.262
GlnThr: 1.753 ± 0.601
GlnVal: 2.254 ± 0.484
GlnTrp: 1.002 ± 1.503
GlnTyr: 1.002 ± 0.307
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.753 ± 1.031
ArgCys: 1.252 ± 0.51
ArgAsp: 3.256 ± 0.991
ArgGlu: 3.757 ± 1.275
ArgPhe: 1.503 ± 0.457
ArgGly: 1.503 ± 0.843
ArgHis: 2.504 ± 0.877
ArgIle: 3.757 ± 1.362
ArgLys: 3.757 ± 0.604
ArgLeu: 4.257 ± 0.889
ArgMet: 0.751 ± 0.704
ArgAsn: 2.004 ± 0.615
ArgPro: 1.252 ± 1.419
ArgGln: 1.753 ± 1.959
ArgArg: 1.002 ± 2.073
ArgSer: 4.007 ± 0.745
ArgThr: 1.252 ± 2.037
ArgVal: 2.504 ± 1.19
ArgTrp: 0.25 ± 0.658
ArgTyr: 2.254 ± 0.973
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.755 ± 0.794
SerCys: 3.256 ± 1.693
SerAsp: 2.504 ± 0.676
SerGlu: 4.508 ± 0.734
SerPhe: 2.254 ± 0.597
SerGly: 3.005 ± 0.676
SerHis: 1.503 ± 0.843
SerIle: 8.014 ± 2.007
SerLys: 8.264 ± 2.097
SerLeu: 6.511 ± 2.365
SerMet: 2.755 ± 0.654
SerAsn: 4.758 ± 0.647
SerPro: 3.005 ± 0.849
SerGln: 2.004 ± 0.615
SerArg: 5.259 ± 2.024
SerSer: 6.011 ± 4.814
SerThr: 5.259 ± 0.775
SerVal: 3.005 ± 0.763
SerTrp: 0.501 ± 0.47
SerTyr: 2.504 ± 0.709
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.256 ± 0.898
ThrCys: 0.751 ± 0.704
ThrAsp: 2.504 ± 0.707
ThrGlu: 2.755 ± 0.956
ThrPhe: 3.506 ± 2.212
ThrGly: 4.007 ± 0.59
ThrHis: 2.004 ± 0.493
ThrIle: 5.009 ± 0.679
ThrLys: 5.009 ± 0.634
ThrLeu: 3.506 ± 1.496
ThrMet: 0.751 ± 0.646
ThrAsn: 4.257 ± 1.197
ThrPro: 1.002 ± 0.307
ThrGln: 1.503 ± 0.527
ThrArg: 1.503 ± 0.381
ThrSer: 5.51 ± 0.783
ThrThr: 3.757 ± 0.819
ThrVal: 3.256 ± 2.091
ThrTrp: 1.002 ± 0.803
ThrTyr: 2.504 ± 0.762
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.004 ± 0.993
ValCys: 1.002 ± 0.941
ValAsp: 2.504 ± 0.939
ValGlu: 2.254 ± 1.03
ValPhe: 2.504 ± 0.594
ValGly: 1.503 ± 1.416
ValHis: 0.751 ± 0.366
ValIle: 3.757 ± 1.007
ValLys: 3.506 ± 0.292
ValLeu: 5.259 ± 3.401
ValMet: 1.002 ± 0.307
ValAsn: 3.757 ± 0.628
ValPro: 2.254 ± 1.643
ValGln: 2.004 ± 0.541
ValArg: 2.755 ± 1.876
ValSer: 3.256 ± 1.056
ValThr: 2.004 ± 0.493
ValVal: 2.004 ± 1.686
ValTrp: 0.0 ± 0.0
ValTyr: 1.753 ± 0.961
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.002 ± 1.25
TrpCys: 0.0 ± 0.0
TrpAsp: 0.751 ± 0.191
TrpGlu: 0.25 ± 0.15
TrpPhe: 1.252 ± 0.51
TrpGly: 0.25 ± 0.235
TrpHis: 0.25 ± 1.089
TrpIle: 0.751 ± 0.45
TrpLys: 0.0 ± 0.0
TrpLeu: 1.753 ± 1.923
TrpMet: 0.25 ± 0.658
TrpAsn: 0.751 ± 0.627
TrpPro: 0.501 ± 0.47
TrpGln: 0.501 ± 0.3
TrpArg: 0.751 ± 0.191
TrpSer: 1.503 ± 0.381
TrpThr: 0.501 ± 0.152
TrpVal: 0.501 ± 0.3
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.25 ± 0.15
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.002 ± 0.305
TyrCys: 0.501 ± 0.47
TyrAsp: 1.252 ± 0.445
TyrGlu: 1.503 ± 0.588
TyrPhe: 2.004 ± 0.615
TyrGly: 2.004 ± 1.12
TyrHis: 1.002 ± 0.975
TyrIle: 3.256 ± 0.801
TyrLys: 5.009 ± 2.338
TyrLeu: 3.256 ± 1.419
TyrMet: 1.002 ± 0.307
TyrAsn: 3.256 ± 0.801
TyrPro: 0.751 ± 0.191
TyrGln: 1.503 ± 0.733
TyrArg: 2.755 ± 0.956
TyrSer: 3.757 ± 0.932
TyrThr: 2.254 ± 0.597
TyrVal: 1.503 ± 1.367
TyrTrp: 0.501 ± 0.152
TyrTyr: 1.002 ± 0.596
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3994 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
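A protein-level bootstrap of this kind could look roughly like the sketch below. It is illustrative only: the function names, the toy sequences, the fixed random seed, and the use of the population standard deviation of the resampled frequencies as the reported error are assumptions, and the database may define the error differently.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_dipeptide_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread (per mille) of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Population standard deviation of the bootstrap estimates (assumed error measure).
    return statistics.pstdev(estimates)


# Toy proteome for illustration only, not Oropouche virus sequences.
toy = ["MKKLAKK", "AAGG", "KKKK", "MLST"]
kk_error = bootstrap_dipeptide_error(toy, "KK")
```

With only a handful of proteins, as here, each resample can change the composition considerably, which is one plausible reason an error in the table can exceed the value it accompanies.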

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski