Amino acid dipeptide frequency for Orchid fleck dichorhavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
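The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `dipeptide_per_mille` and `classify` are hypothetical, the toy sequences are made up, and the normalisation (overlapping pairs counted within each protein, divided by the total number of pairs) is an assumption, since the exact procedure used by the database is not stated here.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted inside each protein only (no pair spans
    two proteins) and are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, n_symbols=20):
    """Compare a frequency with the uniform expectation 1000 / n_symbols**2."""
    expected = 1000.0 / n_symbols ** 2  # 2.5 per mille for 20 amino acids
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Made-up toy sequences, not taken from the proteome.
toy = ["MKKLLA", "AAXG"]
freqs = dipeptide_per_mille(toy)
```

With the uniform expectation of 2.5‰, a value such as 5.52 would be labelled "over" and 0.828 "under", which mirrors the red and blue marking described above.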

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
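As a small illustration of this unit conversion, the helpers below turn a per mille value into a plain fraction or a percentage. The function names are hypothetical and chosen only for this sketch.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# The uniform expectation for 400 dipeptides, expressed in three units.
uniform_per_mille = 1000.0 / 400
uniform_fraction = per_mille_to_fraction(uniform_per_mille)
uniform_percent = per_mille_to_percent(uniform_per_mille)
```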

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.52 ± 1.606
AlaCys: 1.104 ± 0.436
AlaAsp: 3.312 ± 1.44
AlaGlu: 4.692 ± 1.428
AlaPhe: 2.208 ± 0.662
AlaGly: 3.312 ± 1.67
AlaHis: 1.656 ± 0.495
AlaIle: 3.036 ± 0.629
AlaLys: 2.76 ± 0.607
AlaLeu: 5.796 ± 1.661
AlaMet: 1.38 ± 1.355
AlaAsn: 1.932 ± 0.609
AlaPro: 2.76 ± 0.989
AlaGln: 0.828 ± 0.336
AlaArg: 3.312 ± 1.102
AlaSer: 3.864 ± 0.974
AlaThr: 4.692 ± 1.662
AlaVal: 5.244 ± 0.524
AlaTrp: 0.828 ± 0.478
AlaTyr: 1.932 ± 0.9
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.208 ± 0.761
CysCys: 0.828 ± 0.347
CysAsp: 2.208 ± 0.638
CysGlu: 1.38 ± 0.693
CysPhe: 0.0 ± 0.0
CysGly: 1.104 ± 0.529
CysHis: 1.38 ± 0.515
CysIle: 1.104 ± 0.372
CysLys: 1.104 ± 0.529
CysLeu: 1.932 ± 0.397
CysMet: 0.828 ± 0.505
CysAsn: 1.38 ± 0.381
CysPro: 0.276 ± 0.168
CysGln: 0.276 ± 0.168
CysArg: 1.104 ± 0.466
CysSer: 1.38 ± 0.561
CysThr: 0.552 ± 0.702
CysVal: 1.38 ± 0.372
CysTrp: 0.0 ± 0.0
CysTyr: 0.276 ± 0.168
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.588 ± 0.932
AspCys: 1.104 ± 0.525
AspAsp: 3.588 ± 0.73
AspGlu: 3.036 ± 0.822
AspPhe: 1.932 ± 0.595
AspGly: 2.76 ± 1.072
AspHis: 1.932 ± 0.704
AspIle: 4.14 ± 0.922
AspLys: 3.588 ± 1.136
AspLeu: 5.52 ± 0.754
AspMet: 3.312 ± 0.827
AspAsn: 4.14 ± 0.903
AspPro: 2.484 ± 1.071
AspGln: 0.552 ± 0.351
AspArg: 2.484 ± 0.628
AspSer: 3.036 ± 0.559
AspThr: 2.208 ± 1.13
AspVal: 3.036 ± 1.254
AspTrp: 0.828 ± 0.377
AspTyr: 1.104 ± 0.372
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.968 ± 1.583
GluCys: 1.104 ± 0.673
GluAsp: 2.484 ± 0.601
GluGlu: 3.036 ± 0.928
GluPhe: 1.38 ± 0.613
GluGly: 5.796 ± 1.579
GluHis: 2.484 ± 0.625
GluIle: 3.864 ± 0.897
GluLys: 3.036 ± 0.837
GluLeu: 4.416 ± 1.669
GluMet: 2.76 ± 1.135
GluAsn: 2.208 ± 1.025
GluPro: 1.38 ± 0.561
GluGln: 1.38 ± 0.693
GluArg: 1.38 ± 0.549
GluSer: 3.588 ± 1.234
GluThr: 3.036 ± 0.835
GluVal: 3.588 ± 1.762
GluTrp: 1.38 ± 0.561
GluTyr: 2.76 ± 0.948
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.932 ± 0.914
PheCys: 0.552 ± 0.265
PheAsp: 0.552 ± 0.579
PheGlu: 1.38 ± 0.566
PhePhe: 1.104 ± 0.462
PheGly: 0.828 ± 0.319
PheHis: 0.828 ± 0.319
PheIle: 1.932 ± 1.081
PheLys: 3.864 ± 1.092
PheLeu: 3.312 ± 1.111
PheMet: 1.656 ± 0.673
PheAsn: 1.104 ± 0.661
PhePro: 1.656 ± 0.713
PheGln: 1.104 ± 0.673
PheArg: 2.76 ± 0.742
PheSer: 0.828 ± 0.319
PheThr: 2.76 ± 0.807
PheVal: 0.552 ± 0.546
PheTrp: 0.552 ± 0.287
PheTyr: 0.552 ± 0.336
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.312 ± 1.505
GlyCys: 0.552 ± 0.265
GlyAsp: 4.416 ± 1.755
GlyGlu: 4.692 ± 0.873
GlyPhe: 2.208 ± 1.224
GlyGly: 5.796 ± 1.213
GlyHis: 0.552 ± 0.265
GlyIle: 2.484 ± 0.391
GlyLys: 3.588 ± 0.971
GlyLeu: 6.348 ± 1.175
GlyMet: 4.692 ± 1.52
GlyAsn: 3.312 ± 0.352
GlyPro: 4.968 ± 2.84
GlyGln: 1.656 ± 0.355
GlyArg: 3.588 ± 0.792
GlySer: 3.588 ± 1.019
GlyThr: 1.932 ± 0.811
GlyVal: 4.968 ± 1.047
GlyTrp: 0.552 ± 0.265
GlyTyr: 1.656 ± 0.535
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.828 ± 0.336
HisAsp: 2.484 ± 0.601
HisGlu: 1.104 ± 0.346
HisPhe: 0.276 ± 0.168
HisGly: 2.484 ± 0.692
HisHis: 1.104 ± 0.861
HisIle: 1.104 ± 0.577
HisLys: 1.38 ± 0.566
HisLeu: 3.312 ± 1.372
HisMet: 0.828 ± 0.319
HisAsn: 1.656 ± 0.593
HisPro: 1.104 ± 0.466
HisGln: 1.104 ± 0.429
HisArg: 1.38 ± 0.566
HisSer: 2.76 ± 0.967
HisThr: 2.484 ± 0.494
HisVal: 2.484 ± 1.613
HisTrp: 0.276 ± 0.168
HisTyr: 1.104 ± 0.712
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.864 ± 1.579
IleCys: 2.76 ± 1.207
IleAsp: 3.588 ± 1.517
IleGlu: 2.76 ± 0.868
IlePhe: 2.208 ± 0.872
IleGly: 3.864 ± 1.432
IleHis: 1.656 ± 0.535
IleIle: 5.52 ± 2.316
IleLys: 4.968 ± 1.875
IleLeu: 6.072 ± 0.952
IleMet: 2.484 ± 0.936
IleAsn: 3.312 ± 0.788
IlePro: 3.036 ± 0.777
IleGln: 2.484 ± 0.644
IleArg: 2.484 ± 0.762
IleSer: 3.312 ± 0.995
IleThr: 4.416 ± 1.716
IleVal: 1.656 ± 0.47
IleTrp: 0.828 ± 0.377
IleTyr: 1.932 ± 0.892
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.312 ± 1.292
LysCys: 1.38 ± 0.364
LysAsp: 3.312 ± 1.435
LysGlu: 3.864 ± 0.385
LysPhe: 1.656 ± 0.756
LysGly: 3.588 ± 0.849
LysHis: 2.208 ± 0.978
LysIle: 2.484 ± 0.957
LysLys: 3.036 ± 0.915
LysLeu: 3.312 ± 1.18
LysMet: 2.208 ± 0.58
LysAsn: 3.036 ± 0.787
LysPro: 3.588 ± 1.142
LysGln: 1.656 ± 0.659
LysArg: 3.036 ± 0.859
LysSer: 4.692 ± 0.682
LysThr: 4.692 ± 1.052
LysVal: 4.692 ± 0.862
LysTrp: 1.656 ± 0.706
LysTyr: 1.104 ± 0.436
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.692 ± 0.685
LeuCys: 1.656 ± 0.503
LeuAsp: 5.244 ± 1.437
LeuGlu: 4.692 ± 0.777
LeuPhe: 3.312 ± 0.86
LeuGly: 4.416 ± 1.368
LeuHis: 3.036 ± 0.531
LeuIle: 4.692 ± 0.469
LeuLys: 4.968 ± 2.009
LeuLeu: 6.072 ± 1.261
LeuMet: 5.244 ± 1.312
LeuAsn: 2.484 ± 1.061
LeuPro: 2.208 ± 0.739
LeuGln: 1.38 ± 0.953
LeuArg: 5.244 ± 0.634
LeuSer: 11.041 ± 0.367
LeuThr: 6.072 ± 1.25
LeuVal: 5.796 ± 1.751
LeuTrp: 1.656 ± 1.46
LeuTyr: 2.208 ± 0.513
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.588 ± 0.843
MetCys: 1.104 ± 0.896
MetAsp: 2.208 ± 0.579
MetGlu: 3.036 ± 1.205
MetPhe: 2.208 ± 0.723
MetGly: 2.208 ± 0.679
MetHis: 0.276 ± 0.168
MetIle: 3.864 ± 0.911
MetLys: 1.38 ± 0.404
MetLeu: 2.484 ± 0.962
MetMet: 1.656 ± 1.009
MetAsn: 1.38 ± 0.668
MetPro: 0.828 ± 0.695
MetGln: 0.828 ± 0.346
MetArg: 2.76 ± 1.207
MetSer: 5.244 ± 0.731
MetThr: 3.588 ± 1.539
MetVal: 1.932 ± 0.366
MetTrp: 0.552 ± 0.336
MetTyr: 1.932 ± 0.456
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.828 ± 0.598
AsnCys: 0.0 ± 0.0
AsnAsp: 3.312 ± 1.123
AsnGlu: 2.484 ± 0.535
AsnPhe: 0.552 ± 0.265
AsnGly: 1.932 ± 0.747
AsnHis: 0.552 ± 0.336
AsnIle: 3.864 ± 0.824
AsnLys: 3.588 ± 0.985
AsnLeu: 3.036 ± 0.862
AsnMet: 1.932 ± 0.671
AsnAsn: 0.828 ± 0.347
AsnPro: 3.588 ± 1.65
AsnGln: 1.656 ± 0.835
AsnArg: 2.208 ± 0.291
AsnSer: 2.208 ± 0.566
AsnThr: 3.588 ± 1.532
AsnVal: 2.76 ± 0.973
AsnTrp: 0.828 ± 0.392
AsnTyr: 2.484 ± 0.746
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.588 ± 1.462
ProCys: 0.828 ± 0.377
ProAsp: 1.932 ± 0.669
ProGlu: 1.38 ± 0.389
ProPhe: 0.828 ± 0.347
ProGly: 3.312 ± 1.011
ProHis: 1.656 ± 0.733
ProIle: 3.036 ± 0.631
ProLys: 3.312 ± 0.713
ProLeu: 4.416 ± 0.512
ProMet: 1.656 ± 0.713
ProAsn: 1.38 ± 0.579
ProPro: 2.76 ± 2.218
ProGln: 0.828 ± 0.392
ProArg: 1.656 ± 1.184
ProSer: 3.036 ± 1.363
ProThr: 4.692 ± 1.183
ProVal: 4.14 ± 0.908
ProTrp: 1.104 ± 0.725
ProTyr: 1.38 ± 0.372
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.76 ± 0.66
GlnCys: 1.656 ± 0.594
GlnAsp: 0.828 ± 0.461
GlnGlu: 3.036 ± 0.915
GlnPhe: 0.828 ± 0.319
GlnGly: 2.208 ± 1.11
GlnHis: 0.828 ± 0.346
GlnIle: 1.656 ± 0.639
GlnLys: 0.828 ± 0.347
GlnLeu: 1.656 ± 0.64
GlnMet: 1.104 ± 0.462
GlnAsn: 0.276 ± 0.38
GlnPro: 0.0 ± 0.0
GlnGln: 1.932 ± 0.764
GlnArg: 0.276 ± 0.38
GlnSer: 0.828 ± 0.336
GlnThr: 1.932 ± 0.37
GlnVal: 3.036 ± 0.798
GlnTrp: 0.828 ± 0.66
GlnTyr: 0.828 ± 0.403
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.312 ± 0.713
ArgCys: 1.38 ± 0.579
ArgAsp: 2.484 ± 0.892
ArgGlu: 3.588 ± 0.329
ArgPhe: 1.38 ± 0.599
ArgGly: 4.416 ± 1.299
ArgHis: 1.38 ± 0.372
ArgIle: 2.484 ± 0.911
ArgLys: 3.864 ± 1.213
ArgLeu: 2.484 ± 0.651
ArgMet: 1.932 ± 0.982
ArgAsn: 1.656 ± 0.486
ArgPro: 2.208 ± 0.637
ArgGln: 1.38 ± 0.533
ArgArg: 3.864 ± 0.927
ArgSer: 4.416 ± 1.707
ArgThr: 2.484 ± 0.604
ArgVal: 3.864 ± 1.122
ArgTrp: 0.276 ± 0.168
ArgTyr: 2.76 ± 0.767
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.14 ± 1.788
SerCys: 1.656 ± 0.638
SerAsp: 4.416 ± 1.247
SerGlu: 3.588 ± 0.962
SerPhe: 1.932 ± 1.032
SerGly: 4.692 ± 0.882
SerHis: 2.76 ± 2.194
SerIle: 5.244 ± 1.851
SerLys: 3.312 ± 0.809
SerLeu: 8.28 ± 1.387
SerMet: 2.208 ± 1.024
SerAsn: 3.588 ± 0.945
SerPro: 3.864 ± 1.401
SerGln: 2.76 ± 0.818
SerArg: 4.692 ± 1.669
SerSer: 11.593 ± 1.706
SerThr: 3.864 ± 2.605
SerVal: 5.244 ± 1.006
SerTrp: 1.656 ± 0.733
SerTyr: 2.76 ± 0.64
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.312 ± 0.97
ThrCys: 0.276 ± 0.38
ThrAsp: 3.588 ± 1.145
ThrGlu: 3.864 ± 1.132
ThrPhe: 1.932 ± 0.818
ThrGly: 4.416 ± 1.256
ThrHis: 1.38 ± 0.665
ThrIle: 4.14 ± 1.07
ThrLys: 4.14 ± 0.773
ThrLeu: 6.348 ± 0.406
ThrMet: 3.312 ± 1.071
ThrAsn: 2.484 ± 0.942
ThrPro: 4.416 ± 0.942
ThrGln: 1.656 ± 0.929
ThrArg: 3.036 ± 1.015
ThrSer: 4.692 ± 1.316
ThrThr: 4.968 ± 1.569
ThrVal: 5.796 ± 3.025
ThrTrp: 1.932 ± 0.366
ThrTyr: 1.38 ± 0.648
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.76 ± 1.144
ValCys: 1.104 ± 0.436
ValAsp: 3.036 ± 1.043
ValGlu: 2.484 ± 0.596
ValPhe: 1.38 ± 0.566
ValGly: 4.416 ± 0.793
ValHis: 1.104 ± 0.599
ValIle: 4.692 ± 1.285
ValLys: 4.14 ± 1.619
ValLeu: 7.728 ± 1.466
ValMet: 2.484 ± 0.92
ValAsn: 4.416 ± 0.648
ValPro: 3.864 ± 0.809
ValGln: 2.484 ± 0.772
ValArg: 3.036 ± 0.753
ValSer: 7.728 ± 0.422
ValThr: 5.244 ± 1.04
ValVal: 3.864 ± 1.138
ValTrp: 0.828 ± 0.403
ValTyr: 1.656 ± 0.543
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.656 ± 0.535
TrpCys: 0.276 ± 0.168
TrpAsp: 0.828 ± 0.392
TrpGlu: 0.276 ± 0.168
TrpPhe: 0.0 ± 0.0
TrpGly: 0.828 ± 0.505
TrpHis: 0.276 ± 0.168
TrpIle: 2.208 ± 0.76
TrpLys: 0.276 ± 0.308
TrpLeu: 1.656 ± 1.159
TrpMet: 0.276 ± 0.38
TrpAsn: 1.104 ± 0.346
TrpPro: 0.276 ± 0.168
TrpGln: 0.552 ± 0.781
TrpArg: 1.656 ± 0.545
TrpSer: 1.656 ± 0.506
TrpThr: 1.104 ± 0.673
TrpVal: 1.656 ± 0.706
TrpTrp: 0.552 ± 0.265
TrpTyr: 0.276 ± 0.38
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.828 ± 0.764
TyrCys: 0.552 ± 0.761
TyrAsp: 0.828 ± 0.587
TyrGlu: 1.656 ± 0.324
TyrPhe: 2.484 ± 0.494
TyrGly: 2.484 ± 0.601
TyrHis: 2.208 ± 1.112
TyrIle: 1.656 ± 0.585
TyrLys: 1.38 ± 0.841
TyrLeu: 1.932 ± 0.37
TyrMet: 0.828 ± 0.346
TyrAsn: 0.552 ± 0.637
TyrPro: 1.656 ± 0.47
TyrGln: 0.828 ± 0.346
TyrArg: 1.656 ± 0.563
TyrSer: 2.76 ± 1.433
TyrThr: 2.76 ± 0.687
TyrVal: 2.76 ± 0.848
TyrTrp: 0.276 ± 0.168
TyrTyr: 0.552 ± 0.336
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3624 amino acids)
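
For readers who want to work with the listing above programmatically, the following sketch parses entries of the form "AlaCys: 1.104 ± 0.436" into a dictionary keyed by (first residue, second residue). The names `ENTRY` and `parse_entries` are hypothetical, and the pattern is only an assumption about the text layout shown on this page; it also tolerates a copy of the value fused to the front of an entry.

```python
import re

# Optional leading number, two three-letter residue codes, value, "±", error.
ENTRY = re.compile(
    r"^\s*[\d.]*([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)\s*$"
)


def parse_entries(lines):
    """Map (first, second) residue codes to (value, error), both in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.match(line)
        if match is None:
            continue  # row labels such as "Ala" and other text are skipped
        first, second, value, error = match.groups()
        table[(first, second)] = (float(value), float(error))
    return table


sample = [
    "Ala",
    "AlaAla: 5.52 ± 1.606",
    "1.104AlaCys: 1.104 ± 0.436",
    "LeuSer: 11.041 ± 0.367",
]
parsed = parse_entries(sample)
```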

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
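
A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the statistic is recomputed on each resample, and the spread of the estimates is reported. The function names are hypothetical, and the use of the population standard deviation as the error measure is an assumption; the note above does not specify which spread measure the database uses.

```python
import random
import statistics


def bootstrap_error(proteins, statistic, n_boot=100, seed=0):
    """Spread of `statistic` over protein-level bootstrap resamples.

    Assumption: the error is the standard deviation of the resampled estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(statistic(resample))
    return statistics.pstdev(estimates)


def leu_ser_per_mille(proteins):
    """Per mille frequency of the Leu-Ser dipeptide (one-letter code "LS")."""
    pairs = [a + b for seq in proteins for a, b in zip(seq, seq[1:])]
    if not pairs:
        return 0.0
    return 1000.0 * pairs.count("LS") / len(pairs)


# Made-up toy sequences, not taken from the proteome.
toy = ["MLSLSA", "AAGLS", "KKKK"]
error = bootstrap_error(toy, leu_ser_per_mille)
```

With only a handful of proteins, as here, the resamples differ a lot from one another, which is one reason the errors can be large relative to the values.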

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski