Amino acid dipeptide frequency for Mangshi virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
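As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is a minimal sketch under stated assumptions, not necessarily the procedure used to build this page: the function name dipeptide_per_mille is hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and pairs are assumed never to span two different proteins.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper; assumes one-letter amino acid codes.
    """
    counts = Counter()
    for seq in sequences:
        # Adjacent, overlapping pairs; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made-up sequences, not the Mangshi virus proteome).
freqs = dipeptide_per_mille(["MAAL", "AAK"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

In the toy input there are five dipeptides in total (MA, AA, AL, AA, AK), so AA accounts for two fifths of them.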

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰); multiply them by 10^-3 to obtain fractions.
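The unit conversion and the comparison with the random expectation can be sketched as follows. This is only an illustration: the names expected_per_mille, per_mille_to_fraction and classify are hypothetical, and the threshold assumes the uniform expectation over the 400 standard dipeptides described above (1/400 = 0.25% = 2.5‰).

```python
STANDARD_AA = 20

# Uniform expectation over 20 x 20 dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / STANDARD_AA ** 2  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla (7.188) and CysPhe (0.167).
print(classify(7.188))
print(classify(0.167))
```

With these assumptions AlaAla (7.188‰) falls above the 2.5‰ expectation and CysPhe (0.167‰) below it.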

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.188 ± 1.308
AlaCys: 1.17 ± 0.537
AlaAsp: 6.185 ± 0.959
AlaGlu: 3.845 ± 0.768
AlaPhe: 2.675 ± 0.592
AlaGly: 3.678 ± 0.865
AlaHis: 2.006 ± 0.868
AlaIle: 4.012 ± 0.696
AlaLys: 3.009 ± 0.74
AlaLeu: 7.857 ± 1.536
AlaMet: 2.842 ± 0.587
AlaAsn: 3.678 ± 0.953
AlaPro: 2.842 ± 0.44
AlaGln: 2.508 ± 0.953
AlaArg: 3.678 ± 0.947
AlaSer: 4.179 ± 1.046
AlaThr: 5.851 ± 0.789
AlaVal: 4.346 ± 1.157
AlaTrp: 0.334 ± 0.236
AlaTyr: 3.343 ± 0.824
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.502 ± 0.217
CysCys: 0.502 ± 0.272
CysAsp: 1.337 ± 0.429
CysGlu: 0.669 ± 0.387
CysPhe: 0.167 ± 0.19
CysGly: 0.669 ± 0.222
CysHis: 0.167 ± 0.161
CysIle: 0.502 ± 0.304
CysLys: 0.669 ± 0.372
CysLeu: 1.337 ± 0.512
CysMet: 0.669 ± 0.34
CysAsn: 1.003 ± 0.292
CysPro: 0.167 ± 0.224
CysGln: 0.669 ± 0.37
CysArg: 0.167 ± 0.161
CysSer: 0.669 ± 0.575
CysThr: 0.334 ± 0.263
CysVal: 0.502 ± 0.356
CysTrp: 0.0 ± 0.0
CysTyr: 0.167 ± 0.123
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.52 ± 1.733
AspCys: 0.0 ± 0.0
AspAsp: 3.343 ± 0.861
AspGlu: 4.681 ± 0.71
AspPhe: 2.675 ± 0.534
AspGly: 4.681 ± 0.532
AspHis: 1.17 ± 0.373
AspIle: 5.851 ± 0.629
AspLys: 3.176 ± 0.673
AspLeu: 4.346 ± 0.682
AspMet: 1.505 ± 0.391
AspAsn: 2.842 ± 0.782
AspPro: 2.675 ± 0.812
AspGln: 2.508 ± 0.648
AspArg: 3.845 ± 0.895
AspSer: 3.343 ± 0.647
AspThr: 3.009 ± 0.499
AspVal: 4.514 ± 0.709
AspTrp: 0.836 ± 0.355
AspTyr: 2.006 ± 0.551
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.343 ± 0.803
GluCys: 0.502 ± 0.246
GluAsp: 2.173 ± 0.516
GluGlu: 1.839 ± 0.526
GluPhe: 1.672 ± 0.64
GluGly: 2.173 ± 0.61
GluHis: 1.672 ± 0.469
GluIle: 3.511 ± 0.956
GluLys: 2.006 ± 0.393
GluLeu: 4.681 ± 0.779
GluMet: 1.672 ± 0.932
GluAsn: 2.508 ± 0.616
GluPro: 2.34 ± 0.518
GluGln: 2.508 ± 0.374
GluArg: 4.179 ± 0.654
GluSer: 2.842 ± 0.584
GluThr: 2.508 ± 0.61
GluVal: 5.182 ± 1.104
GluTrp: 0.334 ± 0.197
GluTyr: 3.176 ± 0.822
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.839 ± 0.54
PheCys: 0.502 ± 0.265
PheAsp: 2.006 ± 0.465
PheGlu: 2.508 ± 0.453
PhePhe: 2.006 ± 0.715
PheGly: 2.842 ± 0.362
PheHis: 0.334 ± 0.195
PheIle: 2.34 ± 0.581
PheLys: 3.009 ± 0.5
PheLeu: 3.176 ± 0.715
PheMet: 1.003 ± 0.465
PheAsn: 2.842 ± 0.438
PhePro: 0.836 ± 0.308
PheGln: 1.17 ± 0.537
PheArg: 3.176 ± 0.337
PheSer: 2.34 ± 0.569
PheThr: 2.173 ± 0.447
PheVal: 3.009 ± 0.711
PheTrp: 0.0 ± 0.0
PheTyr: 0.502 ± 0.265
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.343 ± 0.899
GlyCys: 0.167 ± 0.123
GlyAsp: 2.508 ± 0.601
GlyGlu: 2.173 ± 0.449
GlyPhe: 1.672 ± 0.464
GlyGly: 3.845 ± 0.974
GlyHis: 0.334 ± 0.227
GlyIle: 2.34 ± 0.494
GlyLys: 2.34 ± 0.618
GlyLeu: 7.021 ± 0.837
GlyMet: 1.839 ± 0.618
GlyAsn: 4.012 ± 1.022
GlyPro: 1.672 ± 0.665
GlyGln: 2.842 ± 0.618
GlyArg: 3.343 ± 0.786
GlySer: 6.52 ± 0.69
GlyThr: 4.346 ± 0.681
GlyVal: 5.182 ± 0.869
GlyTrp: 0.669 ± 0.225
GlyTyr: 2.173 ± 0.724
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.006 ± 0.614
HisCys: 0.334 ± 0.217
HisAsp: 1.839 ± 0.487
HisGlu: 1.337 ± 0.375
HisPhe: 1.17 ± 0.419
HisGly: 1.003 ± 0.241
HisHis: 0.167 ± 0.153
HisIle: 1.17 ± 0.376
HisLys: 1.003 ± 0.465
HisLeu: 1.003 ± 0.358
HisMet: 0.836 ± 0.423
HisAsn: 0.669 ± 0.286
HisPro: 1.17 ± 0.399
HisGln: 0.669 ± 0.341
HisArg: 0.334 ± 0.263
HisSer: 1.505 ± 0.421
HisThr: 0.836 ± 0.396
HisVal: 1.672 ± 0.531
HisTrp: 0.0 ± 0.0
HisTyr: 1.505 ± 0.374
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.349 ± 0.879
IleCys: 1.003 ± 0.451
IleAsp: 5.015 ± 0.815
IleGlu: 2.34 ± 0.47
IlePhe: 2.173 ± 0.437
IleGly: 3.678 ± 1.027
IleHis: 0.836 ± 0.339
IleIle: 2.675 ± 0.752
IleLys: 4.346 ± 0.674
IleLeu: 5.182 ± 0.893
IleMet: 2.006 ± 0.622
IleAsn: 4.012 ± 0.731
IlePro: 2.842 ± 0.84
IleGln: 2.006 ± 0.602
IleArg: 2.675 ± 0.402
IleSer: 6.018 ± 0.668
IleThr: 4.012 ± 0.493
IleVal: 4.179 ± 0.812
IleTrp: 0.334 ± 0.247
IleTyr: 1.672 ± 0.557
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.675 ± 0.519
LysCys: 0.669 ± 0.262
LysAsp: 3.343 ± 0.633
LysGlu: 2.675 ± 0.874
LysPhe: 2.006 ± 0.537
LysGly: 3.511 ± 0.866
LysHis: 1.672 ± 0.454
LysIle: 3.678 ± 0.612
LysLys: 3.176 ± 0.943
LysLeu: 5.182 ± 0.824
LysMet: 1.672 ± 0.664
LysAsn: 3.009 ± 0.503
LysPro: 2.34 ± 0.524
LysGln: 1.337 ± 0.498
LysArg: 3.511 ± 0.761
LysSer: 3.845 ± 0.722
LysThr: 2.34 ± 0.796
LysVal: 4.681 ± 1.152
LysTrp: 0.334 ± 0.195
LysTyr: 2.173 ± 0.613
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.851 ± 0.857
LeuCys: 0.669 ± 0.405
LeuAsp: 5.015 ± 0.621
LeuGlu: 5.015 ± 0.817
LeuPhe: 2.842 ± 0.737
LeuGly: 4.346 ± 1.222
LeuHis: 1.337 ± 0.288
LeuIle: 5.517 ± 1.189
LeuLys: 5.684 ± 0.948
LeuLeu: 6.687 ± 0.616
LeuMet: 2.34 ± 0.603
LeuAsn: 5.015 ± 1.127
LeuPro: 4.179 ± 0.749
LeuGln: 2.842 ± 0.972
LeuArg: 4.681 ± 0.656
LeuSer: 6.018 ± 1.021
LeuThr: 7.188 ± 1.19
LeuVal: 6.185 ± 0.646
LeuTrp: 0.669 ± 0.528
LeuTyr: 3.009 ± 0.877
LeuXaa: 0.167 ± 0.169

Met
MetAla: 3.009 ± 1.22
MetCys: 0.0 ± 0.0
MetAsp: 2.34 ± 0.478
MetGlu: 1.003 ± 0.523
MetPhe: 1.839 ± 0.761
MetGly: 0.836 ± 0.465
MetHis: 1.17 ± 0.34
MetIle: 2.842 ± 0.559
MetLys: 1.505 ± 0.398
MetLeu: 2.508 ± 0.577
MetMet: 0.836 ± 0.44
MetAsn: 0.836 ± 0.378
MetPro: 1.839 ± 0.489
MetGln: 1.17 ± 0.33
MetArg: 1.672 ± 0.34
MetSer: 2.34 ± 0.597
MetThr: 3.678 ± 0.819
MetVal: 2.173 ± 0.439
MetTrp: 0.0 ± 0.0
MetTyr: 0.502 ± 0.205
MetXaa: 0.167 ± 0.169

Asn
AsnAla: 3.343 ± 0.587
AsnCys: 0.836 ± 0.498
AsnAsp: 3.176 ± 0.452
AsnGlu: 3.176 ± 0.369
AsnPhe: 2.508 ± 0.41
AsnGly: 5.015 ± 0.91
AsnHis: 0.334 ± 0.186
AsnIle: 4.848 ± 0.683
AsnLys: 2.842 ± 0.644
AsnLeu: 4.681 ± 0.876
AsnMet: 1.839 ± 0.404
AsnAsn: 3.009 ± 0.923
AsnPro: 2.675 ± 0.556
AsnGln: 1.672 ± 0.366
AsnArg: 3.009 ± 0.575
AsnSer: 2.34 ± 0.585
AsnThr: 4.012 ± 0.766
AsnVal: 4.346 ± 1.167
AsnTrp: 0.502 ± 0.262
AsnTyr: 2.34 ± 0.812
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.848 ± 0.935
ProCys: 0.167 ± 0.152
ProAsp: 2.675 ± 0.577
ProGlu: 1.672 ± 0.595
ProPhe: 2.173 ± 0.599
ProGly: 2.675 ± 0.874
ProHis: 1.17 ± 0.379
ProIle: 2.675 ± 0.544
ProLys: 1.672 ± 0.451
ProLeu: 3.343 ± 1.039
ProMet: 0.836 ± 0.367
ProAsn: 1.672 ± 0.588
ProPro: 1.003 ± 0.41
ProGln: 1.839 ± 0.472
ProArg: 2.006 ± 0.684
ProSer: 3.678 ± 0.874
ProThr: 2.34 ± 0.842
ProVal: 2.675 ± 0.375
ProTrp: 0.334 ± 0.236
ProTyr: 1.672 ± 0.733
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.009 ± 1.521
GlnCys: 0.167 ± 0.19
GlnAsp: 1.672 ± 0.334
GlnGlu: 1.839 ± 0.572
GlnPhe: 2.006 ± 0.625
GlnGly: 2.842 ± 0.61
GlnHis: 1.505 ± 0.441
GlnIle: 2.508 ± 0.637
GlnLys: 1.003 ± 0.403
GlnLeu: 3.009 ± 0.959
GlnMet: 0.669 ± 0.327
GlnAsn: 1.672 ± 0.675
GlnPro: 1.003 ± 0.346
GlnGln: 1.672 ± 0.901
GlnArg: 1.505 ± 0.613
GlnSer: 3.511 ± 0.988
GlnThr: 2.508 ± 0.808
GlnVal: 4.514 ± 0.729
GlnTrp: 0.502 ± 0.265
GlnTyr: 0.669 ± 0.261
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.012 ± 0.828
ArgCys: 0.334 ± 0.211
ArgAsp: 3.511 ± 1.002
ArgGlu: 3.009 ± 0.688
ArgPhe: 2.173 ± 0.45
ArgGly: 2.842 ± 0.461
ArgHis: 1.17 ± 0.421
ArgIle: 2.842 ± 0.741
ArgLys: 2.675 ± 0.499
ArgLeu: 5.517 ± 1.054
ArgMet: 2.173 ± 0.667
ArgAsn: 4.346 ± 0.944
ArgPro: 2.34 ± 0.369
ArgGln: 2.006 ± 0.455
ArgArg: 3.511 ± 0.725
ArgSer: 4.346 ± 0.533
ArgThr: 3.511 ± 0.443
ArgVal: 3.845 ± 1.152
ArgTrp: 0.167 ± 0.169
ArgTyr: 2.34 ± 0.356
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.015 ± 0.934
SerCys: 0.836 ± 0.421
SerAsp: 3.678 ± 0.798
SerGlu: 4.012 ± 0.914
SerPhe: 2.508 ± 0.578
SerGly: 3.845 ± 0.486
SerHis: 1.337 ± 0.676
SerIle: 4.514 ± 0.738
SerLys: 4.681 ± 0.949
SerLeu: 4.848 ± 0.906
SerMet: 2.675 ± 0.44
SerAsn: 4.179 ± 0.537
SerPro: 3.678 ± 0.587
SerGln: 1.839 ± 0.534
SerArg: 4.346 ± 0.823
SerSer: 5.015 ± 1.155
SerThr: 4.514 ± 0.831
SerVal: 5.517 ± 0.972
SerTrp: 0.836 ± 0.49
SerTyr: 3.343 ± 0.546
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.851 ± 0.673
ThrCys: 0.334 ± 0.238
ThrAsp: 4.514 ± 0.614
ThrGlu: 3.343 ± 1.225
ThrPhe: 1.839 ± 0.398
ThrGly: 3.176 ± 0.394
ThrHis: 1.505 ± 0.503
ThrIle: 4.012 ± 0.618
ThrLys: 2.842 ± 0.878
ThrLeu: 4.179 ± 0.777
ThrMet: 2.173 ± 0.667
ThrAsn: 2.675 ± 0.656
ThrPro: 3.009 ± 0.699
ThrGln: 3.845 ± 0.801
ThrArg: 3.009 ± 0.644
ThrSer: 5.182 ± 1.193
ThrThr: 5.684 ± 1.046
ThrVal: 6.185 ± 0.868
ThrTrp: 0.167 ± 0.123
ThrTyr: 2.173 ± 0.629
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.179 ± 1.148
ValCys: 1.505 ± 0.492
ValAsp: 6.352 ± 0.966
ValGlu: 3.678 ± 0.704
ValPhe: 2.173 ± 0.42
ValGly: 4.179 ± 0.501
ValHis: 1.337 ± 0.514
ValIle: 3.511 ± 0.555
ValLys: 5.015 ± 0.909
ValLeu: 6.352 ± 1.271
ValMet: 2.842 ± 0.612
ValAsn: 6.018 ± 1.589
ValPro: 3.176 ± 0.803
ValGln: 2.675 ± 0.37
ValArg: 5.015 ± 0.739
ValSer: 5.684 ± 1.153
ValThr: 5.015 ± 1.155
ValVal: 4.848 ± 1.068
ValTrp: 0.167 ± 0.169
ValTyr: 3.009 ± 0.476
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.334 ± 0.217
TrpCys: 0.0 ± 0.0
TrpAsp: 0.334 ± 0.19
TrpGlu: 0.502 ± 0.226
TrpPhe: 0.0 ± 0.0
TrpGly: 0.669 ± 0.285
TrpHis: 0.167 ± 0.169
TrpIle: 0.167 ± 0.152
TrpLys: 0.502 ± 0.211
TrpLeu: 1.003 ± 0.304
TrpMet: 0.0 ± 0.0
TrpAsn: 0.669 ± 0.298
TrpPro: 0.0 ± 0.0
TrpGln: 0.502 ± 0.273
TrpArg: 0.0 ± 0.0
TrpSer: 0.669 ± 0.285
TrpThr: 0.502 ± 0.226
TrpVal: 0.167 ± 0.123
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.167 ± 0.152
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.009 ± 0.722
TyrCys: 1.17 ± 0.472
TyrAsp: 2.508 ± 0.806
TyrGlu: 1.505 ± 0.4
TyrPhe: 1.337 ± 0.441
TyrGly: 1.839 ± 0.674
TyrHis: 0.669 ± 0.354
TyrIle: 3.009 ± 0.508
TyrLys: 2.675 ± 0.722
TyrLeu: 3.343 ± 0.818
TyrMet: 1.505 ± 0.278
TyrAsn: 1.839 ± 0.548
TyrPro: 1.337 ± 0.378
TyrGln: 1.505 ± 0.569
TyrArg: 3.009 ± 0.509
TyrSer: 1.17 ± 0.476
TyrThr: 1.337 ± 0.399
TyrVal: 3.009 ± 0.557
TyrTrp: 0.167 ± 0.153
TyrTyr: 1.337 ± 0.399
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.167 ± 0.169
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.167 ± 0.169
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 12 proteins (5983 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
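One way such a protein-level bootstrap could look is sketched below. It is a hedged illustration, not the database's actual code: the page only states that bootstrapping was done 100 times at the protein level, so the use of the standard deviation across replicates as the reported error, the one-letter sequence codes, and the names count_pairs and bootstrap_error are all assumptions.

```python
import random
import statistics
from collections import Counter


def count_pairs(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread (in per mille) of one dipeptide's frequency across bootstrap replicates.

    Whole proteins are resampled with replacement; the standard deviation
    of the replicate frequencies is returned (an assumed error definition).
    """
    rng = random.Random(seed)
    per_protein = [count_pairs(s) for s in sequences]
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (made-up sequences, not the Mangshi virus proteome).
toy = ["MAAL", "AAK", "MKLV", "GAAS"]
print(bootstrap_error(toy, "AA"))
```

Under this scheme a dipeptide that never occurs in any protein has zero frequency in every replicate and therefore zero spread, which is consistent with the 0.0 ± 0.0 entries in the table.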

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski