Amino acid dipeptide frequency for Mycoplasma virus P1

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
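As a rough illustration of how such per mille values can be obtained, the sketch below counts overlapping dipeptides in a set of sequences and compares them with the uniform expectation of 2.5‰. The function names and the toy sequences are hypothetical. The counting convention (overlapping pairs within each protein, normalised by the total number of pairs counted) is an assumption, since the page does not state how Proteome-pI normalises its counts.

```python
from collections import Counter

# Uniform expectation: 1 of 400 possible dipeptides, expressed in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Made-up one-letter sequences, for illustration only.
toy_proteins = ["MKKLA", "MKLLK"]
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation would be the "red" cells.
overrepresented = sorted(p for p, f in freqs.items() if f > UNIFORM_PER_MILLE)
```

With only eight dipeptides in the toy input, every observed pair is far above 2.5‰; on a real proteome the comparison becomes meaningful.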

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) as value ± error, grouped by first amino acid. Order of the second amino acid:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.559 ± 0.353
AlaCys: 0.0 ± 0.0
AlaAsp: 0.838 ± 0.349
AlaGlu: 1.118 ± 0.371
AlaPhe: 3.633 ± 1.14
AlaGly: 1.397 ± 0.465
AlaHis: 0.838 ± 0.527
AlaIle: 4.192 ± 1.415
AlaLys: 3.354 ± 1.143
AlaLeu: 5.031 ± 1.247
AlaMet: 0.838 ± 0.705
AlaAsn: 3.354 ± 0.402
AlaPro: 0.0 ± 0.0
AlaGln: 3.354 ± 0.978
AlaArg: 0.279 ± 0.254
AlaSer: 3.074 ± 0.428
AlaThr: 1.397 ± 0.375
AlaVal: 1.956 ± 1.028
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.236 ± 0.687
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.279 ± 0.359
CysGlu: 0.279 ± 0.235
CysPhe: 0.559 ± 0.486
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.279 ± 0.246
CysLys: 0.559 ± 0.391
CysLeu: 0.279 ± 0.359
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.279 ± 0.313
CysSer: 0.0 ± 0.0
CysThr: 0.559 ± 0.263
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.559 ± 0.263
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.397 ± 0.496
AspCys: 0.838 ± 0.488
AspAsp: 1.118 ± 0.477
AspGlu: 2.236 ± 1.2
AspPhe: 6.149 ± 1.667
AspGly: 0.559 ± 0.308
AspHis: 0.279 ± 0.254
AspIle: 5.869 ± 1.031
AspLys: 6.708 ± 1.337
AspLeu: 7.826 ± 1.42
AspMet: 0.838 ± 0.389
AspAsn: 4.751 ± 1.748
AspPro: 0.838 ± 0.404
AspGln: 2.515 ± 0.931
AspArg: 2.236 ± 0.843
AspSer: 6.708 ± 0.565
AspThr: 1.956 ± 0.738
AspVal: 2.515 ± 0.787
AspTrp: 0.559 ± 0.356
AspTyr: 3.913 ± 0.951
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.074 ± 0.8
GluCys: 0.0 ± 0.0
GluAsp: 5.31 ± 1.233
GluGlu: 6.149 ± 1.62
GluPhe: 3.633 ± 1.033
GluGly: 2.795 ± 0.994
GluHis: 1.677 ± 0.717
GluIle: 6.708 ± 1.264
GluLys: 5.869 ± 1.136
GluLeu: 5.31 ± 1.295
GluMet: 1.956 ± 0.701
GluAsn: 6.428 ± 1.974
GluPro: 1.677 ± 0.686
GluGln: 2.515 ± 0.587
GluArg: 4.472 ± 1.285
GluSer: 2.236 ± 0.907
GluThr: 3.913 ± 1.215
GluVal: 5.031 ± 0.772
GluTrp: 1.677 ± 0.412
GluTyr: 3.074 ± 1.078
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.118 ± 0.375
PheCys: 0.559 ± 0.295
PheAsp: 2.795 ± 0.779
PheGlu: 4.751 ± 1.327
PhePhe: 2.515 ± 0.714
PheGly: 3.074 ± 0.678
PheHis: 1.397 ± 0.925
PheIle: 5.869 ± 0.822
PheLys: 7.546 ± 1.891
PheLeu: 5.869 ± 1.333
PheMet: 1.118 ± 0.577
PheAsn: 7.267 ± 1.934
PhePro: 1.397 ± 0.674
PheGln: 3.354 ± 1.05
PheArg: 2.236 ± 0.7
PheSer: 5.31 ± 1.227
PheThr: 2.795 ± 0.584
PheVal: 3.633 ± 0.896
PheTrp: 1.397 ± 1.232
PheTyr: 3.074 ± 0.761
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.515 ± 0.816
GlyCys: 0.0 ± 0.0
GlyAsp: 1.677 ± 0.925
GlyGlu: 1.677 ± 0.728
GlyPhe: 1.956 ± 0.967
GlyGly: 1.677 ± 0.906
GlyHis: 0.559 ± 0.352
GlyIle: 1.397 ± 0.345
GlyLys: 3.913 ± 0.882
GlyLeu: 3.913 ± 0.706
GlyMet: 0.559 ± 0.303
GlyAsn: 1.397 ± 0.379
GlyPro: 2.515 ± 2.085
GlyGln: 1.397 ± 0.634
GlyArg: 0.838 ± 0.423
GlySer: 2.515 ± 0.65
GlyThr: 0.838 ± 0.405
GlyVal: 2.515 ± 0.735
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.397 ± 0.517
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.279 ± 0.235
HisAsp: 0.559 ± 0.308
HisGlu: 0.838 ± 0.391
HisPhe: 3.354 ± 1.105
HisGly: 0.838 ± 0.594
HisHis: 0.0 ± 0.0
HisIle: 1.956 ± 0.922
HisLys: 1.956 ± 0.653
HisLeu: 1.118 ± 0.482
HisMet: 0.0 ± 0.0
HisAsn: 0.838 ± 0.578
HisPro: 0.279 ± 0.313
HisGln: 0.559 ± 0.362
HisArg: 0.279 ± 0.246
HisSer: 0.559 ± 0.308
HisThr: 0.559 ± 0.378
HisVal: 0.279 ± 0.254
HisTrp: 0.559 ± 0.352
HisTyr: 0.838 ± 0.495
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.428 ± 1.218
IleCys: 0.559 ± 0.428
IleAsp: 6.149 ± 1.295
IleGlu: 6.708 ± 1.427
IlePhe: 4.192 ± 1.198
IleGly: 1.956 ± 0.764
IleHis: 0.838 ± 0.623
IleIle: 4.751 ± 1.034
IleLys: 9.782 ± 1.428
IleLeu: 5.031 ± 1.33
IleMet: 1.956 ± 0.864
IleAsn: 9.503 ± 2.048
IlePro: 1.118 ± 0.439
IleGln: 2.515 ± 0.908
IleArg: 3.633 ± 0.815
IleSer: 5.31 ± 0.838
IleThr: 3.354 ± 1.259
IleVal: 2.795 ± 0.654
IleTrp: 0.559 ± 0.405
IleTyr: 5.31 ± 1.513
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.031 ± 1.655
LysCys: 0.0 ± 0.0
LysAsp: 8.385 ± 1.236
LysGlu: 8.944 ± 1.459
LysPhe: 6.987 ± 1.062
LysGly: 4.192 ± 0.869
LysHis: 1.677 ± 0.517
LysIle: 7.546 ± 1.596
LysLys: 8.664 ± 1.563
LysLeu: 5.869 ± 1.293
LysMet: 1.397 ± 0.752
LysAsn: 9.782 ± 1.194
LysPro: 2.795 ± 0.718
LysGln: 3.913 ± 0.88
LysArg: 6.149 ± 1.309
LysSer: 4.192 ± 1.023
LysThr: 5.31 ± 1.111
LysVal: 4.192 ± 0.885
LysTrp: 1.118 ± 0.606
LysTyr: 5.869 ± 1.172
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.354 ± 1.331
LeuCys: 0.0 ± 0.0
LeuAsp: 8.105 ± 1.557
LeuGlu: 8.664 ± 1.037
LeuPhe: 6.987 ± 1.503
LeuGly: 1.397 ± 0.711
LeuHis: 0.838 ± 0.559
LeuIle: 7.826 ± 1.795
LeuLys: 12.856 ± 2.018
LeuLeu: 6.149 ± 1.022
LeuMet: 1.118 ± 0.41
LeuAsn: 5.59 ± 1.212
LeuPro: 1.956 ± 0.878
LeuGln: 4.472 ± 1.729
LeuArg: 3.633 ± 1.347
LeuSer: 4.751 ± 1.352
LeuThr: 6.987 ± 1.07
LeuVal: 3.354 ± 1.157
LeuTrp: 1.118 ± 0.786
LeuTyr: 4.472 ± 1.316
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.279 ± 0.286
MetCys: 0.0 ± 0.0
MetAsp: 1.118 ± 0.464
MetGlu: 2.795 ± 0.866
MetPhe: 0.838 ± 0.532
MetGly: 0.279 ± 0.235
MetHis: 0.559 ± 0.353
MetIle: 1.677 ± 0.536
MetLys: 1.118 ± 0.637
MetLeu: 1.677 ± 0.625
MetMet: 0.279 ± 0.259
MetAsn: 2.236 ± 0.568
MetPro: 0.279 ± 0.259
MetGln: 0.279 ± 0.311
MetArg: 0.838 ± 0.41
MetSer: 1.118 ± 0.586
MetThr: 1.118 ± 0.645
MetVal: 0.279 ± 0.235
MetTrp: 0.279 ± 0.313
MetTyr: 0.559 ± 0.494
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.515 ± 1.128
AsnCys: 0.559 ± 0.428
AsnAsp: 5.31 ± 1.959
AsnGlu: 6.987 ± 0.952
AsnPhe: 5.869 ± 1.775
AsnGly: 2.795 ± 0.745
AsnHis: 2.236 ± 0.604
AsnIle: 5.869 ± 1.151
AsnLys: 8.944 ± 1.456
AsnLeu: 7.267 ± 1.565
AsnMet: 2.515 ± 1.082
AsnAsn: 10.061 ± 2.069
AsnPro: 1.397 ± 0.458
AsnGln: 5.59 ± 1.419
AsnArg: 2.515 ± 0.918
AsnSer: 5.869 ± 1.363
AsnThr: 6.428 ± 1.524
AsnVal: 3.913 ± 0.968
AsnTrp: 0.838 ± 0.534
AsnTyr: 4.472 ± 0.634
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.279 ± 0.296
ProCys: 0.0 ± 0.0
ProAsp: 0.559 ± 0.327
ProGlu: 0.838 ± 0.469
ProPhe: 1.956 ± 0.565
ProGly: 0.559 ± 0.47
ProHis: 0.279 ± 0.235
ProIle: 1.397 ± 0.642
ProLys: 3.074 ± 0.855
ProLeu: 1.956 ± 0.472
ProMet: 0.279 ± 0.259
ProAsn: 1.118 ± 0.722
ProPro: 0.559 ± 0.263
ProGln: 1.397 ± 1.039
ProArg: 0.0 ± 0.0
ProSer: 1.677 ± 0.945
ProThr: 1.677 ± 0.914
ProVal: 1.397 ± 0.557
ProTrp: 0.279 ± 0.313
ProTyr: 0.838 ± 0.442
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.956 ± 0.894
GlnCys: 0.0 ± 0.0
GlnAsp: 2.236 ± 1.3
GlnGlu: 5.031 ± 1.646
GlnPhe: 1.397 ± 0.513
GlnGly: 3.633 ± 2.32
GlnHis: 0.0 ± 0.0
GlnIle: 4.751 ± 1.519
GlnLys: 4.472 ± 1.26
GlnLeu: 1.956 ± 0.753
GlnMet: 0.279 ± 0.28
GlnAsn: 2.515 ± 0.772
GlnPro: 0.0 ± 0.0
GlnGln: 1.118 ± 0.667
GlnArg: 2.515 ± 0.923
GlnSer: 5.031 ± 1.445
GlnThr: 1.397 ± 0.839
GlnVal: 1.956 ± 0.755
GlnTrp: 0.559 ± 0.372
GlnTyr: 1.397 ± 0.926
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.956 ± 0.69
ArgCys: 0.279 ± 0.359
ArgAsp: 3.074 ± 0.789
ArgGlu: 1.677 ± 0.39
ArgPhe: 2.515 ± 0.978
ArgGly: 2.236 ± 1.619
ArgHis: 0.559 ± 0.372
ArgIle: 2.236 ± 0.748
ArgLys: 2.236 ± 0.502
ArgLeu: 4.751 ± 0.583
ArgMet: 1.118 ± 0.785
ArgAsn: 1.397 ± 0.493
ArgPro: 0.559 ± 0.295
ArgGln: 0.838 ± 0.529
ArgArg: 1.118 ± 0.424
ArgSer: 1.397 ± 0.899
ArgThr: 1.397 ± 0.531
ArgVal: 3.074 ± 0.882
ArgTrp: 0.559 ± 0.493
ArgTyr: 4.472 ± 0.655
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.677 ± 0.858
SerCys: 0.279 ± 0.235
SerAsp: 4.472 ± 0.952
SerGlu: 3.913 ± 0.82
SerPhe: 4.751 ± 1.16
SerGly: 0.838 ± 0.405
SerHis: 1.956 ± 0.566
SerIle: 5.31 ± 1.064
SerLys: 4.751 ± 1.333
SerLeu: 10.62 ± 1.269
SerMet: 0.559 ± 0.397
SerAsn: 5.869 ± 1.139
SerPro: 2.236 ± 0.607
SerGln: 2.515 ± 0.638
SerArg: 1.677 ± 0.617
SerSer: 4.472 ± 0.893
SerThr: 1.677 ± 0.835
SerVal: 1.677 ± 0.735
SerTrp: 1.397 ± 0.734
SerTyr: 3.913 ± 0.675
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.559 ± 0.393
ThrCys: 0.0 ± 0.0
ThrAsp: 0.838 ± 0.334
ThrGlu: 1.956 ± 0.732
ThrPhe: 3.913 ± 1.269
ThrGly: 1.677 ± 0.614
ThrHis: 0.838 ± 0.519
ThrIle: 4.472 ± 1.489
ThrLys: 5.869 ± 1.744
ThrLeu: 6.708 ± 1.131
ThrMet: 1.397 ± 0.576
ThrAsn: 7.267 ± 1.138
ThrPro: 0.838 ± 0.839
ThrGln: 1.118 ± 0.493
ThrArg: 1.397 ± 0.575
ThrSer: 3.074 ± 1.009
ThrThr: 2.236 ± 0.817
ThrVal: 0.559 ± 0.378
ThrTrp: 0.279 ± 0.259
ThrTyr: 2.236 ± 0.753
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.677 ± 0.789
ValCys: 0.0 ± 0.0
ValAsp: 2.236 ± 0.619
ValGlu: 5.031 ± 0.683
ValPhe: 1.397 ± 0.944
ValGly: 1.118 ± 0.385
ValHis: 0.279 ± 0.313
ValIle: 3.913 ± 0.821
ValLys: 5.031 ± 0.782
ValLeu: 3.913 ± 0.975
ValMet: 0.559 ± 0.353
ValAsn: 5.59 ± 1.204
ValPro: 0.279 ± 0.246
ValGln: 1.677 ± 0.64
ValArg: 2.236 ± 0.721
ValSer: 2.236 ± 0.716
ValThr: 0.838 ± 0.59
ValVal: 2.236 ± 0.951
ValTrp: 0.279 ± 0.235
ValTyr: 2.795 ± 1.046
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.559 ± 0.323
TrpCys: 0.0 ± 0.0
TrpAsp: 0.559 ± 0.405
TrpGlu: 1.397 ± 0.495
TrpPhe: 0.279 ± 0.246
TrpGly: 0.0 ± 0.0
TrpHis: 0.279 ± 0.296
TrpIle: 1.397 ± 0.715
TrpLys: 0.559 ± 0.341
TrpLeu: 1.956 ± 0.907
TrpMet: 0.0 ± 0.0
TrpAsn: 0.559 ± 0.405
TrpPro: 0.279 ± 0.286
TrpGln: 0.838 ± 0.449
TrpArg: 0.279 ± 0.296
TrpSer: 0.559 ± 0.372
TrpThr: 1.118 ± 0.588
TrpVal: 0.279 ± 0.235
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.559 ± 0.341
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.515 ± 0.465
TyrCys: 0.279 ± 0.246
TyrAsp: 4.192 ± 0.849
TyrGlu: 2.795 ± 0.876
TyrPhe: 4.192 ± 0.99
TyrGly: 2.236 ± 0.669
TyrHis: 0.559 ± 0.341
TyrIle: 5.031 ± 0.813
TyrLys: 4.751 ± 1.231
TyrLeu: 6.428 ± 1.41
TyrMet: 0.559 ± 0.356
TyrAsn: 6.428 ± 1.316
TyrPro: 1.118 ± 0.597
TyrGln: 2.236 ± 0.552
TyrArg: 0.838 ± 0.451
TyrSer: 4.472 ± 1.375
TyrThr: 1.677 ± 0.857
TyrVal: 1.677 ± 0.639
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.633 ± 0.9
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (3579 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
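One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The sketch below follows that reading. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are all assumptions, not details confirmed by the page.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}; pairs never span proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Assumption: each resample draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return stdev(estimates)


# Made-up one-letter sequences, for illustration only.
toy_proteins = ["MKKLA", "MKLLK", "AKKKM"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Because resampling happens per protein rather than per residue, a dipeptide concentrated in one or two proteins gets a large error, which matches the relatively wide intervals seen for a proteome of only 11 proteins.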

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski