Amino acid dipeptide frequency for Murine pneumonia virus (strain 15) (MPV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
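As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides within each protein sequence, reports their frequencies in per mille, and compares a frequency against the uniform expectation of 2.5‰ (1/400). The function names, the toy sequences, and the choice to count pairs only within a protein (not across protein boundaries) are assumptions made for illustration; the exact procedure used to build the table is not described on this page.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs at positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input, not taken from the MPV proteome.
toy = ["MKLLSA", "LLSAK"]
freqs = dipeptide_per_mille(toy)
```

With such a helper, a table value like 3.436‰ (AlaAla) would be labelled "over" and 0.215‰ (CysMet) "under" relative to the uniform expectation.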

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
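To make the unit concrete, here is a small sketch of the conversion. The helper name is hypothetical, and the sample value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(3.436)  # AlaAla entry from the table
uniform = 1 / 400                       # 0.25% expressed as a fraction
ratio = ala_ala / uniform               # AlaAla relative to the uniform expectation
```

Here `ratio` comes out slightly above 1, which is what "more frequent than expected" means for a single table entry.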

For more information, see the sequence space article on Wikipedia.

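Rows give the first amino acid of the dipeptide and columns the second. Column order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa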
Ala
3.436AlaAla: 3.436 ± 1.549
1.503AlaCys: 1.503 ± 1.039
2.577AlaAsp: 2.577 ± 0.701
3.436AlaGlu: 3.436 ± 1.589
1.503AlaPhe: 1.503 ± 0.501
5.155AlaGly: 5.155 ± 1.767
0.859AlaHis: 0.859 ± 0.352
3.436AlaIle: 3.436 ± 1.021
2.577AlaLys: 2.577 ± 0.779
4.725AlaLeu: 4.725 ± 1.017
1.718AlaMet: 1.718 ± 0.719
2.792AlaAsn: 2.792 ± 0.779
3.007AlaPro: 3.007 ± 1.325
1.074AlaGln: 1.074 ± 0.526
3.436AlaArg: 3.436 ± 1.193
2.148AlaSer: 2.148 ± 0.527
2.577AlaThr: 2.577 ± 1.149
4.51AlaVal: 4.51 ± 2.047
0.43AlaTrp: 0.43 ± 0.453
1.503AlaTyr: 1.503 ± 0.451
0.0AlaXaa: 0.0 ± 0.0
Cys
0.859CysAla: 0.859 ± 0.739
0.644CysCys: 0.644 ± 0.465
1.074CysAsp: 1.074 ± 0.756
0.859CysGlu: 0.859 ± 0.385
1.074CysPhe: 1.074 ± 0.497
0.859CysGly: 0.859 ± 0.429
1.074CysHis: 1.074 ± 0.816
1.289CysIle: 1.289 ± 0.665
2.148CysLys: 2.148 ± 0.876
1.503CysLeu: 1.503 ± 0.687
0.215CysMet: 0.215 ± 0.247
2.148CysAsn: 2.148 ± 1.287
1.289CysPro: 1.289 ± 0.602
0.215CysGln: 0.215 ± 0.135
0.859CysArg: 0.859 ± 0.419
2.148CysSer: 2.148 ± 0.577
1.933CysThr: 1.933 ± 0.588
1.933CysVal: 1.933 ± 1.036
0.215CysTrp: 0.215 ± 0.268
0.859CysTyr: 0.859 ± 0.394
0.0CysXaa: 0.0 ± 0.0
Asp
3.222AspAla: 3.222 ± 1.123
1.289AspCys: 1.289 ± 1.114
4.296AspAsp: 4.296 ± 1.323
2.363AspGlu: 2.363 ± 0.939
1.289AspPhe: 1.289 ± 0.553
1.289AspGly: 1.289 ± 0.737
1.074AspHis: 1.074 ± 0.534
3.436AspIle: 3.436 ± 1.163
3.222AspLys: 3.222 ± 0.808
5.584AspLeu: 5.584 ± 0.762
1.289AspMet: 1.289 ± 0.584
2.148AspAsn: 2.148 ± 0.465
2.363AspPro: 2.363 ± 0.592
1.503AspGln: 1.503 ± 0.913
2.577AspArg: 2.577 ± 1.092
3.222AspSer: 3.222 ± 0.998
4.296AspThr: 4.296 ± 0.971
3.866AspVal: 3.866 ± 1.341
0.644AspTrp: 0.644 ± 0.501
2.363AspTyr: 2.363 ± 0.803
0.0AspXaa: 0.0 ± 0.0
Glu
2.577GluAla: 2.577 ± 1.039
1.503GluCys: 1.503 ± 0.39
2.363GluAsp: 2.363 ± 1.417
3.222GluGlu: 3.222 ± 2.772
2.577GluPhe: 2.577 ± 0.941
1.718GluGly: 1.718 ± 0.88
1.503GluHis: 1.503 ± 0.385
3.222GluIle: 3.222 ± 0.891
3.866GluLys: 3.866 ± 1.848
6.873GluLeu: 6.873 ± 0.87
1.289GluMet: 1.289 ± 0.491
1.933GluAsn: 1.933 ± 0.627
1.718GluPro: 1.718 ± 1.016
1.718GluGln: 1.718 ± 0.63
2.792GluArg: 2.792 ± 0.953
3.651GluSer: 3.651 ± 0.93
2.792GluThr: 2.792 ± 0.79
2.792GluVal: 2.792 ± 0.6
0.859GluTrp: 0.859 ± 0.424
1.718GluTyr: 1.718 ± 0.626
0.0GluXaa: 0.0 ± 0.0
Phe
0.859PheAla: 0.859 ± 0.555
0.859PheCys: 0.859 ± 0.608
1.718PheAsp: 1.718 ± 0.588
1.074PheGlu: 1.074 ± 0.416
1.289PhePhe: 1.289 ± 0.79
1.074PheGly: 1.074 ± 0.348
1.289PheHis: 1.289 ± 0.507
3.007PheIle: 3.007 ± 0.66
1.718PheLys: 1.718 ± 0.382
4.51PheLeu: 4.51 ± 1.007
0.644PheMet: 0.644 ± 0.378
2.792PheAsn: 2.792 ± 0.865
1.933PhePro: 1.933 ± 0.61
1.289PheGln: 1.289 ± 0.336
1.718PheArg: 1.718 ± 1.082
2.363PheSer: 2.363 ± 0.736
1.933PheThr: 1.933 ± 0.722
2.148PheVal: 2.148 ± 0.834
0.215PheTrp: 0.215 ± 0.135
1.933PheTyr: 1.933 ± 0.639
0.0PheXaa: 0.0 ± 0.0
Gly
2.363GlyAla: 2.363 ± 0.741
1.289GlyCys: 1.289 ± 0.698
1.718GlyAsp: 1.718 ± 0.614
2.363GlyGlu: 2.363 ± 0.517
1.933GlyPhe: 1.933 ± 0.52
2.792GlyGly: 2.792 ± 0.863
1.933GlyHis: 1.933 ± 0.514
2.577GlyIle: 2.577 ± 0.768
1.933GlyLys: 1.933 ± 0.48
7.302GlyLeu: 7.302 ± 2.059
1.289GlyMet: 1.289 ± 0.556
1.718GlyAsn: 1.718 ± 0.472
1.718GlyPro: 1.718 ± 0.705
1.289GlyGln: 1.289 ± 0.621
2.363GlyArg: 2.363 ± 0.713
4.081GlySer: 4.081 ± 0.607
1.933GlyThr: 1.933 ± 0.688
5.369GlyVal: 5.369 ± 0.838
0.644GlyTrp: 0.644 ± 0.501
1.933GlyTyr: 1.933 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
1.289HisAla: 1.289 ± 0.552
0.43HisCys: 0.43 ± 0.27
1.289HisAsp: 1.289 ± 0.564
1.074HisGlu: 1.074 ± 0.362
0.859HisPhe: 0.859 ± 0.541
1.074HisGly: 1.074 ± 0.549
1.289HisHis: 1.289 ± 0.621
1.718HisIle: 1.718 ± 0.597
1.503HisLys: 1.503 ± 0.768
2.148HisLeu: 2.148 ± 0.952
1.503HisMet: 1.503 ± 0.752
2.363HisAsn: 2.363 ± 1.046
1.503HisPro: 1.503 ± 0.803
0.43HisGln: 0.43 ± 0.334
1.074HisArg: 1.074 ± 0.511
1.503HisSer: 1.503 ± 0.365
0.859HisThr: 0.859 ± 0.407
0.859HisVal: 0.859 ± 0.555
1.289HisTrp: 1.289 ± 0.642
0.859HisTyr: 0.859 ± 0.632
0.0HisXaa: 0.0 ± 0.0
Ile
3.222IleAla: 3.222 ± 0.596
1.289IleCys: 1.289 ± 0.572
3.436IleAsp: 3.436 ± 0.95
4.51IleGlu: 4.51 ± 1.307
1.718IlePhe: 1.718 ± 0.786
3.222IleGly: 3.222 ± 0.767
1.289IleHis: 1.289 ± 0.906
5.369IleIle: 5.369 ± 1.518
3.222IleLys: 3.222 ± 1.024
8.162IleLeu: 8.162 ± 1.15
3.007IleMet: 3.007 ± 0.933
5.369IleAsn: 5.369 ± 0.761
2.792IlePro: 2.792 ± 0.527
2.148IleGln: 2.148 ± 0.644
4.725IleArg: 4.725 ± 1.364
4.94IleSer: 4.94 ± 1.018
4.725IleThr: 4.725 ± 0.948
3.007IleVal: 3.007 ± 0.934
0.43IleTrp: 0.43 ± 0.27
1.074IleTyr: 1.074 ± 0.453
0.0IleXaa: 0.0 ± 0.0
Lys
4.081LysAla: 4.081 ± 0.856
0.859LysCys: 0.859 ± 0.335
2.577LysAsp: 2.577 ± 1.201
3.866LysGlu: 3.866 ± 1.014
2.577LysPhe: 2.577 ± 1.358
2.363LysGly: 2.363 ± 0.731
2.363LysHis: 2.363 ± 0.824
3.436LysIle: 3.436 ± 0.795
3.651LysLys: 3.651 ± 0.785
6.014LysLeu: 6.014 ± 1.199
1.289LysMet: 1.289 ± 0.514
3.436LysAsn: 3.436 ± 0.741
3.436LysPro: 3.436 ± 1.272
1.718LysGln: 1.718 ± 0.614
2.792LysArg: 2.792 ± 1.092
5.799LysSer: 5.799 ± 1.858
3.007LysThr: 3.007 ± 0.748
4.725LysVal: 4.725 ± 1.367
0.43LysTrp: 0.43 ± 0.27
3.007LysTyr: 3.007 ± 0.779
0.0LysXaa: 0.0 ± 0.0
Leu
6.014LeuAla: 6.014 ± 1.418
1.289LeuCys: 1.289 ± 0.61
6.014LeuAsp: 6.014 ± 0.985
4.94LeuGlu: 4.94 ± 1.483
3.436LeuPhe: 3.436 ± 0.858
5.155LeuGly: 5.155 ± 0.925
3.866LeuHis: 3.866 ± 1.355
7.088LeuIle: 7.088 ± 1.092
9.88LeuLys: 9.88 ± 2.102
9.88LeuLeu: 9.88 ± 1.679
2.148LeuMet: 2.148 ± 0.914
6.014LeuAsn: 6.014 ± 1.117
5.799LeuPro: 5.799 ± 1.225
4.081LeuGln: 4.081 ± 1.215
5.369LeuArg: 5.369 ± 0.956
11.168LeuSer: 11.168 ± 2.399
9.665LeuThr: 9.665 ± 1.398
4.725LeuVal: 4.725 ± 1.298
0.43LeuTrp: 0.43 ± 0.251
3.007LeuTyr: 3.007 ± 0.77
0.0LeuXaa: 0.0 ± 0.0
Met
1.289MetAla: 1.289 ± 0.745
0.859MetCys: 0.859 ± 0.591
1.503MetAsp: 1.503 ± 0.502
1.503MetGlu: 1.503 ± 0.99
1.074MetPhe: 1.074 ± 0.412
1.289MetGly: 1.289 ± 0.605
0.215MetHis: 0.215 ± 0.294
2.148MetIle: 2.148 ± 0.571
1.718MetLys: 1.718 ± 0.56
3.651MetLeu: 3.651 ± 0.671
1.074MetMet: 1.074 ± 0.432
0.644MetAsn: 0.644 ± 0.494
0.859MetPro: 0.859 ± 0.458
1.503MetGln: 1.503 ± 0.581
0.43MetArg: 0.43 ± 0.251
3.651MetSer: 3.651 ± 0.852
1.718MetThr: 1.718 ± 0.745
1.503MetVal: 1.503 ± 0.652
0.215MetTrp: 0.215 ± 0.135
1.074MetTyr: 1.074 ± 0.546
0.0MetXaa: 0.0 ± 0.0
Asn
3.007AsnAla: 3.007 ± 0.842
1.503AsnCys: 1.503 ± 0.706
1.718AsnAsp: 1.718 ± 0.385
1.289AsnGlu: 1.289 ± 0.626
2.577AsnPhe: 2.577 ± 0.359
2.148AsnGly: 2.148 ± 1.229
0.644AsnHis: 0.644 ± 0.435
5.155AsnIle: 5.155 ± 0.963
4.725AsnLys: 4.725 ± 1.01
7.302AsnLeu: 7.302 ± 1.657
2.792AsnMet: 2.792 ± 0.705
3.222AsnAsn: 3.222 ± 0.611
2.148AsnPro: 2.148 ± 0.564
2.148AsnGln: 2.148 ± 0.605
3.651AsnArg: 3.651 ± 1.186
4.081AsnSer: 4.081 ± 1.18
4.296AsnThr: 4.296 ± 1.513
3.007AsnVal: 3.007 ± 1.155
0.644AsnTrp: 0.644 ± 0.345
2.363AsnTyr: 2.363 ± 0.743
0.0AsnXaa: 0.0 ± 0.0
Pro
1.718ProAla: 1.718 ± 0.607
0.859ProCys: 0.859 ± 0.443
3.007ProAsp: 3.007 ± 0.627
1.289ProGlu: 1.289 ± 1.01
0.644ProPhe: 0.644 ± 0.308
1.933ProGly: 1.933 ± 0.841
0.644ProHis: 0.644 ± 0.318
2.363ProIle: 2.363 ± 0.642
3.651ProLys: 3.651 ± 0.579
4.081ProLeu: 4.081 ± 1.662
1.503ProMet: 1.503 ± 0.589
3.222ProAsn: 3.222 ± 0.983
3.007ProPro: 3.007 ± 1.626
1.503ProGln: 1.503 ± 0.609
2.577ProArg: 2.577 ± 1.776
3.222ProSer: 3.222 ± 0.97
4.296ProThr: 4.296 ± 2.153
2.577ProVal: 2.577 ± 0.729
0.859ProTrp: 0.859 ± 0.541
2.363ProTyr: 2.363 ± 1.16
0.0ProXaa: 0.0 ± 0.0
Gln
2.577GlnAla: 2.577 ± 0.786
0.644GlnCys: 0.644 ± 0.435
1.289GlnAsp: 1.289 ± 0.642
2.363GlnGlu: 2.363 ± 0.667
1.718GlnPhe: 1.718 ± 0.915
1.933GlnGly: 1.933 ± 0.712
0.859GlnHis: 0.859 ± 0.373
1.933GlnIle: 1.933 ± 0.718
1.289GlnLys: 1.289 ± 0.483
3.436GlnLeu: 3.436 ± 0.835
0.43GlnMet: 0.43 ± 0.251
1.503GlnAsn: 1.503 ± 0.781
0.859GlnPro: 0.859 ± 0.51
1.074GlnGln: 1.074 ± 0.409
1.289GlnArg: 1.289 ± 0.484
2.148GlnSer: 2.148 ± 0.492
1.718GlnThr: 1.718 ± 0.852
1.933GlnVal: 1.933 ± 0.777
0.0GlnTrp: 0.0 ± 0.0
1.289GlnTyr: 1.289 ± 0.441
0.0GlnXaa: 0.0 ± 0.0
Arg
2.792ArgAla: 2.792 ± 0.859
1.074ArgCys: 1.074 ± 0.504
3.651ArgAsp: 3.651 ± 0.897
3.436ArgGlu: 3.436 ± 1.042
2.363ArgPhe: 2.363 ± 0.691
2.792ArgGly: 2.792 ± 0.888
0.859ArgHis: 0.859 ± 0.318
3.007ArgIle: 3.007 ± 0.823
2.363ArgLys: 2.363 ± 0.463
5.155ArgLeu: 5.155 ± 1.117
0.859ArgMet: 0.859 ± 0.692
3.222ArgAsn: 3.222 ± 0.79
1.718ArgPro: 1.718 ± 0.741
2.363ArgGln: 2.363 ± 0.876
3.007ArgArg: 3.007 ± 0.722
3.866ArgSer: 3.866 ± 0.52
3.222ArgThr: 3.222 ± 0.963
3.007ArgVal: 3.007 ± 0.861
1.074ArgTrp: 1.074 ± 0.426
1.933ArgTyr: 1.933 ± 0.77
0.0ArgXaa: 0.0 ± 0.0
Ser
3.222SerAla: 3.222 ± 0.63
2.577SerCys: 2.577 ± 0.972
2.792SerAsp: 2.792 ± 1.075
4.081SerGlu: 4.081 ± 1.048
3.436SerPhe: 3.436 ± 0.98
3.866SerGly: 3.866 ± 0.799
0.859SerHis: 0.859 ± 0.54
3.866SerIle: 3.866 ± 1.148
5.155SerLys: 5.155 ± 1.283
10.309SerLeu: 10.309 ± 2.478
1.074SerMet: 1.074 ± 0.389
6.229SerAsn: 6.229 ± 1.434
2.577SerPro: 2.577 ± 0.764
1.503SerGln: 1.503 ± 0.584
3.651SerArg: 3.651 ± 1.34
9.235SerSer: 9.235 ± 1.417
6.658SerThr: 6.658 ± 1.286
7.088SerVal: 7.088 ± 1.004
0.859SerTrp: 0.859 ± 0.477
3.651SerTyr: 3.651 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
4.94ThrAla: 4.94 ± 1.624
1.289ThrCys: 1.289 ± 0.727
4.296ThrAsp: 4.296 ± 0.955
3.222ThrGlu: 3.222 ± 0.913
1.289ThrPhe: 1.289 ± 0.576
3.866ThrGly: 3.866 ± 0.749
1.718ThrHis: 1.718 ± 0.479
3.436ThrIle: 3.436 ± 1.067
3.436ThrLys: 3.436 ± 0.625
5.155ThrLeu: 5.155 ± 1.207
2.577ThrMet: 2.577 ± 0.608
3.866ThrAsn: 3.866 ± 0.927
2.577ThrPro: 2.577 ± 0.976
2.363ThrGln: 2.363 ± 0.852
4.51ThrArg: 4.51 ± 0.846
6.658ThrSer: 6.658 ± 1.55
4.725ThrThr: 4.725 ± 2.125
3.866ThrVal: 3.866 ± 1.868
0.644ThrTrp: 0.644 ± 0.308
2.148ThrTyr: 2.148 ± 0.655
0.0ThrXaa: 0.0 ± 0.0
Val
3.222ValAla: 3.222 ± 0.892
2.148ValCys: 2.148 ± 0.618
3.436ValAsp: 3.436 ± 0.794
3.651ValGlu: 3.651 ± 0.999
1.933ValPhe: 1.933 ± 0.496
4.081ValGly: 4.081 ± 0.82
1.074ValHis: 1.074 ± 0.501
6.229ValIle: 6.229 ± 1.224
2.363ValLys: 2.363 ± 0.761
7.517ValLeu: 7.517 ± 1.367
1.933ValMet: 1.933 ± 0.752
1.933ValAsn: 1.933 ± 0.563
3.007ValPro: 3.007 ± 0.869
1.933ValGln: 1.933 ± 0.728
1.933ValArg: 1.933 ± 0.532
5.584ValSer: 5.584 ± 1.392
4.081ValThr: 4.081 ± 0.865
4.081ValVal: 4.081 ± 1.03
0.644ValTrp: 0.644 ± 0.432
2.577ValTyr: 2.577 ± 0.861
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.43TrpCys: 0.43 ± 0.296
0.43TrpAsp: 0.43 ± 0.27
0.43TrpGlu: 0.43 ± 0.421
0.43TrpPhe: 0.43 ± 0.25
0.43TrpGly: 0.43 ± 0.372
0.43TrpHis: 0.43 ± 0.251
1.289TrpIle: 1.289 ± 0.47
1.074TrpLys: 1.074 ± 0.441
1.718TrpLeu: 1.718 ± 0.696
0.215TrpMet: 0.215 ± 0.311
0.859TrpAsn: 0.859 ± 0.876
0.43TrpPro: 0.43 ± 0.28
0.215TrpGln: 0.215 ± 0.258
0.0TrpArg: 0.0 ± 0.0
0.644TrpSer: 0.644 ± 0.37
0.43TrpThr: 0.43 ± 0.27
1.289TrpVal: 1.289 ± 0.626
0.0TrpTrp: 0.0 ± 0.0
0.859TrpTyr: 0.859 ± 0.393
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.718TyrAla: 1.718 ± 0.898
0.859TyrCys: 0.859 ± 0.352
2.148TyrAsp: 2.148 ± 0.576
1.718TyrGlu: 1.718 ± 0.446
0.859TyrPhe: 0.859 ± 0.393
1.503TyrGly: 1.503 ± 0.502
1.074TyrHis: 1.074 ± 0.412
3.651TyrIle: 3.651 ± 0.665
1.933TyrLys: 1.933 ± 0.487
4.94TyrLeu: 4.94 ± 1.265
0.859TyrMet: 0.859 ± 0.396
3.007TyrAsn: 3.007 ± 1.466
2.577TyrPro: 2.577 ± 0.865
0.215TyrGln: 0.215 ± 0.328
3.007TyrArg: 3.007 ± 1.266
2.577TyrSer: 2.577 ± 0.728
1.718TyrThr: 1.718 ± 0.644
1.074TyrVal: 1.074 ± 0.511
1.074TyrTrp: 1.074 ± 0.985
1.074TyrTyr: 1.074 ± 0.595
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (4657 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
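A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed on each resample, and the spread across resamples is taken as the error. This is a minimal sketch, not the database's actual code; the function names, the use of the population standard deviation as the error measure, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input, not taken from the MPV proteome.
toy_proteins = ["MKLLSA", "LLSAK", "AKLLM"]
err = bootstrap_error(toy_proteins, "LL")
```

A dipeptide that never occurs yields an error of exactly zero under this scheme, which is consistent with the 0.0 ± 0.0 entries in the Xaa rows and columns.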

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski