Amino acid dipeptide frequency for Moloney murine leukemia virus (isolate Shinnick) (MoMLV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
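The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are overlapping adjacent pairs counted inside each
    protein; a pair never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) pairs every residue with its successor
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input in one-letter code (not MoMLV sequences)
toy_sequences = ["MALLA", "LLK"]
freqs = dipeptide_per_mille(toy_sequences)

# Uniform expectation for the 400 standard dipeptides: 0.25% = 2.5 per mille
uniform_per_mille = 1000.0 / 400

for pair, value in sorted(freqs.items()):
    label = "over" if value > uniform_per_mille else "under"
    print(f"{pair}: {value:.3f} per mille ({label}represented vs. uniform)")
```

With such a tiny input almost every observed pair is far above 2.5‰, which is why a comparison against the uniform expectation is only meaningful for whole proteomes.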

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
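As a small worked example of this unit conversion, the sketch below converts three values taken from the table (LeuLeu, AlaAla, AlaMet) to fractions and percentages. The helper names `per_mille_to_fraction` and `per_mille_to_percent` are hypothetical and exist only for this illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


# Values copied from the table in this document
table_values = {"LeuLeu": 16.667, "AlaAla": 5.782, "AlaMet": 0.0}

for pair, value in table_values.items():
    fraction = per_mille_to_fraction(value)
    percent = per_mille_to_percent(value)
    print(f"{pair}: {value} per mille = {fraction:.6f} = {percent:.4f}%")
```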

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.782 ± 1.424
AlaCys: 0.68 ± 0.288
AlaAsp: 3.061 ± 0.35
AlaGlu: 3.401 ± 1.312
AlaPhe: 4.422 ± 1.536
AlaGly: 6.463 ± 0.731
AlaHis: 2.381 ± 0.44
AlaIle: 1.02 ± 0.602
AlaLys: 3.741 ± 0.782
AlaLeu: 9.524 ± 1.222
AlaMet: 0.0 ± 0.0
AlaAsn: 0.34 ± 0.425
AlaPro: 4.762 ± 0.619
AlaGln: 2.041 ± 1.204
AlaArg: 3.061 ± 0.839
AlaSer: 2.041 ± 0.363
AlaThr: 4.422 ± 0.681
AlaVal: 3.741 ± 0.796
AlaTrp: 1.361 ± 0.36
AlaTyr: 3.741 ± 0.304
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.041 ± 0.363
CysCys: 0.68 ± 0.85
CysAsp: 0.0 ± 0.0
CysGlu: 0.68 ± 0.85
CysPhe: 0.34 ± 0.425
CysGly: 0.34 ± 0.425
CysHis: 0.0 ± 0.0
CysIle: 1.02 ± 1.275
CysLys: 1.361 ± 0.176
CysLeu: 1.361 ± 0.36
CysMet: 0.34 ± 0.425
CysAsn: 1.02 ± 1.275
CysPro: 1.701 ± 0.068
CysGln: 1.701 ± 0.656
CysArg: 0.34 ± 0.201
CysSer: 2.041 ± 1.559
CysThr: 0.0 ± 0.0
CysVal: 0.34 ± 0.425
CysTrp: 0.34 ± 0.425
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.041 ± 0.243
AspCys: 2.381 ± 0.368
AspAsp: 2.721 ± 0.71
AspGlu: 2.041 ± 0.838
AspPhe: 1.701 ± 0.523
AspGly: 3.061 ± 0.678
AspHis: 0.68 ± 0.288
AspIle: 1.701 ± 0.656
AspLys: 1.02 ± 0.257
AspLeu: 8.163 ± 0.751
AspMet: 0.68 ± 0.272
AspAsn: 0.68 ± 0.288
AspPro: 6.463 ± 1.937
AspGln: 4.422 ± 0.183
AspArg: 5.102 ± 0.77
AspSer: 2.381 ± 0.545
AspThr: 1.701 ± 1.003
AspVal: 1.701 ± 0.637
AspTrp: 1.361 ± 0.488
AspTyr: 1.361 ± 0.176
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.803 ± 2.365
GluCys: 0.68 ± 0.85
GluAsp: 4.082 ± 1.572
GluGlu: 4.762 ± 1.12
GluPhe: 0.34 ± 0.201
GluGly: 3.401 ± 1.015
GluHis: 0.68 ± 0.401
GluIle: 2.721 ± 1.605
GluLys: 4.422 ± 1.207
GluLeu: 1.701 ± 0.523
GluMet: 1.361 ± 0.488
GluAsn: 0.68 ± 0.288
GluPro: 3.741 ± 0.304
GluGln: 1.02 ± 0.357
GluArg: 7.483 ± 2.335
GluSer: 2.041 ± 0.743
GluThr: 5.442 ± 0.95
GluVal: 3.061 ± 1.054
GluTrp: 1.02 ± 0.357
GluTyr: 0.68 ± 0.85
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.02 ± 0.602
PheCys: 1.361 ± 0.488
PheAsp: 1.02 ± 0.357
PheGlu: 2.381 ± 0.784
PhePhe: 0.34 ± 0.201
PheGly: 1.361 ± 0.576
PheHis: 0.34 ± 0.201
PheIle: 1.361 ± 0.176
PheLys: 0.34 ± 0.201
PheLeu: 2.721 ± 0.639
PheMet: 0.0 ± 0.0
PheAsn: 2.721 ± 0.71
PhePro: 2.721 ± 0.977
PheGln: 0.34 ± 0.425
PheArg: 0.34 ± 0.201
PheSer: 3.061 ± 0.678
PheThr: 1.361 ± 0.576
PheVal: 1.701 ± 0.068
PheTrp: 0.0 ± 0.0
PheTyr: 1.02 ± 1.275
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.401 ± 0.529
GlyCys: 0.68 ± 0.85
GlyAsp: 2.041 ± 0.414
GlyGlu: 3.061 ± 1.242
GlyPhe: 1.361 ± 0.36
GlyGly: 6.463 ± 0.606
GlyHis: 3.401 ± 0.703
GlyIle: 3.401 ± 0.444
GlyLys: 2.721 ± 1.22
GlyLeu: 6.803 ± 3.422
GlyMet: 0.68 ± 0.401
GlyAsn: 2.381 ± 0.368
GlyPro: 9.864 ± 1.989
GlyGln: 6.803 ± 1.431
GlyArg: 4.422 ± 1.207
GlySer: 2.381 ± 0.784
GlyThr: 6.463 ± 0.933
GlyVal: 1.02 ± 0.698
GlyTrp: 1.701 ± 0.068
GlyTyr: 1.361 ± 0.802
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.68 ± 0.401
HisCys: 0.68 ± 0.401
HisAsp: 0.0 ± 0.0
HisGlu: 0.68 ± 0.401
HisPhe: 0.68 ± 0.401
HisGly: 2.041 ± 0.243
HisHis: 0.34 ± 0.425
HisIle: 0.68 ± 0.401
HisLys: 1.02 ± 1.275
HisLeu: 1.701 ± 0.656
HisMet: 0.0 ± 0.0
HisAsn: 1.02 ± 0.357
HisPro: 3.061 ± 0.553
HisGln: 3.061 ± 0.508
HisArg: 2.041 ± 0.243
HisSer: 1.701 ± 0.523
HisThr: 1.02 ± 0.698
HisVal: 0.68 ± 0.401
HisTrp: 1.701 ± 0.637
HisTyr: 1.701 ± 0.068
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.041 ± 0.243
IleCys: 0.34 ± 0.201
IleAsp: 1.701 ± 0.656
IleGlu: 2.041 ± 0.243
IlePhe: 1.02 ± 0.357
IleGly: 1.701 ± 1.142
IleHis: 1.701 ± 1.003
IleIle: 0.68 ± 0.401
IleLys: 3.061 ± 1.415
IleLeu: 3.401 ± 1.015
IleMet: 0.34 ± 0.425
IleAsn: 0.0 ± 0.0
IlePro: 1.02 ± 0.257
IleGln: 0.68 ± 0.401
IleArg: 1.701 ± 0.544
IleSer: 1.701 ± 1.542
IleThr: 3.741 ± 0.454
IleVal: 1.02 ± 0.371
IleTrp: 1.02 ± 0.357
IleTyr: 0.68 ± 0.401
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.422 ± 2.205
LysCys: 0.0 ± 0.0
LysAsp: 3.401 ± 0.137
LysGlu: 6.122 ± 1.12
LysPhe: 0.0 ± 0.0
LysGly: 2.381 ± 0.831
LysHis: 0.0 ± 0.0
LysIle: 1.701 ± 0.637
LysLys: 3.741 ± 1.492
LysLeu: 6.463 ± 0.369
LysMet: 0.68 ± 0.401
LysAsn: 3.061 ± 0.553
LysPro: 7.143 ± 1.174
LysGln: 3.401 ± 1.312
LysArg: 2.721 ± 0.425
LysSer: 2.381 ± 1.816
LysThr: 2.381 ± 0.22
LysVal: 3.741 ± 0.893
LysTrp: 0.34 ± 0.425
LysTyr: 1.02 ± 0.602
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.143 ± 1.008
LeuCys: 1.701 ± 2.125
LeuAsp: 4.762 ± 0.88
LeuGlu: 5.782 ± 1.373
LeuPhe: 3.401 ± 1.44
LeuGly: 7.823 ± 1.373
LeuHis: 2.041 ± 1.204
LeuIle: 5.442 ± 0.705
LeuLys: 6.463 ± 0.677
LeuLeu: 16.667 ± 2.724
LeuMet: 1.02 ± 1.275
LeuAsn: 3.061 ± 1.55
LeuPro: 5.782 ± 1.731
LeuGln: 4.422 ± 0.681
LeuArg: 4.422 ± 1.151
LeuSer: 5.442 ± 0.727
LeuThr: 15.986 ± 1.409
LeuVal: 6.463 ± 1.961
LeuTrp: 1.02 ± 0.698
LeuTyr: 3.401 ± 0.715
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.721 ± 0.631
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 3.401 ± 0.529
MetHis: 0.0 ± 0.0
MetIle: 0.34 ± 0.425
MetLys: 0.34 ± 0.201
MetLeu: 1.02 ± 0.698
MetMet: 0.0 ± 0.0
MetAsn: 0.34 ± 0.201
MetPro: 0.68 ± 0.401
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.701 ± 0.637
MetThr: 0.68 ± 0.401
MetVal: 0.34 ± 0.201
MetTrp: 0.34 ± 0.425
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.361 ± 0.488
AsnCys: 0.68 ± 0.288
AsnAsp: 0.34 ± 0.201
AsnGlu: 1.701 ± 0.536
AsnPhe: 0.34 ± 0.201
AsnGly: 2.041 ± 0.414
AsnHis: 1.361 ± 0.735
AsnIle: 1.02 ± 0.257
AsnLys: 2.721 ± 0.352
AsnLeu: 3.741 ± 1.902
AsnMet: 0.0 ± 0.0
AsnAsn: 2.041 ± 0.743
AsnPro: 3.061 ± 0.508
AsnGln: 1.02 ± 0.371
AsnArg: 3.741 ± 1.902
AsnSer: 1.02 ± 0.602
AsnThr: 1.361 ± 0.576
AsnVal: 1.701 ± 0.637
AsnTrp: 1.02 ± 0.357
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.401 ± 0.137
ProCys: 1.701 ± 0.979
ProAsp: 8.163 ± 0.97
ProGlu: 2.381 ± 0.831
ProPhe: 2.041 ± 0.414
ProGly: 6.463 ± 0.371
ProHis: 2.721 ± 0.212
ProIle: 0.68 ± 0.85
ProLys: 3.401 ± 0.556
ProLeu: 12.585 ± 0.396
ProMet: 1.701 ± 0.367
ProAsn: 2.041 ± 0.414
ProPro: 12.925 ± 2.836
ProGln: 2.721 ± 0.977
ProArg: 7.823 ± 1.412
ProSer: 7.483 ± 1.57
ProThr: 5.442 ± 0.85
ProVal: 4.762 ± 0.872
ProTrp: 1.701 ± 0.068
ProTyr: 4.762 ± 1.201
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.782 ± 0.652
GlnCys: 0.68 ± 0.31
GlnAsp: 1.701 ± 0.544
GlnGlu: 2.721 ± 0.639
GlnPhe: 1.361 ± 0.576
GlnGly: 5.102 ± 1.079
GlnHis: 1.701 ± 0.523
GlnIle: 0.68 ± 0.401
GlnLys: 3.061 ± 0.553
GlnLeu: 5.782 ± 0.867
GlnMet: 0.0 ± 0.0
GlnAsn: 2.041 ± 0.363
GlnPro: 3.401 ± 1.039
GlnGln: 2.721 ± 1.212
GlnArg: 2.721 ± 0.639
GlnSer: 2.381 ± 0.368
GlnThr: 3.061 ± 0.177
GlnVal: 4.762 ± 0.619
GlnTrp: 0.34 ± 0.201
GlnTyr: 1.701 ± 0.068
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.381 ± 0.44
ArgCys: 0.34 ± 0.425
ArgAsp: 6.463 ± 1.367
ArgGlu: 6.803 ± 1.392
ArgPhe: 0.68 ± 0.288
ArgGly: 5.102 ± 0.896
ArgHis: 1.361 ± 0.176
ArgIle: 2.381 ± 0.368
ArgLys: 2.721 ± 0.977
ArgLeu: 7.143 ± 0.659
ArgMet: 1.361 ± 0.802
ArgAsn: 1.02 ± 0.357
ArgPro: 5.442 ± 1.383
ArgGln: 2.381 ± 0.368
ArgArg: 8.844 ± 2.595
ArgSer: 4.422 ± 0.87
ArgThr: 1.361 ± 0.488
ArgVal: 3.741 ± 0.466
ArgTrp: 2.041 ± 0.838
ArgTyr: 1.701 ± 0.068
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.762 ± 1.138
SerCys: 0.34 ± 0.425
SerAsp: 3.061 ± 0.678
SerGlu: 2.381 ± 0.22
SerPhe: 2.381 ± 0.368
SerGly: 4.422 ± 0.566
SerHis: 0.68 ± 0.401
SerIle: 1.02 ± 0.257
SerLys: 3.401 ± 0.444
SerLeu: 5.782 ± 1.042
SerMet: 1.02 ± 0.371
SerAsn: 1.701 ± 1.542
SerPro: 7.823 ± 2.823
SerGln: 3.061 ± 0.553
SerArg: 2.381 ± 0.22
SerSer: 5.442 ± 1.446
SerThr: 4.422 ± 1.745
SerVal: 2.721 ± 0.886
SerTrp: 0.68 ± 0.85
SerTyr: 1.02 ± 1.275
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.401 ± 0.444
ThrCys: 0.34 ± 0.201
ThrAsp: 3.061 ± 0.874
ThrGlu: 4.422 ± 1.041
ThrPhe: 3.401 ± 0.703
ThrGly: 5.102 ± 1.891
ThrHis: 2.721 ± 0.212
ThrIle: 1.02 ± 0.698
ThrLys: 3.741 ± 1.681
ThrLeu: 7.483 ± 0.589
ThrMet: 1.02 ± 0.698
ThrAsn: 2.381 ± 0.368
ThrPro: 8.163 ± 0.86
ThrGln: 5.782 ± 1.574
ThrArg: 2.041 ± 1.204
ThrSer: 6.803 ± 1.824
ThrThr: 5.782 ± 2.462
ThrVal: 4.422 ± 0.681
ThrTrp: 3.401 ± 0.137
ThrTyr: 0.68 ± 0.85
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.741 ± 0.796
ValCys: 0.68 ± 0.85
ValAsp: 3.061 ± 0.553
ValGlu: 2.381 ± 0.22
ValPhe: 1.02 ± 0.357
ValGly: 1.361 ± 0.488
ValHis: 1.701 ± 0.637
ValIle: 1.02 ± 0.257
ValLys: 5.102 ± 0.77
ValLeu: 6.803 ± 0.778
ValMet: 1.02 ± 0.602
ValAsn: 1.701 ± 0.507
ValPro: 2.721 ± 0.212
ValGln: 3.061 ± 1.067
ValArg: 2.721 ± 0.425
ValSer: 3.741 ± 0.304
ValThr: 5.782 ± 0.667
ValVal: 2.381 ± 0.22
ValTrp: 1.02 ± 0.257
ValTyr: 1.361 ± 0.576
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.041 ± 0.363
TrpCys: 0.34 ± 0.425
TrpAsp: 1.701 ± 0.536
TrpGlu: 1.361 ± 0.176
TrpPhe: 0.68 ± 0.85
TrpGly: 1.361 ± 1.119
TrpHis: 0.0 ± 0.0
TrpIle: 1.02 ± 0.602
TrpLys: 2.381 ± 0.368
TrpLeu: 1.361 ± 0.36
TrpMet: 0.0 ± 0.0
TrpAsn: 0.68 ± 0.288
TrpPro: 3.061 ± 0.701
TrpGln: 0.68 ± 0.31
TrpArg: 1.361 ± 0.802
TrpSer: 0.0 ± 0.0
TrpThr: 1.361 ± 0.576
TrpVal: 2.721 ± 0.977
TrpTrp: 0.34 ± 0.425
TrpTyr: 0.68 ± 0.401
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.02 ± 0.257
TyrCys: 1.361 ± 0.735
TyrAsp: 1.02 ± 0.357
TyrGlu: 1.02 ± 0.698
TyrPhe: 0.0 ± 0.0
TyrGly: 1.02 ± 0.257
TyrHis: 0.68 ± 0.85
TyrIle: 0.68 ± 0.31
TyrLys: 0.34 ± 0.201
TyrLeu: 2.041 ± 0.705
TyrMet: 0.34 ± 0.201
TyrAsn: 1.361 ± 0.176
TyrPro: 1.701 ± 0.656
TyrGln: 2.381 ± 0.784
TyrArg: 4.422 ± 1.626
TyrSer: 0.34 ± 0.425
TyrThr: 3.401 ± 1.312
TyrVal: 1.361 ± 1.119
TyrTrp: 2.381 ± 1.382
TyrTyr: 1.02 ± 0.698
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2941 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
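A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration only: the page does not say which statistic the ± value represents, so treating it as the standard deviation over 100 resamplings is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Per-mille frequency of each overlapping dipeptide, counted within proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency by resampling whole proteins.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code (not MoMLV sequences)
toy_sequences = ["MALLA", "LLK", "GGGG"]
print("LL error:", bootstrap_error(toy_sequences, "LL"))
print("WW error:", bootstrap_error(toy_sequences, "WW"))
```

A dipeptide that never occurs in any protein has a frequency of zero in every replicate, so its estimated error is also zero, which is consistent with the `0.0 ± 0.0` entries in the table. With only 3 proteins, the resampled sets differ strongly from one another, so relatively large errors are to be expected.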

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski