Amino acid dipeptide frequency for Rhodococcus phage REQ3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
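As an illustration of the counting described above, here is a minimal Python sketch. It assumes one-letter protein sequences, counts overlapping adjacent residue pairs within each protein, and normalises by the total number of pairs counted. The function name `dipeptide_permille` and the normalisation are assumptions made for this example; the page does not state exactly how Proteome-pI computes its values.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption for this sketch: frequencies are normalised by the total
    number of adjacent residue pairs, and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        for first, second in zip(seq, seq[1:]):
            # Map anything outside the 20 standard residues to X (Xaa).
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    # Report all 441 combinations, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}
```

For example, `dipeptide_permille(["AAC", "ACX"])` counts the pairs AA, AC, AC and CX, so AC receives half of the total and every unobserved dipeptide is reported as 0.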

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
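The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. The helper names are hypothetical, and using the uniform 1/400 expectation as the threshold between over- and underrepresented dipeptides is an assumption based on the description above, not a confirmed detail of how the table is coloured.

```python
# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PERMILLE):
    """Classify a per mille frequency relative to the expected value."""
    if value > expected:
        return "over"   # assumed to correspond to red cells
    if value < expected:
        return "under"  # assumed to correspond to blue cells
    return "as expected"
```

With the values from this page, AlaAla at 20.17‰ corresponds to a fraction of about 0.02017 and is classified as overrepresented, while CysCys at 0.162‰ is classified as underrepresented.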

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.17 ± 2.291
AlaCys: 1.296 ± 0.484
AlaAsp: 8.91 ± 1.123
AlaGlu: 7.695 ± 0.819
AlaPhe: 2.997 ± 0.49
AlaGly: 9.721 ± 0.875
AlaHis: 2.187 ± 0.398
AlaIle: 5.751 ± 0.53
AlaLys: 4.779 ± 0.589
AlaLeu: 10.288 ± 0.95
AlaMet: 4.455 ± 0.556
AlaAsn: 4.293 ± 0.52
AlaPro: 5.346 ± 0.594
AlaGln: 4.131 ± 0.692
AlaArg: 7.533 ± 0.751
AlaSer: 6.642 ± 0.861
AlaThr: 8.505 ± 0.94
AlaVal: 8.586 ± 0.83
AlaTrp: 1.863 ± 0.33
AlaTyr: 2.43 ± 0.398
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.053 ± 0.331
CysCys: 0.162 ± 0.109
CysAsp: 0.81 ± 0.289
CysGlu: 0.324 ± 0.17
CysPhe: 0.324 ± 0.16
CysGly: 0.972 ± 0.325
CysHis: 0.162 ± 0.115
CysIle: 0.567 ± 0.306
CysLys: 0.324 ± 0.166
CysLeu: 0.324 ± 0.168
CysMet: 0.162 ± 0.119
CysAsn: 0.0 ± 0.0
CysPro: 0.567 ± 0.248
CysGln: 0.486 ± 0.185
CysArg: 0.729 ± 0.23
CysSer: 1.053 ± 0.284
CysThr: 0.486 ± 0.273
CysVal: 0.486 ± 0.209
CysTrp: 0.081 ± 0.091
CysTyr: 0.162 ± 0.114
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.505 ± 0.788
AspCys: 0.405 ± 0.172
AspAsp: 5.67 ± 0.849
AspGlu: 4.617 ± 0.625
AspPhe: 1.62 ± 0.337
AspGly: 6.966 ± 0.705
AspHis: 1.296 ± 0.305
AspIle: 2.43 ± 0.445
AspLys: 2.511 ± 0.487
AspLeu: 5.67 ± 0.763
AspMet: 1.296 ± 0.434
AspAsn: 1.782 ± 0.345
AspPro: 4.86 ± 0.575
AspGln: 2.187 ± 0.492
AspArg: 4.698 ± 0.548
AspSer: 2.916 ± 0.454
AspThr: 4.131 ± 0.505
AspVal: 2.673 ± 0.464
AspTrp: 0.729 ± 0.208
AspTyr: 2.106 ± 0.433
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.209 ± 0.732
GluCys: 0.486 ± 0.202
GluAsp: 2.835 ± 0.431
GluGlu: 3.807 ± 0.58
GluPhe: 2.025 ± 0.389
GluGly: 3.888 ± 0.623
GluHis: 1.377 ± 0.319
GluIle: 2.997 ± 0.523
GluLys: 2.43 ± 0.529
GluLeu: 6.723 ± 0.785
GluMet: 0.972 ± 0.237
GluAsn: 1.134 ± 0.269
GluPro: 2.43 ± 0.675
GluGln: 3.564 ± 0.507
GluArg: 5.265 ± 0.874
GluSer: 2.997 ± 0.488
GluThr: 3.564 ± 0.506
GluVal: 4.941 ± 0.718
GluTrp: 1.458 ± 0.424
GluTyr: 1.296 ± 0.266
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.43 ± 0.43
PheCys: 0.243 ± 0.132
PheAsp: 2.106 ± 0.388
PheGlu: 1.62 ± 0.376
PhePhe: 0.567 ± 0.186
PheGly: 2.916 ± 0.638
PheHis: 0.891 ± 0.335
PheIle: 0.729 ± 0.241
PheLys: 1.053 ± 0.32
PheLeu: 1.215 ± 0.324
PheMet: 0.729 ± 0.251
PheAsn: 1.215 ± 0.298
PhePro: 1.377 ± 0.312
PheGln: 0.972 ± 0.25
PheArg: 1.701 ± 0.362
PheSer: 1.458 ± 0.352
PheThr: 2.43 ± 0.427
PheVal: 1.296 ± 0.316
PheTrp: 0.162 ± 0.119
PheTyr: 0.648 ± 0.239
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.207 ± 1.232
GlyCys: 0.81 ± 0.274
GlyAsp: 4.779 ± 0.57
GlyGlu: 3.969 ± 0.594
GlyPhe: 2.997 ± 0.508
GlyGly: 7.614 ± 1.056
GlyHis: 1.782 ± 0.401
GlyIle: 4.86 ± 0.694
GlyLys: 3.564 ± 0.579
GlyLeu: 6.156 ± 0.745
GlyMet: 3.078 ± 0.506
GlyAsn: 2.754 ± 0.471
GlyPro: 2.754 ± 0.463
GlyGln: 3.969 ± 0.793
GlyArg: 4.374 ± 0.64
GlySer: 5.427 ± 0.675
GlyThr: 5.67 ± 0.755
GlyVal: 5.832 ± 0.561
GlyTrp: 1.539 ± 0.344
GlyTyr: 3.564 ± 0.63
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.106 ± 0.516
HisCys: 0.567 ± 0.238
HisAsp: 1.782 ± 0.328
HisGlu: 1.782 ± 0.375
HisPhe: 0.405 ± 0.172
HisGly: 1.62 ± 0.358
HisHis: 0.486 ± 0.198
HisIle: 0.972 ± 0.206
HisLys: 0.405 ± 0.177
HisLeu: 1.458 ± 0.318
HisMet: 0.405 ± 0.147
HisAsn: 0.324 ± 0.133
HisPro: 1.215 ± 0.261
HisGln: 0.567 ± 0.243
HisArg: 1.701 ± 0.375
HisSer: 1.053 ± 0.265
HisThr: 0.972 ± 0.308
HisVal: 0.972 ± 0.241
HisTrp: 0.243 ± 0.126
HisTyr: 0.567 ± 0.214
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.832 ± 0.668
IleCys: 0.648 ± 0.183
IleAsp: 4.374 ± 0.595
IleGlu: 3.24 ± 0.469
IlePhe: 1.134 ± 0.273
IleGly: 4.536 ± 0.62
IleHis: 1.134 ± 0.319
IleIle: 1.377 ± 0.348
IleLys: 1.944 ± 0.393
IleLeu: 2.916 ± 0.499
IleMet: 0.567 ± 0.165
IleAsn: 1.134 ± 0.256
IlePro: 2.511 ± 0.385
IleGln: 1.539 ± 0.431
IleArg: 3.24 ± 0.591
IleSer: 1.62 ± 0.355
IleThr: 3.24 ± 0.561
IleVal: 3.726 ± 0.493
IleTrp: 0.81 ± 0.253
IleTyr: 1.215 ± 0.277
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.427 ± 0.731
LysCys: 0.324 ± 0.154
LysAsp: 1.701 ± 0.305
LysGlu: 1.62 ± 0.438
LysPhe: 0.567 ± 0.273
LysGly: 2.997 ± 0.393
LysHis: 0.729 ± 0.229
LysIle: 0.891 ± 0.252
LysLys: 0.972 ± 0.313
LysLeu: 2.997 ± 0.471
LysMet: 0.486 ± 0.186
LysAsn: 0.648 ± 0.224
LysPro: 2.43 ± 0.458
LysGln: 0.972 ± 0.263
LysArg: 3.888 ± 0.671
LysSer: 2.349 ± 0.33
LysThr: 2.592 ± 0.462
LysVal: 2.673 ± 0.385
LysTrp: 0.567 ± 0.156
LysTyr: 0.486 ± 0.193
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.045 ± 0.904
LeuCys: 0.324 ± 0.169
LeuAsp: 6.075 ± 0.747
LeuGlu: 5.589 ± 0.74
LeuPhe: 1.134 ± 0.359
LeuGly: 5.994 ± 0.813
LeuHis: 1.539 ± 0.365
LeuIle: 3.402 ± 0.415
LeuLys: 3.564 ± 0.6
LeuLeu: 4.779 ± 0.57
LeuMet: 1.053 ± 0.307
LeuAsn: 2.106 ± 0.398
LeuPro: 4.617 ± 0.591
LeuGln: 1.863 ± 0.371
LeuArg: 6.642 ± 0.638
LeuSer: 4.941 ± 0.732
LeuThr: 6.156 ± 0.727
LeuVal: 3.807 ± 0.611
LeuTrp: 1.296 ± 0.285
LeuTyr: 1.215 ± 0.281
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.835 ± 0.496
MetCys: 0.405 ± 0.184
MetAsp: 1.215 ± 0.268
MetGlu: 0.972 ± 0.22
MetPhe: 0.729 ± 0.238
MetGly: 1.782 ± 0.477
MetHis: 0.486 ± 0.223
MetIle: 0.243 ± 0.128
MetLys: 0.486 ± 0.174
MetLeu: 1.62 ± 0.333
MetMet: 0.243 ± 0.122
MetAsn: 1.053 ± 0.268
MetPro: 1.863 ± 0.376
MetGln: 0.648 ± 0.229
MetArg: 1.215 ± 0.305
MetSer: 1.62 ± 0.321
MetThr: 3.078 ± 0.572
MetVal: 1.053 ± 0.257
MetTrp: 0.486 ± 0.193
MetTyr: 0.324 ± 0.134
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.455 ± 0.549
AsnCys: 0.162 ± 0.108
AsnAsp: 1.539 ± 0.34
AsnGlu: 1.62 ± 0.371
AsnPhe: 0.567 ± 0.204
AsnGly: 2.835 ± 0.447
AsnHis: 0.405 ± 0.176
AsnIle: 1.539 ± 0.315
AsnLys: 0.972 ± 0.33
AsnLeu: 2.43 ± 0.431
AsnMet: 0.243 ± 0.119
AsnAsn: 0.405 ± 0.24
AsnPro: 3.159 ± 0.486
AsnGln: 0.972 ± 0.393
AsnArg: 1.701 ± 0.35
AsnSer: 1.134 ± 0.255
AsnThr: 2.43 ± 0.435
AsnVal: 2.592 ± 0.572
AsnTrp: 0.405 ± 0.163
AsnTyr: 0.486 ± 0.233
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.776 ± 0.708
ProCys: 0.162 ± 0.108
ProAsp: 4.374 ± 0.687
ProGlu: 3.888 ± 0.558
ProPhe: 1.296 ± 0.286
ProGly: 4.212 ± 0.518
ProHis: 0.648 ± 0.184
ProIle: 2.592 ± 0.486
ProLys: 1.539 ± 0.382
ProLeu: 3.078 ± 0.492
ProMet: 0.81 ± 0.187
ProAsn: 2.592 ± 0.528
ProPro: 3.888 ± 0.718
ProGln: 1.539 ± 0.379
ProArg: 3.321 ± 0.615
ProSer: 3.402 ± 0.494
ProThr: 4.05 ± 0.721
ProVal: 3.321 ± 0.441
ProTrp: 0.891 ± 0.326
ProTyr: 0.972 ± 0.275
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.779 ± 0.758
GlnCys: 0.405 ± 0.185
GlnAsp: 1.944 ± 0.293
GlnGlu: 1.863 ± 0.322
GlnPhe: 1.539 ± 0.413
GlnGly: 2.43 ± 0.475
GlnHis: 0.567 ± 0.178
GlnIle: 1.701 ± 0.274
GlnLys: 1.377 ± 0.35
GlnLeu: 3.078 ± 0.429
GlnMet: 0.891 ± 0.202
GlnAsn: 1.053 ± 0.277
GlnPro: 1.782 ± 0.315
GlnGln: 1.377 ± 0.369
GlnArg: 3.321 ± 0.44
GlnSer: 1.539 ± 0.274
GlnThr: 1.539 ± 0.345
GlnVal: 2.673 ± 0.508
GlnTrp: 0.486 ± 0.213
GlnTyr: 0.81 ± 0.219
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.343 ± 0.824
ArgCys: 1.053 ± 0.275
ArgAsp: 4.293 ± 0.562
ArgGlu: 5.508 ± 0.759
ArgPhe: 1.701 ± 0.327
ArgGly: 5.751 ± 0.778
ArgHis: 1.863 ± 0.479
ArgIle: 4.779 ± 0.695
ArgLys: 2.187 ± 0.415
ArgLeu: 4.86 ± 0.686
ArgMet: 2.025 ± 0.39
ArgAsn: 2.025 ± 0.418
ArgPro: 2.997 ± 0.423
ArgGln: 2.511 ± 0.567
ArgArg: 4.455 ± 0.774
ArgSer: 3.726 ± 0.48
ArgThr: 4.131 ± 0.569
ArgVal: 5.184 ± 0.547
ArgTrp: 1.539 ± 0.341
ArgTyr: 1.863 ± 0.424
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.399 ± 0.782
SerCys: 0.486 ± 0.179
SerAsp: 2.835 ± 0.397
SerGlu: 3.24 ± 0.512
SerPhe: 1.701 ± 0.338
SerGly: 6.156 ± 1.053
SerHis: 0.972 ± 0.345
SerIle: 2.187 ± 0.373
SerLys: 1.944 ± 0.36
SerLeu: 4.374 ± 0.763
SerMet: 1.296 ± 0.338
SerAsn: 2.106 ± 0.455
SerPro: 2.511 ± 0.405
SerGln: 2.106 ± 0.348
SerArg: 3.888 ± 0.528
SerSer: 3.321 ± 0.605
SerThr: 4.779 ± 0.743
SerVal: 3.483 ± 0.678
SerTrp: 1.296 ± 0.373
SerTyr: 1.053 ± 0.368
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.505 ± 0.703
ThrCys: 0.567 ± 0.202
ThrAsp: 4.374 ± 0.616
ThrGlu: 3.564 ± 0.491
ThrPhe: 1.944 ± 0.405
ThrGly: 7.047 ± 0.772
ThrHis: 1.053 ± 0.332
ThrIle: 4.05 ± 0.41
ThrLys: 1.782 ± 0.385
ThrLeu: 5.832 ± 0.616
ThrMet: 1.053 ± 0.332
ThrAsn: 2.43 ± 0.526
ThrPro: 5.103 ± 0.665
ThrGln: 1.377 ± 0.408
ThrArg: 3.888 ± 0.553
ThrSer: 3.726 ± 0.539
ThrThr: 5.346 ± 0.665
ThrVal: 6.075 ± 0.828
ThrTrp: 1.053 ± 0.384
ThrTyr: 1.539 ± 0.365
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.938 ± 0.769
ValCys: 0.324 ± 0.187
ValAsp: 4.05 ± 0.473
ValGlu: 4.05 ± 0.403
ValPhe: 1.62 ± 0.346
ValGly: 4.86 ± 0.668
ValHis: 1.053 ± 0.265
ValIle: 3.564 ± 0.539
ValLys: 2.511 ± 0.446
ValLeu: 4.536 ± 0.515
ValMet: 1.377 ± 0.291
ValAsn: 1.944 ± 0.367
ValPro: 3.402 ± 0.426
ValGln: 3.159 ± 0.439
ValArg: 6.237 ± 0.792
ValSer: 4.698 ± 0.471
ValThr: 4.212 ± 0.795
ValVal: 4.941 ± 0.658
ValTrp: 1.377 ± 0.372
ValTyr: 1.134 ± 0.243
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.62 ± 0.411
TrpCys: 0.081 ± 0.09
TrpAsp: 1.701 ± 0.385
TrpGlu: 0.972 ± 0.257
TrpPhe: 0.324 ± 0.153
TrpGly: 0.972 ± 0.256
TrpHis: 0.405 ± 0.183
TrpIle: 0.891 ± 0.312
TrpLys: 0.486 ± 0.232
TrpLeu: 1.62 ± 0.373
TrpMet: 0.891 ± 0.223
TrpAsn: 0.648 ± 0.246
TrpPro: 0.891 ± 0.238
TrpGln: 0.729 ± 0.243
TrpArg: 1.377 ± 0.358
TrpSer: 0.891 ± 0.283
TrpThr: 1.053 ± 0.271
TrpVal: 1.134 ± 0.261
TrpTrp: 0.324 ± 0.144
TrpTyr: 0.162 ± 0.119
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.025 ± 0.417
TyrCys: 0.324 ± 0.145
TyrAsp: 1.701 ± 0.303
TyrGlu: 1.458 ± 0.333
TyrPhe: 0.567 ± 0.227
TyrGly: 2.43 ± 0.33
TyrHis: 0.567 ± 0.211
TyrIle: 1.215 ± 0.263
TyrLys: 0.486 ± 0.174
TyrLeu: 2.187 ± 0.433
TyrMet: 0.243 ± 0.134
TyrAsn: 0.405 ± 0.176
TyrPro: 0.648 ± 0.206
TyrGln: 0.486 ± 0.206
TyrArg: 1.62 ± 0.351
TyrSer: 1.62 ± 0.371
TyrThr: 1.944 ± 0.458
TyrVal: 1.539 ± 0.286
TyrTrp: 0.567 ± 0.247
TyrTyr: 0.081 ± 0.081
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12346 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
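A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. The function names, the use of the sample standard deviation as the error measure, and the fixed seed are assumptions made for this example; the page only states that 100 bootstrap replicates at the protein level were used.

```python
import random
import statistics


def pair_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide among all adjacent residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.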

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski