Amino acid dipepetide frequency for Rhodococcus phage Sleepyhead

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.211AlaAla: 15.211 ± 2.734
0.652AlaCys: 0.652 ± 0.241
8.982AlaAsp: 8.982 ± 0.869
6.881AlaGlu: 6.881 ± 0.913
3.187AlaPhe: 3.187 ± 0.464
9.489AlaGly: 9.489 ± 1.459
1.738AlaHis: 1.738 ± 0.41
5.505AlaIle: 5.505 ± 0.684
4.563AlaLys: 4.563 ± 0.749
8.982AlaLeu: 8.982 ± 0.713
3.042AlaMet: 3.042 ± 0.475
2.608AlaAsn: 2.608 ± 0.435
5.143AlaPro: 5.143 ± 0.597
4.708AlaGln: 4.708 ± 0.724
7.171AlaArg: 7.171 ± 0.682
6.012AlaSer: 6.012 ± 0.844
8.475AlaThr: 8.475 ± 0.88
7.026AlaVal: 7.026 ± 0.689
1.449AlaTrp: 1.449 ± 0.35
2.463AlaTyr: 2.463 ± 0.465
0.0AlaXaa: 0.0 ± 0.0
Cys
0.507CysAla: 0.507 ± 0.192
0.072CysCys: 0.072 ± 0.066
0.435CysAsp: 0.435 ± 0.185
0.072CysGlu: 0.072 ± 0.066
0.145CysPhe: 0.145 ± 0.103
1.738CysGly: 1.738 ± 0.338
0.507CysHis: 0.507 ± 0.189
0.29CysIle: 0.29 ± 0.148
0.217CysLys: 0.217 ± 0.111
0.29CysLeu: 0.29 ± 0.152
0.145CysMet: 0.145 ± 0.084
0.579CysAsn: 0.579 ± 0.235
0.435CysPro: 0.435 ± 0.179
0.29CysGln: 0.29 ± 0.136
0.217CysArg: 0.217 ± 0.123
0.29CysSer: 0.29 ± 0.133
0.652CysThr: 0.652 ± 0.227
0.507CysVal: 0.507 ± 0.214
0.145CysTrp: 0.145 ± 0.099
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
8.764AspAla: 8.764 ± 0.947
0.652AspCys: 0.652 ± 0.201
4.781AspAsp: 4.781 ± 0.802
4.998AspGlu: 4.998 ± 0.708
1.666AspPhe: 1.666 ± 0.281
5.07AspGly: 5.07 ± 0.598
1.666AspHis: 1.666 ± 0.325
3.332AspIle: 3.332 ± 0.539
2.463AspLys: 2.463 ± 0.441
6.519AspLeu: 6.519 ± 0.782
1.159AspMet: 1.159 ± 0.27
2.101AspAsn: 2.101 ± 0.356
5.07AspPro: 5.07 ± 0.733
1.738AspGln: 1.738 ± 0.363
4.925AspArg: 4.925 ± 0.551
3.694AspSer: 3.694 ± 0.537
2.97AspThr: 2.97 ± 0.422
3.839AspVal: 3.839 ± 0.556
0.869AspTrp: 0.869 ± 0.28
1.376AspTyr: 1.376 ± 0.324
0.0AspXaa: 0.0 ± 0.0
Glu
4.491GluAla: 4.491 ± 0.549
0.507GluCys: 0.507 ± 0.162
3.911GluAsp: 3.911 ± 0.594
2.535GluGlu: 2.535 ± 0.551
1.956GluPhe: 1.956 ± 0.354
5.36GluGly: 5.36 ± 0.851
1.014GluHis: 1.014 ± 0.258
3.404GluIle: 3.404 ± 0.418
3.042GluLys: 3.042 ± 0.47
6.446GluLeu: 6.446 ± 0.677
1.376GluMet: 1.376 ± 0.26
2.318GluAsn: 2.318 ± 0.431
3.042GluPro: 3.042 ± 0.659
2.318GluGln: 2.318 ± 0.542
4.418GluArg: 4.418 ± 0.706
3.477GluSer: 3.477 ± 0.527
3.622GluThr: 3.622 ± 0.547
3.911GluVal: 3.911 ± 0.59
1.304GluTrp: 1.304 ± 0.363
0.869GluTyr: 0.869 ± 0.176
0.0GluXaa: 0.0 ± 0.0
Phe
2.825PheAla: 2.825 ± 0.455
0.145PheCys: 0.145 ± 0.107
2.535PheAsp: 2.535 ± 0.379
2.173PheGlu: 2.173 ± 0.348
0.507PhePhe: 0.507 ± 0.156
2.318PheGly: 2.318 ± 0.323
0.579PheHis: 0.579 ± 0.19
1.594PheIle: 1.594 ± 0.336
0.652PheLys: 0.652 ± 0.221
2.101PheLeu: 2.101 ± 0.338
0.435PheMet: 0.435 ± 0.142
1.376PheAsn: 1.376 ± 0.279
1.159PhePro: 1.159 ± 0.249
0.652PheGln: 0.652 ± 0.188
1.956PheArg: 1.956 ± 0.257
1.666PheSer: 1.666 ± 0.365
1.811PheThr: 1.811 ± 0.288
2.752PheVal: 2.752 ± 0.429
0.435PheTrp: 0.435 ± 0.22
0.362PheTyr: 0.362 ± 0.134
0.0PheXaa: 0.0 ± 0.0
Gly
8.33GlyAla: 8.33 ± 1.073
0.652GlyCys: 0.652 ± 0.288
5.143GlyAsp: 5.143 ± 0.559
4.636GlyGlu: 4.636 ± 0.645
2.463GlyPhe: 2.463 ± 0.396
6.664GlyGly: 6.664 ± 1.659
1.521GlyHis: 1.521 ± 0.435
3.984GlyIle: 3.984 ± 0.612
4.491GlyLys: 4.491 ± 0.48
7.243GlyLeu: 7.243 ± 1.047
1.883GlyMet: 1.883 ± 0.38
2.825GlyAsn: 2.825 ± 0.395
3.404GlyPro: 3.404 ± 0.455
3.622GlyGln: 3.622 ± 0.539
5.215GlyArg: 5.215 ± 0.495
6.157GlySer: 6.157 ± 0.673
5.36GlyThr: 5.36 ± 0.867
7.171GlyVal: 7.171 ± 0.833
1.449GlyTrp: 1.449 ± 0.299
1.738GlyTyr: 1.738 ± 0.429
0.0GlyXaa: 0.0 ± 0.0
His
1.738HisAla: 1.738 ± 0.44
0.217HisCys: 0.217 ± 0.127
0.724HisAsp: 0.724 ± 0.286
1.086HisGlu: 1.086 ± 0.231
0.435HisPhe: 0.435 ± 0.205
1.521HisGly: 1.521 ± 0.275
0.942HisHis: 0.942 ± 0.379
0.942HisIle: 0.942 ± 0.3
0.435HisLys: 0.435 ± 0.185
1.883HisLeu: 1.883 ± 0.375
0.29HisMet: 0.29 ± 0.125
0.724HisAsn: 0.724 ± 0.237
1.738HisPro: 1.738 ± 0.469
0.507HisGln: 0.507 ± 0.185
1.449HisArg: 1.449 ± 0.297
1.014HisSer: 1.014 ± 0.272
1.086HisThr: 1.086 ± 0.293
1.086HisVal: 1.086 ± 0.253
0.435HisTrp: 0.435 ± 0.211
0.507HisTyr: 0.507 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
6.953IleAla: 6.953 ± 0.658
0.29IleCys: 0.29 ± 0.161
6.084IleAsp: 6.084 ± 0.74
5.143IleGlu: 5.143 ± 0.736
1.376IlePhe: 1.376 ± 0.237
4.563IleGly: 4.563 ± 0.739
0.797IleHis: 0.797 ± 0.241
1.376IleIle: 1.376 ± 0.284
1.449IleLys: 1.449 ± 0.33
3.259IleLeu: 3.259 ± 0.413
0.507IleMet: 0.507 ± 0.158
1.666IleAsn: 1.666 ± 0.359
2.028IlePro: 2.028 ± 0.33
1.376IleGln: 1.376 ± 0.304
3.622IleArg: 3.622 ± 0.462
2.245IleSer: 2.245 ± 0.36
3.549IleThr: 3.549 ± 0.546
3.187IleVal: 3.187 ± 0.425
0.797IleTrp: 0.797 ± 0.287
0.942IleTyr: 0.942 ± 0.229
0.0IleXaa: 0.0 ± 0.0
Lys
3.622LysAla: 3.622 ± 0.65
0.29LysCys: 0.29 ± 0.143
1.738LysAsp: 1.738 ± 0.351
2.318LysGlu: 2.318 ± 0.511
1.086LysPhe: 1.086 ± 0.34
3.115LysGly: 3.115 ± 0.464
0.507LysHis: 0.507 ± 0.18
1.594LysIle: 1.594 ± 0.321
1.594LysLys: 1.594 ± 0.329
3.042LysLeu: 3.042 ± 0.483
0.942LysMet: 0.942 ± 0.201
1.811LysAsn: 1.811 ± 0.451
2.028LysPro: 2.028 ± 0.404
1.014LysGln: 1.014 ± 0.305
3.404LysArg: 3.404 ± 0.617
2.752LysSer: 2.752 ± 0.463
3.839LysThr: 3.839 ± 0.657
4.056LysVal: 4.056 ± 0.583
0.869LysTrp: 0.869 ± 0.271
1.014LysTyr: 1.014 ± 0.287
0.0LysXaa: 0.0 ± 0.0
Leu
8.837LeuAla: 8.837 ± 0.952
0.942LeuCys: 0.942 ± 0.324
5.505LeuAsp: 5.505 ± 0.726
4.346LeuGlu: 4.346 ± 0.449
2.245LeuPhe: 2.245 ± 0.448
7.533LeuGly: 7.533 ± 0.689
0.724LeuHis: 0.724 ± 0.206
3.404LeuIle: 3.404 ± 0.643
2.463LeuLys: 2.463 ± 0.446
4.418LeuLeu: 4.418 ± 0.546
1.086LeuMet: 1.086 ± 0.232
1.376LeuAsn: 1.376 ± 0.246
4.853LeuPro: 4.853 ± 0.506
2.68LeuGln: 2.68 ± 0.377
5.505LeuArg: 5.505 ± 0.827
5.795LeuSer: 5.795 ± 0.648
5.432LeuThr: 5.432 ± 0.657
6.809LeuVal: 6.809 ± 0.893
1.449LeuTrp: 1.449 ± 0.359
1.738LeuTyr: 1.738 ± 0.451
0.0LeuXaa: 0.0 ± 0.0
Met
2.535MetAla: 2.535 ± 0.38
0.072MetCys: 0.072 ± 0.079
0.724MetAsp: 0.724 ± 0.235
0.797MetGlu: 0.797 ± 0.257
0.652MetPhe: 0.652 ± 0.267
1.159MetGly: 1.159 ± 0.321
0.652MetHis: 0.652 ± 0.207
1.014MetIle: 1.014 ± 0.239
0.724MetLys: 0.724 ± 0.198
1.086MetLeu: 1.086 ± 0.327
0.217MetMet: 0.217 ± 0.111
0.869MetAsn: 0.869 ± 0.26
1.304MetPro: 1.304 ± 0.268
0.579MetGln: 0.579 ± 0.162
1.956MetArg: 1.956 ± 0.416
1.956MetSer: 1.956 ± 0.37
2.318MetThr: 2.318 ± 0.339
1.376MetVal: 1.376 ± 0.276
0.507MetTrp: 0.507 ± 0.183
0.29MetTyr: 0.29 ± 0.204
0.0MetXaa: 0.0 ± 0.0
Asn
4.346AsnAla: 4.346 ± 0.529
0.072AsnCys: 0.072 ± 0.066
1.376AsnAsp: 1.376 ± 0.305
1.883AsnGlu: 1.883 ± 0.356
0.797AsnPhe: 0.797 ± 0.262
3.332AsnGly: 3.332 ± 0.545
0.579AsnHis: 0.579 ± 0.335
1.159AsnIle: 1.159 ± 0.266
1.014AsnLys: 1.014 ± 0.27
2.101AsnLeu: 2.101 ± 0.365
0.942AsnMet: 0.942 ± 0.232
1.014AsnAsn: 1.014 ± 0.318
2.608AsnPro: 2.608 ± 0.423
0.579AsnGln: 0.579 ± 0.203
1.956AsnArg: 1.956 ± 0.398
2.028AsnSer: 2.028 ± 0.385
2.535AsnThr: 2.535 ± 0.452
1.883AsnVal: 1.883 ± 0.494
0.869AsnTrp: 0.869 ± 0.251
0.797AsnTyr: 0.797 ± 0.233
0.0AsnXaa: 0.0 ± 0.0
Pro
5.65ProAla: 5.65 ± 0.655
0.507ProCys: 0.507 ± 0.213
3.766ProAsp: 3.766 ± 0.568
4.563ProGlu: 4.563 ± 0.76
1.159ProPhe: 1.159 ± 0.29
5.215ProGly: 5.215 ± 0.503
0.724ProHis: 0.724 ± 0.244
2.97ProIle: 2.97 ± 0.459
2.608ProLys: 2.608 ± 0.397
3.549ProLeu: 3.549 ± 0.597
1.231ProMet: 1.231 ± 0.31
1.811ProAsn: 1.811 ± 0.376
2.825ProPro: 2.825 ± 0.425
2.318ProGln: 2.318 ± 0.546
2.68ProArg: 2.68 ± 0.529
2.535ProSer: 2.535 ± 0.401
4.056ProThr: 4.056 ± 0.588
3.187ProVal: 3.187 ± 0.454
0.797ProTrp: 0.797 ± 0.186
1.086ProTyr: 1.086 ± 0.266
0.0ProXaa: 0.0 ± 0.0
Gln
5.143GlnAla: 5.143 ± 0.972
0.217GlnCys: 0.217 ± 0.121
1.304GlnAsp: 1.304 ± 0.248
1.594GlnGlu: 1.594 ± 0.32
1.086GlnPhe: 1.086 ± 0.232
2.608GlnGly: 2.608 ± 0.454
0.652GlnHis: 0.652 ± 0.231
2.608GlnIle: 2.608 ± 0.409
1.666GlnLys: 1.666 ± 0.374
3.115GlnLeu: 3.115 ± 0.43
0.942GlnMet: 0.942 ± 0.298
1.086GlnAsn: 1.086 ± 0.232
1.956GlnPro: 1.956 ± 0.359
1.883GlnGln: 1.883 ± 0.438
2.897GlnArg: 2.897 ± 0.426
1.666GlnSer: 1.666 ± 0.381
2.39GlnThr: 2.39 ± 0.481
2.39GlnVal: 2.39 ± 0.432
0.507GlnTrp: 0.507 ± 0.179
0.652GlnTyr: 0.652 ± 0.243
0.0GlnXaa: 0.0 ± 0.0
Arg
7.171ArgAla: 7.171 ± 0.687
0.579ArgCys: 0.579 ± 0.196
4.418ArgAsp: 4.418 ± 0.618
2.825ArgGlu: 2.825 ± 0.547
2.028ArgPhe: 2.028 ± 0.306
4.563ArgGly: 4.563 ± 0.589
1.594ArgHis: 1.594 ± 0.404
4.636ArgIle: 4.636 ± 0.63
3.694ArgLys: 3.694 ± 0.587
5.867ArgLeu: 5.867 ± 0.69
2.535ArgMet: 2.535 ± 0.518
1.956ArgAsn: 1.956 ± 0.417
2.535ArgPro: 2.535 ± 0.467
3.332ArgGln: 3.332 ± 0.452
5.505ArgArg: 5.505 ± 0.76
4.708ArgSer: 4.708 ± 0.73
4.491ArgThr: 4.491 ± 0.626
3.042ArgVal: 3.042 ± 0.414
1.449ArgTrp: 1.449 ± 0.311
2.173ArgTyr: 2.173 ± 0.43
0.0ArgXaa: 0.0 ± 0.0
Ser
8.257SerAla: 8.257 ± 0.882
0.29SerCys: 0.29 ± 0.131
4.998SerAsp: 4.998 ± 0.548
3.911SerGlu: 3.911 ± 0.551
1.811SerPhe: 1.811 ± 0.305
6.229SerGly: 6.229 ± 0.95
0.942SerHis: 0.942 ± 0.279
2.897SerIle: 2.897 ± 0.459
2.535SerLys: 2.535 ± 0.358
4.274SerLeu: 4.274 ± 0.511
1.449SerMet: 1.449 ± 0.229
1.956SerAsn: 1.956 ± 0.416
2.752SerPro: 2.752 ± 0.314
1.956SerGln: 1.956 ± 0.367
3.984SerArg: 3.984 ± 0.563
4.346SerSer: 4.346 ± 0.508
4.563SerThr: 4.563 ± 0.541
4.708SerVal: 4.708 ± 0.544
0.942SerTrp: 0.942 ± 0.224
1.159SerTyr: 1.159 ± 0.362
0.0SerXaa: 0.0 ± 0.0
Thr
7.75ThrAla: 7.75 ± 0.96
0.579ThrCys: 0.579 ± 0.176
3.622ThrAsp: 3.622 ± 0.445
3.766ThrGlu: 3.766 ± 0.497
2.897ThrPhe: 2.897 ± 0.382
5.722ThrGly: 5.722 ± 0.903
1.014ThrHis: 1.014 ± 0.311
3.404ThrIle: 3.404 ± 0.666
2.897ThrLys: 2.897 ± 0.422
5.215ThrLeu: 5.215 ± 0.721
1.376ThrMet: 1.376 ± 0.314
2.318ThrAsn: 2.318 ± 0.52
4.274ThrPro: 4.274 ± 0.658
2.173ThrGln: 2.173 ± 0.342
4.418ThrArg: 4.418 ± 0.638
5.795ThrSer: 5.795 ± 0.679
4.201ThrThr: 4.201 ± 0.595
4.636ThrVal: 4.636 ± 0.616
1.159ThrTrp: 1.159 ± 0.279
1.521ThrTyr: 1.521 ± 0.386
0.0ThrXaa: 0.0 ± 0.0
Val
6.519ValAla: 6.519 ± 0.714
0.435ValCys: 0.435 ± 0.18
5.07ValAsp: 5.07 ± 0.582
3.911ValGlu: 3.911 ± 0.486
1.883ValPhe: 1.883 ± 0.305
5.36ValGly: 5.36 ± 0.63
1.376ValHis: 1.376 ± 0.354
4.708ValIle: 4.708 ± 0.628
2.897ValLys: 2.897 ± 0.445
4.056ValLeu: 4.056 ± 0.645
0.724ValMet: 0.724 ± 0.218
2.535ValAsn: 2.535 ± 0.418
4.491ValPro: 4.491 ± 0.596
2.825ValGln: 2.825 ± 0.383
4.346ValArg: 4.346 ± 0.656
4.853ValSer: 4.853 ± 0.582
5.36ValThr: 5.36 ± 0.955
4.853ValVal: 4.853 ± 0.596
1.304ValTrp: 1.304 ± 0.267
2.173ValTyr: 2.173 ± 0.331
0.0ValXaa: 0.0 ± 0.0
Trp
2.028TrpAla: 2.028 ± 0.426
0.0TrpCys: 0.0 ± 0.0
1.159TrpAsp: 1.159 ± 0.323
0.579TrpGlu: 0.579 ± 0.238
0.217TrpPhe: 0.217 ± 0.124
0.797TrpGly: 0.797 ± 0.191
0.869TrpHis: 0.869 ± 0.251
1.376TrpIle: 1.376 ± 0.331
0.797TrpLys: 0.797 ± 0.27
1.738TrpLeu: 1.738 ± 0.314
0.145TrpMet: 0.145 ± 0.106
0.579TrpAsn: 0.579 ± 0.2
0.652TrpPro: 0.652 ± 0.275
0.942TrpGln: 0.942 ± 0.234
1.304TrpArg: 1.304 ± 0.287
1.449TrpSer: 1.449 ± 0.349
1.086TrpThr: 1.086 ± 0.263
0.942TrpVal: 0.942 ± 0.233
0.072TrpTrp: 0.072 ± 0.072
0.362TrpTyr: 0.362 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.608TyrAla: 2.608 ± 0.309
0.362TyrCys: 0.362 ± 0.154
1.376TyrAsp: 1.376 ± 0.274
1.304TyrGlu: 1.304 ± 0.303
0.435TyrPhe: 0.435 ± 0.166
1.594TyrGly: 1.594 ± 0.273
0.362TyrHis: 0.362 ± 0.163
1.014TyrIle: 1.014 ± 0.299
0.435TyrLys: 0.435 ± 0.136
1.956TyrLeu: 1.956 ± 0.365
0.29TyrMet: 0.29 ± 0.135
0.362TyrAsn: 0.362 ± 0.159
0.942TyrPro: 0.942 ± 0.304
0.869TyrGln: 0.869 ± 0.258
2.245TyrArg: 2.245 ± 0.423
1.521TyrSer: 1.521 ± 0.405
0.942TyrThr: 0.942 ± 0.27
2.318TyrVal: 2.318 ± 0.596
0.29TyrTrp: 0.29 ± 0.187
0.435TyrTyr: 0.435 ± 0.183
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (13807 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski