Amino acid dipepetide frequency for Streptococcus phage phi29961

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.555AlaAla: 3.555 ± 0.631
0.401AlaCys: 0.401 ± 0.136
2.981AlaAsp: 2.981 ± 0.4
3.841AlaGlu: 3.841 ± 0.476
2.637AlaPhe: 2.637 ± 0.362
3.669AlaGly: 3.669 ± 0.549
0.745AlaHis: 0.745 ± 0.189
4.472AlaIle: 4.472 ± 0.827
5.733AlaLys: 5.733 ± 0.743
6.192AlaLeu: 6.192 ± 0.794
1.663AlaMet: 1.663 ± 0.287
3.497AlaAsn: 3.497 ± 0.484
1.605AlaPro: 1.605 ± 0.29
2.58AlaGln: 2.58 ± 0.435
2.752AlaArg: 2.752 ± 0.429
4.357AlaSer: 4.357 ± 0.505
3.325AlaThr: 3.325 ± 0.57
4.013AlaVal: 4.013 ± 0.694
0.573AlaTrp: 0.573 ± 0.242
2.637AlaTyr: 2.637 ± 0.389
0.0AlaXaa: 0.0 ± 0.0
Cys
0.401CysAla: 0.401 ± 0.175
0.287CysCys: 0.287 ± 0.139
0.516CysAsp: 0.516 ± 0.185
0.459CysGlu: 0.459 ± 0.157
0.229CysPhe: 0.229 ± 0.117
0.745CysGly: 0.745 ± 0.229
0.287CysHis: 0.287 ± 0.142
0.459CysIle: 0.459 ± 0.171
0.803CysLys: 0.803 ± 0.273
0.688CysLeu: 0.688 ± 0.238
0.229CysMet: 0.229 ± 0.138
0.401CysAsn: 0.401 ± 0.151
0.459CysPro: 0.459 ± 0.159
0.573CysGln: 0.573 ± 0.168
0.803CysArg: 0.803 ± 0.203
0.86CysSer: 0.86 ± 0.317
0.172CysThr: 0.172 ± 0.091
0.631CysVal: 0.631 ± 0.2
0.0CysTrp: 0.0 ± 0.0
0.516CysTyr: 0.516 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
2.523AspAla: 2.523 ± 0.393
0.459AspCys: 0.459 ± 0.192
2.695AspAsp: 2.695 ± 0.471
5.619AspGlu: 5.619 ± 0.596
3.44AspPhe: 3.44 ± 0.398
4.185AspGly: 4.185 ± 0.435
0.803AspHis: 0.803 ± 0.258
4.587AspIle: 4.587 ± 0.468
3.841AspLys: 3.841 ± 0.488
5.389AspLeu: 5.389 ± 0.559
1.663AspMet: 1.663 ± 0.287
2.293AspAsn: 2.293 ± 0.43
1.204AspPro: 1.204 ± 0.299
1.892AspGln: 1.892 ± 0.331
2.752AspArg: 2.752 ± 0.457
3.096AspSer: 3.096 ± 0.501
2.867AspThr: 2.867 ± 0.461
3.325AspVal: 3.325 ± 0.439
0.803AspTrp: 0.803 ± 0.192
2.867AspTyr: 2.867 ± 0.416
0.0AspXaa: 0.0 ± 0.0
Glu
4.357GluAla: 4.357 ± 0.533
0.745GluCys: 0.745 ± 0.197
4.013GluAsp: 4.013 ± 0.547
6.307GluGlu: 6.307 ± 0.922
2.924GluPhe: 2.924 ± 0.358
4.357GluGly: 4.357 ± 0.434
1.261GluHis: 1.261 ± 0.269
5.791GluIle: 5.791 ± 1.065
9.231GluLys: 9.231 ± 2.313
8.141GluLeu: 8.141 ± 0.717
1.835GluMet: 1.835 ± 0.425
4.3GluAsn: 4.3 ± 0.63
1.491GluPro: 1.491 ± 0.262
3.784GluGln: 3.784 ± 0.459
3.325GluArg: 3.325 ± 0.43
4.243GluSer: 4.243 ± 0.511
4.472GluThr: 4.472 ± 0.49
3.956GluVal: 3.956 ± 0.402
0.631GluTrp: 0.631 ± 0.219
2.351GluTyr: 2.351 ± 0.324
0.0GluXaa: 0.0 ± 0.0
Phe
2.179PheAla: 2.179 ± 0.373
0.803PheCys: 0.803 ± 0.231
3.096PheAsp: 3.096 ± 0.458
2.924PheGlu: 2.924 ± 0.406
2.007PhePhe: 2.007 ± 0.431
3.325PheGly: 3.325 ± 0.407
0.803PheHis: 0.803 ± 0.232
2.867PheIle: 2.867 ± 0.466
3.44PheLys: 3.44 ± 0.427
3.383PheLeu: 3.383 ± 0.485
0.917PheMet: 0.917 ± 0.22
2.179PheAsn: 2.179 ± 0.304
1.147PhePro: 1.147 ± 0.294
1.892PheGln: 1.892 ± 0.294
1.433PheArg: 1.433 ± 0.233
2.752PheSer: 2.752 ± 0.382
2.007PheThr: 2.007 ± 0.373
2.695PheVal: 2.695 ± 0.394
0.459PheTrp: 0.459 ± 0.136
1.663PheTyr: 1.663 ± 0.384
0.0PheXaa: 0.0 ± 0.0
Gly
3.211GlyAla: 3.211 ± 0.458
0.459GlyCys: 0.459 ± 0.16
3.612GlyAsp: 3.612 ± 0.583
4.185GlyGlu: 4.185 ± 0.489
2.981GlyPhe: 2.981 ± 0.416
4.472GlyGly: 4.472 ± 0.67
1.777GlyHis: 1.777 ± 0.369
4.931GlyIle: 4.931 ± 0.595
5.16GlyLys: 5.16 ± 0.561
5.561GlyLeu: 5.561 ± 0.859
1.72GlyMet: 1.72 ± 0.334
3.555GlyAsn: 3.555 ± 0.603
1.032GlyPro: 1.032 ± 0.196
2.351GlyGln: 2.351 ± 0.472
3.211GlyArg: 3.211 ± 0.485
3.44GlySer: 3.44 ± 0.563
3.727GlyThr: 3.727 ± 0.409
4.243GlyVal: 4.243 ± 0.701
0.631GlyTrp: 0.631 ± 0.156
3.096GlyTyr: 3.096 ± 0.456
0.0GlyXaa: 0.0 ± 0.0
His
0.459HisAla: 0.459 ± 0.146
0.115HisCys: 0.115 ± 0.079
1.319HisAsp: 1.319 ± 0.246
0.975HisGlu: 0.975 ± 0.21
0.86HisPhe: 0.86 ± 0.255
0.975HisGly: 0.975 ± 0.174
0.516HisHis: 0.516 ± 0.193
1.319HisIle: 1.319 ± 0.272
1.261HisLys: 1.261 ± 0.285
1.491HisLeu: 1.491 ± 0.289
0.459HisMet: 0.459 ± 0.15
1.319HisAsn: 1.319 ± 0.243
0.975HisPro: 0.975 ± 0.221
0.86HisGln: 0.86 ± 0.26
0.745HisArg: 0.745 ± 0.185
0.86HisSer: 0.86 ± 0.245
1.032HisThr: 1.032 ± 0.266
1.032HisVal: 1.032 ± 0.215
0.459HisTrp: 0.459 ± 0.136
0.573HisTyr: 0.573 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
4.587IleAla: 4.587 ± 0.461
0.745IleCys: 0.745 ± 0.253
4.759IleAsp: 4.759 ± 0.442
4.759IleGlu: 4.759 ± 0.602
1.835IlePhe: 1.835 ± 0.475
4.357IleGly: 4.357 ± 0.588
1.261IleHis: 1.261 ± 0.252
4.185IleIle: 4.185 ± 0.468
5.905IleLys: 5.905 ± 0.857
5.504IleLeu: 5.504 ± 0.543
1.491IleMet: 1.491 ± 0.299
3.211IleAsn: 3.211 ± 0.5
2.637IlePro: 2.637 ± 0.443
3.268IleGln: 3.268 ± 0.499
2.007IleArg: 2.007 ± 0.378
5.16IleSer: 5.16 ± 0.723
4.071IleThr: 4.071 ± 0.453
3.956IleVal: 3.956 ± 0.528
0.917IleTrp: 0.917 ± 0.296
2.293IleTyr: 2.293 ± 0.382
0.0IleXaa: 0.0 ± 0.0
Lys
6.88LysAla: 6.88 ± 0.851
0.401LysCys: 0.401 ± 0.155
4.529LysAsp: 4.529 ± 0.666
7.969LysGlu: 7.969 ± 1.6
2.809LysPhe: 2.809 ± 0.372
4.587LysGly: 4.587 ± 0.595
1.319LysHis: 1.319 ± 0.277
5.447LysIle: 5.447 ± 0.718
6.536LysLys: 6.536 ± 1.038
6.593LysLeu: 6.593 ± 1.046
1.835LysMet: 1.835 ± 0.338
4.931LysAsn: 4.931 ± 0.66
2.58LysPro: 2.58 ± 0.438
3.841LysGln: 3.841 ± 0.713
4.071LysArg: 4.071 ± 0.532
5.103LysSer: 5.103 ± 0.556
5.045LysThr: 5.045 ± 0.567
5.733LysVal: 5.733 ± 0.657
0.803LysTrp: 0.803 ± 0.177
2.695LysTyr: 2.695 ± 0.412
0.0LysXaa: 0.0 ± 0.0
Leu
5.332LeuAla: 5.332 ± 0.602
0.573LeuCys: 0.573 ± 0.182
5.16LeuAsp: 5.16 ± 0.36
8.829LeuGlu: 8.829 ± 1.222
3.44LeuPhe: 3.44 ± 0.554
5.045LeuGly: 5.045 ± 0.554
1.261LeuHis: 1.261 ± 0.242
5.389LeuIle: 5.389 ± 0.551
7.453LeuLys: 7.453 ± 0.73
7.224LeuLeu: 7.224 ± 0.714
1.835LeuMet: 1.835 ± 0.381
4.243LeuAsn: 4.243 ± 0.404
3.268LeuPro: 3.268 ± 0.455
3.268LeuGln: 3.268 ± 0.363
3.497LeuArg: 3.497 ± 0.527
7.568LeuSer: 7.568 ± 0.726
6.135LeuThr: 6.135 ± 0.662
5.619LeuVal: 5.619 ± 0.467
0.688LeuTrp: 0.688 ± 0.134
2.867LeuTyr: 2.867 ± 0.512
0.0LeuXaa: 0.0 ± 0.0
Met
1.319MetAla: 1.319 ± 0.282
0.115MetCys: 0.115 ± 0.07
1.376MetAsp: 1.376 ± 0.298
1.835MetGlu: 1.835 ± 0.371
0.86MetPhe: 0.86 ± 0.227
1.892MetGly: 1.892 ± 0.325
0.172MetHis: 0.172 ± 0.101
2.293MetIle: 2.293 ± 0.403
1.949MetLys: 1.949 ± 0.353
1.261MetLeu: 1.261 ± 0.235
0.803MetMet: 0.803 ± 0.217
1.147MetAsn: 1.147 ± 0.281
0.631MetPro: 0.631 ± 0.192
0.745MetGln: 0.745 ± 0.214
1.032MetArg: 1.032 ± 0.235
1.72MetSer: 1.72 ± 0.325
2.121MetThr: 2.121 ± 0.413
1.548MetVal: 1.548 ± 0.249
0.115MetTrp: 0.115 ± 0.082
0.516MetTyr: 0.516 ± 0.175
0.0MetXaa: 0.0 ± 0.0
Asn
3.784AsnAla: 3.784 ± 0.437
0.459AsnCys: 0.459 ± 0.18
2.523AsnAsp: 2.523 ± 0.372
2.637AsnGlu: 2.637 ± 0.422
2.236AsnPhe: 2.236 ± 0.39
4.243AsnGly: 4.243 ± 0.451
1.147AsnHis: 1.147 ± 0.275
2.408AsnIle: 2.408 ± 0.338
4.931AsnLys: 4.931 ± 1.074
4.701AsnLeu: 4.701 ± 0.616
1.032AsnMet: 1.032 ± 0.26
2.58AsnAsn: 2.58 ± 0.605
2.523AsnPro: 2.523 ± 0.334
2.236AsnGln: 2.236 ± 0.314
2.408AsnArg: 2.408 ± 0.347
3.096AsnSer: 3.096 ± 0.408
2.867AsnThr: 2.867 ± 0.472
3.211AsnVal: 3.211 ± 0.386
0.803AsnTrp: 0.803 ± 0.178
1.548AsnTyr: 1.548 ± 0.301
0.0AsnXaa: 0.0 ± 0.0
Pro
1.032ProAla: 1.032 ± 0.227
0.287ProCys: 0.287 ± 0.133
2.064ProAsp: 2.064 ± 0.343
2.121ProGlu: 2.121 ± 0.331
1.261ProPhe: 1.261 ± 0.287
1.032ProGly: 1.032 ± 0.314
0.745ProHis: 0.745 ± 0.174
2.179ProIle: 2.179 ± 0.351
3.096ProLys: 3.096 ± 0.501
2.637ProLeu: 2.637 ± 0.352
0.688ProMet: 0.688 ± 0.226
1.147ProAsn: 1.147 ± 0.281
1.089ProPro: 1.089 ± 0.265
1.72ProGln: 1.72 ± 0.251
1.548ProArg: 1.548 ± 0.29
2.695ProSer: 2.695 ± 0.371
1.835ProThr: 1.835 ± 0.315
1.949ProVal: 1.949 ± 0.398
0.229ProTrp: 0.229 ± 0.115
1.663ProTyr: 1.663 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
3.555GlnAla: 3.555 ± 0.532
0.401GlnCys: 0.401 ± 0.153
2.007GlnAsp: 2.007 ± 0.308
3.325GlnGlu: 3.325 ± 0.648
1.72GlnPhe: 1.72 ± 0.22
2.179GlnGly: 2.179 ± 0.295
0.459GlnHis: 0.459 ± 0.146
2.179GlnIle: 2.179 ± 0.373
4.013GlnLys: 4.013 ± 0.493
4.128GlnLeu: 4.128 ± 0.573
1.032GlnMet: 1.032 ± 0.258
2.695GlnAsn: 2.695 ± 0.369
1.204GlnPro: 1.204 ± 0.286
1.663GlnGln: 1.663 ± 0.467
1.433GlnArg: 1.433 ± 0.294
2.809GlnSer: 2.809 ± 0.344
2.752GlnThr: 2.752 ± 0.411
3.325GlnVal: 3.325 ± 0.456
0.401GlnTrp: 0.401 ± 0.166
1.089GlnTyr: 1.089 ± 0.24
0.0GlnXaa: 0.0 ± 0.0
Arg
2.179ArgAla: 2.179 ± 0.399
0.401ArgCys: 0.401 ± 0.164
2.236ArgAsp: 2.236 ± 0.376
3.096ArgGlu: 3.096 ± 0.376
1.72ArgPhe: 1.72 ± 0.315
2.121ArgGly: 2.121 ± 0.32
0.631ArgHis: 0.631 ± 0.188
3.039ArgIle: 3.039 ± 0.37
3.956ArgLys: 3.956 ± 0.576
4.185ArgLeu: 4.185 ± 0.456
1.032ArgMet: 1.032 ± 0.268
2.007ArgAsn: 2.007 ± 0.379
0.917ArgPro: 0.917 ± 0.197
2.293ArgGln: 2.293 ± 0.349
2.408ArgArg: 2.408 ± 0.447
2.408ArgSer: 2.408 ± 0.32
2.408ArgThr: 2.408 ± 0.441
3.555ArgVal: 3.555 ± 0.597
0.975ArgTrp: 0.975 ± 0.289
1.777ArgTyr: 1.777 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
4.071SerAla: 4.071 ± 0.6
0.631SerCys: 0.631 ± 0.198
3.956SerAsp: 3.956 ± 0.492
5.561SerGlu: 5.561 ± 0.578
2.523SerPhe: 2.523 ± 0.438
5.504SerGly: 5.504 ± 0.651
1.72SerHis: 1.72 ± 0.307
4.587SerIle: 4.587 ± 0.599
5.275SerLys: 5.275 ± 0.648
4.644SerLeu: 4.644 ± 0.653
1.261SerMet: 1.261 ± 0.31
3.268SerAsn: 3.268 ± 0.499
2.293SerPro: 2.293 ± 0.383
2.408SerGln: 2.408 ± 0.376
2.924SerArg: 2.924 ± 0.476
4.644SerSer: 4.644 ± 0.672
5.103SerThr: 5.103 ± 0.497
4.3SerVal: 4.3 ± 0.4
1.204SerTrp: 1.204 ± 0.238
2.121SerTyr: 2.121 ± 0.365
0.0SerXaa: 0.0 ± 0.0
Thr
3.841ThrAla: 3.841 ± 0.475
0.459ThrCys: 0.459 ± 0.189
2.637ThrAsp: 2.637 ± 0.371
4.415ThrGlu: 4.415 ± 0.517
2.809ThrPhe: 2.809 ± 0.483
3.784ThrGly: 3.784 ± 0.545
0.688ThrHis: 0.688 ± 0.186
4.415ThrIle: 4.415 ± 0.733
4.415ThrLys: 4.415 ± 0.549
6.192ThrLeu: 6.192 ± 0.524
1.548ThrMet: 1.548 ± 0.239
2.924ThrAsn: 2.924 ± 0.453
2.58ThrPro: 2.58 ± 0.455
1.835ThrGln: 1.835 ± 0.361
2.351ThrArg: 2.351 ± 0.334
4.816ThrSer: 4.816 ± 0.552
3.555ThrThr: 3.555 ± 0.489
4.243ThrVal: 4.243 ± 0.552
0.86ThrTrp: 0.86 ± 0.236
2.236ThrTyr: 2.236 ± 0.362
0.0ThrXaa: 0.0 ± 0.0
Val
4.529ValAla: 4.529 ± 0.455
0.688ValCys: 0.688 ± 0.214
3.899ValAsp: 3.899 ± 0.394
5.504ValGlu: 5.504 ± 0.562
2.637ValPhe: 2.637 ± 0.397
4.185ValGly: 4.185 ± 0.548
1.089ValHis: 1.089 ± 0.209
3.612ValIle: 3.612 ± 0.54
3.096ValLys: 3.096 ± 0.47
6.536ValLeu: 6.536 ± 0.65
1.376ValMet: 1.376 ± 0.28
2.809ValAsn: 2.809 ± 0.431
2.007ValPro: 2.007 ± 0.323
2.637ValGln: 2.637 ± 0.441
2.121ValArg: 2.121 ± 0.378
4.931ValSer: 4.931 ± 0.659
4.243ValThr: 4.243 ± 0.452
4.243ValVal: 4.243 ± 0.579
1.147ValTrp: 1.147 ± 0.274
2.523ValTyr: 2.523 ± 0.444
0.0ValXaa: 0.0 ± 0.0
Trp
0.917TrpAla: 0.917 ± 0.185
0.057TrpCys: 0.057 ± 0.058
0.631TrpAsp: 0.631 ± 0.182
1.204TrpGlu: 1.204 ± 0.209
0.745TrpPhe: 0.745 ± 0.192
0.459TrpGly: 0.459 ± 0.136
0.229TrpHis: 0.229 ± 0.097
0.573TrpIle: 0.573 ± 0.167
0.803TrpLys: 0.803 ± 0.252
1.147TrpLeu: 1.147 ± 0.228
0.287TrpMet: 0.287 ± 0.117
1.204TrpAsn: 1.204 ± 0.304
0.057TrpPro: 0.057 ± 0.056
0.688TrpGln: 0.688 ± 0.164
0.459TrpArg: 0.459 ± 0.185
1.032TrpSer: 1.032 ± 0.278
0.745TrpThr: 0.745 ± 0.203
0.401TrpVal: 0.401 ± 0.144
0.287TrpTrp: 0.287 ± 0.108
0.344TrpTyr: 0.344 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.523TyrAla: 2.523 ± 0.316
1.032TyrCys: 1.032 ± 0.282
2.293TyrAsp: 2.293 ± 0.425
1.949TyrGlu: 1.949 ± 0.276
2.293TyrPhe: 2.293 ± 0.353
2.236TyrGly: 2.236 ± 0.439
0.688TyrHis: 0.688 ± 0.25
2.179TyrIle: 2.179 ± 0.386
2.809TyrLys: 2.809 ± 0.432
3.096TyrLeu: 3.096 ± 0.454
0.688TyrMet: 0.688 ± 0.285
1.663TyrAsn: 1.663 ± 0.315
1.319TyrPro: 1.319 ± 0.227
1.892TyrGln: 1.892 ± 0.303
2.064TyrArg: 2.064 ± 0.361
2.408TyrSer: 2.408 ± 0.443
2.121TyrThr: 2.121 ± 0.402
1.835TyrVal: 1.835 ± 0.29
0.287TyrTrp: 0.287 ± 0.111
1.605TyrTyr: 1.605 ± 0.378
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (17443 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski