Amino acid dipepetide frequency for Staphylococcus virus phiETA3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.659AlaAla: 0.659 ± 0.251
0.293AlaCys: 0.293 ± 0.136
3.224AlaAsp: 3.224 ± 0.469
4.397AlaGlu: 4.397 ± 0.54
3.078AlaPhe: 3.078 ± 0.595
3.151AlaGly: 3.151 ± 0.583
1.246AlaHis: 1.246 ± 0.328
5.276AlaIle: 5.276 ± 0.629
6.009AlaLys: 6.009 ± 0.612
4.103AlaLeu: 4.103 ± 0.636
1.392AlaMet: 1.392 ± 0.381
3.517AlaAsn: 3.517 ± 0.518
1.539AlaPro: 1.539 ± 0.313
2.125AlaGln: 2.125 ± 0.403
2.052AlaArg: 2.052 ± 0.36
3.664AlaSer: 3.664 ± 0.659
3.444AlaThr: 3.444 ± 0.592
3.957AlaVal: 3.957 ± 0.748
0.733AlaTrp: 0.733 ± 0.349
2.491AlaTyr: 2.491 ± 0.395
0.0AlaXaa: 0.0 ± 0.0
Cys
0.147CysAla: 0.147 ± 0.097
0.0CysCys: 0.0 ± 0.0
0.293CysAsp: 0.293 ± 0.156
0.293CysGlu: 0.293 ± 0.179
0.366CysPhe: 0.366 ± 0.172
0.293CysGly: 0.293 ± 0.142
0.073CysHis: 0.073 ± 0.073
0.073CysIle: 0.073 ± 0.065
0.513CysLys: 0.513 ± 0.153
0.22CysLeu: 0.22 ± 0.121
0.073CysMet: 0.073 ± 0.063
0.293CysAsn: 0.293 ± 0.142
0.147CysPro: 0.147 ± 0.103
0.22CysGln: 0.22 ± 0.12
0.366CysArg: 0.366 ± 0.156
0.366CysSer: 0.366 ± 0.195
0.147CysThr: 0.147 ± 0.104
0.22CysVal: 0.22 ± 0.133
0.073CysTrp: 0.073 ± 0.075
0.293CysTyr: 0.293 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
3.81AspAla: 3.81 ± 0.692
0.293AspCys: 0.293 ± 0.145
4.763AspAsp: 4.763 ± 0.73
4.763AspGlu: 4.763 ± 0.596
4.323AspPhe: 4.323 ± 0.813
3.81AspGly: 3.81 ± 0.542
0.366AspHis: 0.366 ± 0.162
5.056AspIle: 5.056 ± 0.61
6.302AspLys: 6.302 ± 0.904
5.716AspLeu: 5.716 ± 0.589
1.392AspMet: 1.392 ± 0.293
2.931AspAsn: 2.931 ± 0.478
1.466AspPro: 1.466 ± 0.277
0.953AspGln: 0.953 ± 0.234
2.418AspArg: 2.418 ± 0.452
4.47AspSer: 4.47 ± 0.479
3.371AspThr: 3.371 ± 0.429
3.884AspVal: 3.884 ± 0.631
0.953AspTrp: 0.953 ± 0.306
3.444AspTyr: 3.444 ± 0.418
0.0AspXaa: 0.0 ± 0.0
Glu
4.616GluAla: 4.616 ± 0.625
0.366GluCys: 0.366 ± 0.165
4.543GluAsp: 4.543 ± 0.761
5.862GluGlu: 5.862 ± 0.904
2.858GluPhe: 2.858 ± 0.507
3.297GluGly: 3.297 ± 0.495
1.759GluHis: 1.759 ± 0.309
5.642GluIle: 5.642 ± 0.691
6.741GluLys: 6.741 ± 0.851
7.328GluLeu: 7.328 ± 0.923
2.638GluMet: 2.638 ± 0.463
4.397GluAsn: 4.397 ± 0.502
1.466GluPro: 1.466 ± 0.317
4.103GluGln: 4.103 ± 0.721
3.371GluArg: 3.371 ± 0.539
4.323GluSer: 4.323 ± 0.627
3.078GluThr: 3.078 ± 0.396
5.422GluVal: 5.422 ± 0.775
1.099GluTrp: 1.099 ± 0.29
4.543GluTyr: 4.543 ± 0.747
0.0GluXaa: 0.0 ± 0.0
Phe
2.198PheAla: 2.198 ± 0.374
0.22PheCys: 0.22 ± 0.119
4.177PheAsp: 4.177 ± 0.516
3.444PheGlu: 3.444 ± 0.655
1.099PhePhe: 1.099 ± 0.221
2.858PheGly: 2.858 ± 0.561
0.733PheHis: 0.733 ± 0.216
3.224PheIle: 3.224 ± 0.517
4.983PheLys: 4.983 ± 0.575
2.858PheLeu: 2.858 ± 0.442
1.246PheMet: 1.246 ± 0.26
3.224PheAsn: 3.224 ± 0.416
1.246PhePro: 1.246 ± 0.368
1.099PheGln: 1.099 ± 0.308
1.685PheArg: 1.685 ± 0.269
2.858PheSer: 2.858 ± 0.519
3.004PheThr: 3.004 ± 0.416
2.272PheVal: 2.272 ± 0.486
0.366PheTrp: 0.366 ± 0.141
1.978PheTyr: 1.978 ± 0.383
0.0PheXaa: 0.0 ± 0.0
Gly
3.884GlyAla: 3.884 ± 0.572
0.293GlyCys: 0.293 ± 0.118
4.103GlyAsp: 4.103 ± 0.571
2.931GlyGlu: 2.931 ± 0.438
2.858GlyPhe: 2.858 ± 0.426
2.931GlyGly: 2.931 ± 0.583
1.685GlyHis: 1.685 ± 0.455
4.763GlyIle: 4.763 ± 0.564
4.763GlyLys: 4.763 ± 0.571
4.47GlyLeu: 4.47 ± 0.659
1.978GlyMet: 1.978 ± 0.407
3.224GlyAsn: 3.224 ± 0.437
0.44GlyPro: 0.44 ± 0.211
2.565GlyGln: 2.565 ± 0.503
2.638GlyArg: 2.638 ± 0.461
2.784GlySer: 2.784 ± 0.469
4.25GlyThr: 4.25 ± 0.493
4.983GlyVal: 4.983 ± 0.9
0.659GlyTrp: 0.659 ± 0.249
2.858GlyTyr: 2.858 ± 0.43
0.0GlyXaa: 0.0 ± 0.0
His
1.466HisAla: 1.466 ± 0.294
0.073HisCys: 0.073 ± 0.078
0.806HisAsp: 0.806 ± 0.243
0.879HisGlu: 0.879 ± 0.247
0.953HisPhe: 0.953 ± 0.227
1.319HisGly: 1.319 ± 0.284
0.44HisHis: 0.44 ± 0.175
1.759HisIle: 1.759 ± 0.462
1.026HisLys: 1.026 ± 0.243
1.392HisLeu: 1.392 ± 0.299
0.44HisMet: 0.44 ± 0.187
1.026HisAsn: 1.026 ± 0.289
1.026HisPro: 1.026 ± 0.348
0.659HisGln: 0.659 ± 0.254
0.806HisArg: 0.806 ± 0.232
1.466HisSer: 1.466 ± 0.342
1.466HisThr: 1.466 ± 0.334
0.879HisVal: 0.879 ± 0.244
0.073HisTrp: 0.073 ± 0.071
0.953HisTyr: 0.953 ± 0.388
0.0HisXaa: 0.0 ± 0.0
Ile
4.323IleAla: 4.323 ± 0.701
0.22IleCys: 0.22 ± 0.126
4.763IleAsp: 4.763 ± 0.678
7.767IleGlu: 7.767 ± 0.849
3.151IlePhe: 3.151 ± 0.497
5.203IleGly: 5.203 ± 0.949
1.392IleHis: 1.392 ± 0.377
2.931IleIle: 2.931 ± 0.472
7.621IleLys: 7.621 ± 0.699
4.91IleLeu: 4.91 ± 0.566
2.565IleMet: 2.565 ± 0.439
4.763IleAsn: 4.763 ± 0.579
2.125IlePro: 2.125 ± 0.323
2.858IleGln: 2.858 ± 0.413
4.03IleArg: 4.03 ± 0.581
4.177IleSer: 4.177 ± 0.474
4.397IleThr: 4.397 ± 0.582
3.81IleVal: 3.81 ± 0.441
0.806IleTrp: 0.806 ± 0.295
2.638IleTyr: 2.638 ± 0.492
0.0IleXaa: 0.0 ± 0.0
Lys
5.642LysAla: 5.642 ± 0.691
0.293LysCys: 0.293 ± 0.159
5.422LysAsp: 5.422 ± 0.686
8.207LysGlu: 8.207 ± 0.885
3.444LysPhe: 3.444 ± 0.444
5.716LysGly: 5.716 ± 0.679
1.832LysHis: 1.832 ± 0.387
6.155LysIle: 6.155 ± 0.692
7.328LysLys: 7.328 ± 0.957
6.302LysLeu: 6.302 ± 0.717
2.858LysMet: 2.858 ± 0.431
5.203LysAsn: 5.203 ± 0.707
3.224LysPro: 3.224 ± 0.58
4.177LysGln: 4.177 ± 0.618
3.884LysArg: 3.884 ± 0.641
6.009LysSer: 6.009 ± 0.737
4.69LysThr: 4.69 ± 0.636
4.836LysVal: 4.836 ± 0.678
0.879LysTrp: 0.879 ± 0.215
4.25LysTyr: 4.25 ± 0.613
0.0LysXaa: 0.0 ± 0.0
Leu
4.323LeuAla: 4.323 ± 0.638
0.44LeuCys: 0.44 ± 0.247
5.276LeuAsp: 5.276 ± 0.661
6.009LeuGlu: 6.009 ± 0.946
4.03LeuPhe: 4.03 ± 0.503
3.371LeuGly: 3.371 ± 0.589
1.246LeuHis: 1.246 ± 0.306
4.983LeuIle: 4.983 ± 0.461
6.522LeuLys: 6.522 ± 0.527
5.716LeuLeu: 5.716 ± 0.711
1.978LeuMet: 1.978 ± 0.4
4.763LeuAsn: 4.763 ± 0.514
2.198LeuPro: 2.198 ± 0.331
2.784LeuGln: 2.784 ± 0.437
3.591LeuArg: 3.591 ± 0.566
4.836LeuSer: 4.836 ± 0.545
4.763LeuThr: 4.763 ± 0.648
3.884LeuVal: 3.884 ± 0.483
0.513LeuTrp: 0.513 ± 0.226
4.397LeuTyr: 4.397 ± 0.594
0.0LeuXaa: 0.0 ± 0.0
Met
2.491MetAla: 2.491 ± 0.45
0.147MetCys: 0.147 ± 0.101
0.879MetAsp: 0.879 ± 0.237
1.319MetGlu: 1.319 ± 0.313
1.026MetPhe: 1.026 ± 0.279
1.099MetGly: 1.099 ± 0.257
0.586MetHis: 0.586 ± 0.201
1.539MetIle: 1.539 ± 0.358
1.978MetLys: 1.978 ± 0.391
1.685MetLeu: 1.685 ± 0.292
0.586MetMet: 0.586 ± 0.239
2.272MetAsn: 2.272 ± 0.294
0.953MetPro: 0.953 ± 0.254
1.099MetGln: 1.099 ± 0.297
1.026MetArg: 1.026 ± 0.292
1.905MetSer: 1.905 ± 0.428
3.004MetThr: 3.004 ± 0.46
0.806MetVal: 0.806 ± 0.218
0.586MetTrp: 0.586 ± 0.171
0.659MetTyr: 0.659 ± 0.228
0.0MetXaa: 0.0 ± 0.0
Asn
3.957AsnAla: 3.957 ± 0.585
0.147AsnCys: 0.147 ± 0.105
4.47AsnAsp: 4.47 ± 0.603
6.009AsnGlu: 6.009 ± 0.749
2.711AsnPhe: 2.711 ± 0.526
4.763AsnGly: 4.763 ± 0.65
0.879AsnHis: 0.879 ± 0.262
4.323AsnIle: 4.323 ± 0.511
4.983AsnLys: 4.983 ± 0.595
3.371AsnLeu: 3.371 ± 0.529
1.172AsnMet: 1.172 ± 0.284
5.422AsnAsn: 5.422 ± 0.967
2.272AsnPro: 2.272 ± 0.392
2.272AsnGln: 2.272 ± 0.327
2.125AsnArg: 2.125 ± 0.359
3.297AsnSer: 3.297 ± 0.449
3.517AsnThr: 3.517 ± 0.501
4.323AsnVal: 4.323 ± 0.554
0.806AsnTrp: 0.806 ± 0.222
3.004AsnTyr: 3.004 ± 0.511
0.0AsnXaa: 0.0 ± 0.0
Pro
1.099ProAla: 1.099 ± 0.275
0.0ProCys: 0.0 ± 0.0
1.612ProAsp: 1.612 ± 0.275
2.198ProGlu: 2.198 ± 0.435
1.539ProPhe: 1.539 ± 0.376
2.052ProGly: 2.052 ± 0.462
0.44ProHis: 0.44 ± 0.151
2.198ProIle: 2.198 ± 0.471
2.931ProLys: 2.931 ± 0.552
1.685ProLeu: 1.685 ± 0.429
0.659ProMet: 0.659 ± 0.222
1.905ProAsn: 1.905 ± 0.36
0.293ProPro: 0.293 ± 0.141
0.733ProGln: 0.733 ± 0.21
0.806ProArg: 0.806 ± 0.25
1.539ProSer: 1.539 ± 0.34
2.272ProThr: 2.272 ± 0.352
1.172ProVal: 1.172 ± 0.225
0.147ProTrp: 0.147 ± 0.112
1.905ProTyr: 1.905 ± 0.406
0.0ProXaa: 0.0 ± 0.0
Gln
2.858GlnAla: 2.858 ± 0.511
0.366GlnCys: 0.366 ± 0.172
1.978GlnAsp: 1.978 ± 0.394
2.858GlnGlu: 2.858 ± 0.487
1.466GlnPhe: 1.466 ± 0.345
1.978GlnGly: 1.978 ± 0.379
0.879GlnHis: 0.879 ± 0.208
3.078GlnIle: 3.078 ± 0.435
3.297GlnLys: 3.297 ± 0.489
2.638GlnLeu: 2.638 ± 0.468
0.879GlnMet: 0.879 ± 0.308
2.491GlnAsn: 2.491 ± 0.396
1.246GlnPro: 1.246 ± 0.36
1.612GlnGln: 1.612 ± 0.503
2.052GlnArg: 2.052 ± 0.411
2.272GlnSer: 2.272 ± 0.434
1.685GlnThr: 1.685 ± 0.392
2.198GlnVal: 2.198 ± 0.41
0.22GlnTrp: 0.22 ± 0.127
1.319GlnTyr: 1.319 ± 0.344
0.0GlnXaa: 0.0 ± 0.0
Arg
1.612ArgAla: 1.612 ± 0.351
0.44ArgCys: 0.44 ± 0.148
2.125ArgAsp: 2.125 ± 0.444
3.297ArgGlu: 3.297 ± 0.557
2.125ArgPhe: 2.125 ± 0.517
2.418ArgGly: 2.418 ± 0.378
1.246ArgHis: 1.246 ± 0.289
4.177ArgIle: 4.177 ± 0.44
3.81ArgLys: 3.81 ± 0.446
3.81ArgLeu: 3.81 ± 0.503
0.879ArgMet: 0.879 ± 0.232
2.858ArgAsn: 2.858 ± 0.479
1.466ArgPro: 1.466 ± 0.262
1.612ArgGln: 1.612 ± 0.382
2.125ArgArg: 2.125 ± 0.463
1.612ArgSer: 1.612 ± 0.324
1.539ArgThr: 1.539 ± 0.395
1.905ArgVal: 1.905 ± 0.33
0.44ArgTrp: 0.44 ± 0.206
2.638ArgTyr: 2.638 ± 0.502
0.0ArgXaa: 0.0 ± 0.0
Ser
3.81SerAla: 3.81 ± 0.618
0.073SerCys: 0.073 ± 0.079
4.763SerAsp: 4.763 ± 0.555
3.737SerGlu: 3.737 ± 0.545
2.931SerPhe: 2.931 ± 0.505
4.323SerGly: 4.323 ± 0.657
1.099SerHis: 1.099 ± 0.261
5.422SerIle: 5.422 ± 0.723
5.203SerLys: 5.203 ± 0.861
4.103SerLeu: 4.103 ± 0.584
1.759SerMet: 1.759 ± 0.35
3.737SerAsn: 3.737 ± 0.58
0.879SerPro: 0.879 ± 0.252
2.565SerGln: 2.565 ± 0.481
2.272SerArg: 2.272 ± 0.406
3.224SerSer: 3.224 ± 0.513
3.444SerThr: 3.444 ± 0.433
3.81SerVal: 3.81 ± 0.582
0.733SerTrp: 0.733 ± 0.216
2.272SerTyr: 2.272 ± 0.356
0.0SerXaa: 0.0 ± 0.0
Thr
3.371ThrAla: 3.371 ± 0.422
0.073ThrCys: 0.073 ± 0.065
3.957ThrAsp: 3.957 ± 0.477
3.884ThrGlu: 3.884 ± 0.531
2.638ThrPhe: 2.638 ± 0.524
3.884ThrGly: 3.884 ± 0.501
1.319ThrHis: 1.319 ± 0.309
4.616ThrIle: 4.616 ± 0.718
4.983ThrLys: 4.983 ± 0.661
4.763ThrLeu: 4.763 ± 0.547
0.953ThrMet: 0.953 ± 0.305
4.03ThrAsn: 4.03 ± 0.605
1.612ThrPro: 1.612 ± 0.33
2.345ThrGln: 2.345 ± 0.441
2.491ThrArg: 2.491 ± 0.428
3.664ThrSer: 3.664 ± 0.769
3.664ThrThr: 3.664 ± 0.449
3.81ThrVal: 3.81 ± 0.539
1.026ThrTrp: 1.026 ± 0.363
2.565ThrTyr: 2.565 ± 0.464
0.0ThrXaa: 0.0 ± 0.0
Val
3.078ValAla: 3.078 ± 0.713
0.366ValCys: 0.366 ± 0.155
4.25ValAsp: 4.25 ± 0.609
4.616ValGlu: 4.616 ± 0.563
1.685ValPhe: 1.685 ± 0.359
3.371ValGly: 3.371 ± 0.54
0.513ValHis: 0.513 ± 0.177
5.129ValIle: 5.129 ± 0.573
6.082ValLys: 6.082 ± 0.707
4.983ValLeu: 4.983 ± 0.689
1.172ValMet: 1.172 ± 0.264
3.664ValAsn: 3.664 ± 0.554
2.345ValPro: 2.345 ± 0.445
1.392ValGln: 1.392 ± 0.379
2.125ValArg: 2.125 ± 0.371
4.25ValSer: 4.25 ± 0.661
4.03ValThr: 4.03 ± 0.576
3.591ValVal: 3.591 ± 0.551
0.586ValTrp: 0.586 ± 0.221
2.272ValTyr: 2.272 ± 0.472
0.0ValXaa: 0.0 ± 0.0
Trp
0.513TrpAla: 0.513 ± 0.223
0.073TrpCys: 0.073 ± 0.075
0.513TrpAsp: 0.513 ± 0.163
0.586TrpGlu: 0.586 ± 0.185
0.366TrpPhe: 0.366 ± 0.151
0.513TrpGly: 0.513 ± 0.265
0.22TrpHis: 0.22 ± 0.134
1.099TrpIle: 1.099 ± 0.352
1.026TrpLys: 1.026 ± 0.248
1.319TrpLeu: 1.319 ± 0.369
0.22TrpMet: 0.22 ± 0.107
1.026TrpAsn: 1.026 ± 0.36
0.073TrpPro: 0.073 ± 0.066
0.586TrpGln: 0.586 ± 0.179
0.44TrpArg: 0.44 ± 0.18
0.806TrpSer: 0.806 ± 0.301
1.026TrpThr: 1.026 ± 0.221
0.879TrpVal: 0.879 ± 0.243
0.147TrpTrp: 0.147 ± 0.109
0.44TrpTyr: 0.44 ± 0.17
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.491TyrAla: 2.491 ± 0.492
0.293TyrCys: 0.293 ± 0.143
2.565TyrAsp: 2.565 ± 0.536
4.47TyrGlu: 4.47 ± 0.621
2.125TyrPhe: 2.125 ± 0.434
2.711TyrGly: 2.711 ± 0.516
0.953TyrHis: 0.953 ± 0.264
3.004TyrIle: 3.004 ± 0.574
4.25TyrLys: 4.25 ± 0.622
4.25TyrLeu: 4.25 ± 0.703
0.733TyrMet: 0.733 ± 0.22
3.078TyrAsn: 3.078 ± 0.555
1.172TyrPro: 1.172 ± 0.368
1.612TyrGln: 1.612 ± 0.357
1.832TyrArg: 1.832 ± 0.535
2.638TyrSer: 2.638 ± 0.447
2.858TyrThr: 2.858 ± 0.433
2.858TyrVal: 2.858 ± 0.396
1.026TyrTrp: 1.026 ± 0.38
1.832TyrTyr: 1.832 ± 0.327
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13648 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski