Amino acid dipeptide frequency for Streptococcus phage Javan626

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
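The counting described above can be illustrated with a minimal Python sketch. It is not the Proteome-pI implementation: the function names, the use of overlapping pairs within each protein, and the normalisation by the total number of counted pairs are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids, in the table's order
# (Ala, Cys, Asp, ..., Tyr). Xaa is left out of this sketch.
AA = "ACDEFGHIKLMNPQRSTVWY"

# Frequency every dipeptide would have if all 400 were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs.

    Assumption: overlapping pairs are counted inside each protein (never
    across protein boundaries) and divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AA, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AA, repeat=2)}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["AAK", "AK"]
freqs = dipeptide_per_mille(toy)
```

With the toy input, the three counted pairs are AA, AK and AK, so `classify(freqs["AK"])` reports the pair as overrepresented, while a pair that never occurs, such as CC, is reported as underrepresented.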

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
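As a small worked example of the unit conversion, the sketch below converts the AlaAla value from the table into a fraction and compares it with the uniform expectation of 1/400. The helper name is hypothetical and serves only as an illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value into a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# AlaAla is listed as 2.714 per mille in the table.
ala_ala = per_mille_to_fraction(2.714)

# Uniform expectation for 400 equally likely dipeptides: 0.0025 (0.25%).
uniform = 1 / 400

# Ratio above 1 means more frequent than the uniform expectation.
ratio = ala_ala / uniform
```

Here `ratio` is slightly above 1, so AlaAla is marginally more frequent than it would be if all 400 dipeptides were equally likely.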

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in per mille (‰), given as frequency ± estimated error. Entries are grouped by the first residue; within each group the second residue follows the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 2.714 ± 0.832
AlaCys: 0.399 ± 0.189
AlaAsp: 5.508 ± 0.764
AlaGlu: 6.146 ± 0.598
AlaPhe: 2.235 ± 0.385
AlaGly: 4.709 ± 0.994
AlaHis: 0.399 ± 0.197
AlaIle: 4.949 ± 0.582
AlaLys: 7.104 ± 0.906
AlaLeu: 7.104 ± 0.944
AlaMet: 2.874 ± 0.54
AlaAsn: 4.39 ± 0.754
AlaPro: 1.996 ± 0.606
AlaGln: 2.874 ± 0.667
AlaArg: 3.672 ± 0.67
AlaSer: 5.987 ± 0.908
AlaThr: 5.029 ± 1.078
AlaVal: 3.672 ± 0.532
AlaTrp: 0.878 ± 0.24
AlaTyr: 3.033 ± 0.489
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.16 ± 0.107
CysCys: 0.0 ± 0.0
CysAsp: 0.239 ± 0.119
CysGlu: 0.639 ± 0.263
CysPhe: 0.08 ± 0.082
CysGly: 0.718 ± 0.388
CysHis: 0.16 ± 0.128
CysIle: 0.319 ± 0.199
CysLys: 0.559 ± 0.244
CysLeu: 0.718 ± 0.296
CysMet: 0.16 ± 0.116
CysAsn: 0.399 ± 0.205
CysPro: 0.08 ± 0.096
CysGln: 0.239 ± 0.136
CysArg: 0.239 ± 0.104
CysSer: 0.479 ± 0.185
CysThr: 0.399 ± 0.227
CysVal: 0.16 ± 0.118
CysTrp: 0.08 ± 0.068
CysTyr: 0.08 ± 0.082
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.789 ± 0.569
AspCys: 0.319 ± 0.158
AspAsp: 3.911 ± 0.824
AspGlu: 5.667 ± 0.905
AspPhe: 3.592 ± 0.589
AspGly: 5.268 ± 0.828
AspHis: 0.639 ± 0.272
AspIle: 4.071 ± 0.711
AspLys: 4.869 ± 0.576
AspLeu: 5.907 ± 0.737
AspMet: 1.038 ± 0.261
AspAsn: 3.193 ± 0.456
AspPro: 2.075 ± 0.365
AspGln: 1.117 ± 0.297
AspArg: 2.315 ± 0.393
AspSer: 4.31 ± 0.58
AspThr: 3.831 ± 0.598
AspVal: 3.432 ± 0.444
AspTrp: 1.277 ± 0.354
AspTyr: 3.033 ± 0.541
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.029 ± 0.624
GluCys: 0.639 ± 0.258
GluAsp: 3.991 ± 0.707
GluGlu: 6.466 ± 1.01
GluPhe: 2.794 ± 0.357
GluGly: 3.352 ± 0.527
GluHis: 1.117 ± 0.328
GluIle: 6.066 ± 0.896
GluLys: 6.865 ± 0.736
GluLeu: 7.104 ± 0.746
GluMet: 2.315 ± 0.467
GluAsn: 3.193 ± 0.606
GluPro: 1.596 ± 0.352
GluGln: 3.113 ± 0.603
GluArg: 2.714 ± 0.464
GluSer: 2.953 ± 0.56
GluThr: 4.151 ± 0.406
GluVal: 5.827 ± 0.658
GluTrp: 1.117 ± 0.283
GluTyr: 2.554 ± 0.536
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.953 ± 0.486
PheCys: 0.399 ± 0.172
PheAsp: 3.193 ± 0.543
PheGlu: 3.352 ± 0.447
PhePhe: 1.357 ± 0.516
PheGly: 2.474 ± 0.42
PheHis: 0.319 ± 0.124
PheIle: 2.315 ± 0.535
PheLys: 2.953 ± 0.616
PheLeu: 2.235 ± 0.449
PheMet: 0.798 ± 0.257
PheAsn: 2.874 ± 0.499
PhePro: 1.277 ± 0.258
PheGln: 0.718 ± 0.185
PheArg: 1.357 ± 0.357
PheSer: 2.953 ± 0.461
PheThr: 1.437 ± 0.314
PheVal: 2.155 ± 0.316
PheTrp: 0.559 ± 0.237
PheTyr: 1.517 ± 0.322
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.31 ± 1.062
GlyCys: 0.399 ± 0.199
GlyAsp: 3.911 ± 0.479
GlyGlu: 3.033 ± 0.427
GlyPhe: 2.634 ± 0.351
GlyGly: 4.071 ± 0.571
GlyHis: 0.798 ± 0.287
GlyIle: 3.592 ± 0.545
GlyLys: 6.865 ± 0.594
GlyLeu: 6.944 ± 0.878
GlyMet: 1.996 ± 0.452
GlyAsn: 3.352 ± 0.69
GlyPro: 0.639 ± 0.195
GlyGln: 3.033 ± 0.475
GlyArg: 2.794 ± 0.517
GlySer: 3.512 ± 0.62
GlyThr: 4.949 ± 1.019
GlyVal: 4.231 ± 0.47
GlyTrp: 0.878 ± 0.285
GlyTyr: 2.714 ± 0.434
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.798 ± 0.267
HisCys: 0.319 ± 0.171
HisAsp: 1.197 ± 0.508
HisGlu: 0.559 ± 0.245
HisPhe: 0.479 ± 0.228
HisGly: 0.639 ± 0.198
HisHis: 0.239 ± 0.123
HisIle: 0.798 ± 0.233
HisLys: 1.197 ± 0.293
HisLeu: 1.596 ± 0.444
HisMet: 0.08 ± 0.079
HisAsn: 0.559 ± 0.209
HisPro: 0.639 ± 0.168
HisGln: 0.479 ± 0.193
HisArg: 0.399 ± 0.196
HisSer: 0.718 ± 0.216
HisThr: 1.197 ± 0.239
HisVal: 0.479 ± 0.204
HisTrp: 0.08 ± 0.086
HisTyr: 0.399 ± 0.205
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.709 ± 0.875
IleCys: 0.239 ± 0.145
IleAsp: 4.709 ± 0.731
IleGlu: 4.63 ± 0.885
IlePhe: 1.916 ± 0.357
IleGly: 2.714 ± 0.489
IleHis: 1.038 ± 0.358
IleIle: 3.432 ± 0.643
IleLys: 6.146 ± 0.761
IleLeu: 5.747 ± 0.651
IleMet: 0.798 ± 0.248
IleAsn: 4.869 ± 0.703
IlePro: 1.916 ± 0.339
IleGln: 1.277 ± 0.333
IleArg: 3.033 ± 0.46
IleSer: 4.869 ± 0.658
IleThr: 4.47 ± 0.475
IleVal: 3.911 ± 0.636
IleTrp: 0.718 ± 0.202
IleTyr: 2.554 ± 0.497
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.944 ± 0.952
LysCys: 0.239 ± 0.141
LysAsp: 5.188 ± 0.603
LysGlu: 5.987 ± 0.84
LysPhe: 3.033 ± 0.449
LysGly: 4.869 ± 0.616
LysHis: 1.117 ± 0.327
LysIle: 5.268 ± 0.717
LysLys: 5.987 ± 0.929
LysLeu: 6.226 ± 0.653
LysMet: 2.634 ± 0.462
LysAsn: 4.949 ± 0.554
LysPro: 2.474 ± 0.412
LysGln: 3.193 ± 0.532
LysArg: 3.672 ± 0.505
LysSer: 4.55 ± 0.584
LysThr: 5.987 ± 0.736
LysVal: 5.747 ± 0.716
LysTrp: 1.437 ± 0.321
LysTyr: 3.113 ± 0.473
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.944 ± 0.97
LeuCys: 0.559 ± 0.268
LeuAsp: 5.987 ± 0.74
LeuGlu: 7.104 ± 0.878
LeuPhe: 2.714 ± 0.564
LeuGly: 5.348 ± 0.565
LeuHis: 0.718 ± 0.287
LeuIle: 5.029 ± 0.603
LeuLys: 6.785 ± 0.792
LeuLeu: 5.987 ± 0.862
LeuMet: 2.235 ± 0.536
LeuAsn: 5.428 ± 0.669
LeuPro: 3.193 ± 0.564
LeuGln: 3.113 ± 0.661
LeuArg: 3.352 ± 0.533
LeuSer: 5.667 ± 0.617
LeuThr: 6.625 ± 0.719
LeuVal: 5.667 ± 0.615
LeuTrp: 0.479 ± 0.246
LeuTyr: 2.474 ± 0.377
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.676 ± 0.387
MetCys: 0.319 ± 0.155
MetAsp: 1.117 ± 0.396
MetGlu: 1.676 ± 0.425
MetPhe: 1.038 ± 0.307
MetGly: 1.357 ± 0.309
MetHis: 0.399 ± 0.172
MetIle: 1.916 ± 0.447
MetLys: 2.075 ± 0.429
MetLeu: 1.437 ± 0.299
MetMet: 0.958 ± 0.275
MetAsn: 1.596 ± 0.437
MetPro: 0.798 ± 0.209
MetGln: 0.798 ± 0.209
MetArg: 1.756 ± 0.317
MetSer: 1.676 ± 0.45
MetThr: 2.155 ± 0.518
MetVal: 1.357 ± 0.373
MetTrp: 0.319 ± 0.139
MetTyr: 0.559 ± 0.244
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.348 ± 1.382
AsnCys: 0.08 ± 0.098
AsnAsp: 3.672 ± 0.612
AsnGlu: 4.63 ± 0.629
AsnPhe: 2.395 ± 0.397
AsnGly: 5.029 ± 0.85
AsnHis: 1.038 ± 0.308
AsnIle: 2.953 ± 0.589
AsnLys: 3.273 ± 0.496
AsnLeu: 4.31 ± 0.571
AsnMet: 1.117 ± 0.252
AsnAsn: 3.752 ± 0.712
AsnPro: 2.235 ± 0.501
AsnGln: 1.916 ± 0.353
AsnArg: 1.117 ± 0.324
AsnSer: 4.789 ± 0.617
AsnThr: 3.512 ± 0.662
AsnVal: 3.193 ± 0.52
AsnTrp: 0.479 ± 0.248
AsnTyr: 2.315 ± 0.437
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.836 ± 0.459
ProCys: 0.08 ± 0.073
ProAsp: 2.235 ± 0.401
ProGlu: 1.836 ± 0.435
ProPhe: 0.878 ± 0.225
ProGly: 1.357 ± 0.327
ProHis: 0.319 ± 0.143
ProIle: 2.075 ± 0.32
ProLys: 2.075 ± 0.572
ProLeu: 2.474 ± 0.574
ProMet: 0.798 ± 0.254
ProAsn: 1.756 ± 0.383
ProPro: 0.239 ± 0.142
ProGln: 1.596 ± 0.514
ProArg: 0.718 ± 0.284
ProSer: 2.155 ± 0.387
ProThr: 2.155 ± 0.443
ProVal: 2.874 ± 0.455
ProTrp: 0.16 ± 0.1
ProTyr: 1.038 ± 0.402
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.432 ± 0.642
GlnCys: 0.16 ± 0.118
GlnAsp: 1.197 ± 0.256
GlnGlu: 2.554 ± 0.493
GlnPhe: 1.916 ± 0.415
GlnGly: 3.033 ± 0.444
GlnHis: 0.319 ± 0.14
GlnIle: 2.155 ± 0.373
GlnLys: 3.193 ± 0.591
GlnLeu: 3.911 ± 0.59
GlnMet: 1.277 ± 0.305
GlnAsn: 1.437 ± 0.315
GlnPro: 0.878 ± 0.251
GlnGln: 0.878 ± 0.31
GlnArg: 0.878 ± 0.227
GlnSer: 3.033 ± 0.586
GlnThr: 2.235 ± 0.397
GlnVal: 1.756 ± 0.382
GlnTrp: 0.718 ± 0.2
GlnTyr: 1.277 ± 0.264
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.672 ± 0.541
ArgCys: 0.16 ± 0.094
ArgAsp: 2.634 ± 0.372
ArgGlu: 2.953 ± 0.539
ArgPhe: 1.357 ± 0.378
ArgGly: 2.155 ± 0.405
ArgHis: 0.798 ± 0.204
ArgIle: 3.432 ± 0.426
ArgLys: 3.033 ± 0.66
ArgLeu: 3.432 ± 0.641
ArgMet: 0.878 ± 0.273
ArgAsn: 1.836 ± 0.407
ArgPro: 1.038 ± 0.254
ArgGln: 1.596 ± 0.349
ArgArg: 1.357 ± 0.371
ArgSer: 1.756 ± 0.391
ArgThr: 2.155 ± 0.384
ArgVal: 2.075 ± 0.428
ArgTrp: 0.639 ± 0.248
ArgTyr: 1.596 ± 0.291
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.146 ± 0.831
SerCys: 0.319 ± 0.201
SerAsp: 5.029 ± 0.733
SerGlu: 3.432 ± 0.425
SerPhe: 3.033 ± 0.545
SerGly: 5.029 ± 0.45
SerHis: 1.038 ± 0.292
SerIle: 3.752 ± 0.583
SerLys: 5.109 ± 0.62
SerLeu: 5.109 ± 0.783
SerMet: 1.357 ± 0.373
SerAsn: 3.193 ± 0.577
SerPro: 1.596 ± 0.344
SerGln: 3.273 ± 0.535
SerArg: 1.996 ± 0.471
SerSer: 4.869 ± 0.737
SerThr: 4.869 ± 0.832
SerVal: 4.071 ± 0.724
SerTrp: 0.878 ± 0.242
SerTyr: 2.554 ± 0.521
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.188 ± 0.653
ThrCys: 0.16 ± 0.093
ThrAsp: 4.151 ± 0.615
ThrGlu: 3.432 ± 0.618
ThrPhe: 2.395 ± 0.376
ThrGly: 5.508 ± 0.851
ThrHis: 1.117 ± 0.274
ThrIle: 5.188 ± 0.892
ThrLys: 5.587 ± 0.685
ThrLeu: 4.47 ± 0.594
ThrMet: 1.596 ± 0.437
ThrAsn: 3.432 ± 0.616
ThrPro: 1.996 ± 0.424
ThrGln: 2.075 ± 0.454
ThrArg: 1.996 ± 0.304
ThrSer: 4.151 ± 0.951
ThrThr: 5.428 ± 0.947
ThrVal: 5.428 ± 0.844
ThrTrp: 1.756 ± 0.384
ThrTyr: 3.273 ± 0.502
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.268 ± 0.666
ValCys: 0.319 ± 0.149
ValAsp: 3.752 ± 0.508
ValGlu: 5.428 ± 0.785
ValPhe: 1.437 ± 0.304
ValGly: 4.071 ± 0.594
ValHis: 0.559 ± 0.207
ValIle: 4.151 ± 0.772
ValLys: 5.029 ± 0.704
ValLeu: 5.907 ± 0.708
ValMet: 1.277 ± 0.323
ValAsn: 4.39 ± 0.597
ValPro: 2.714 ± 0.435
ValGln: 1.836 ± 0.571
ValArg: 2.634 ± 0.436
ValSer: 4.709 ± 0.75
ValThr: 3.193 ± 0.507
ValVal: 2.395 ± 0.404
ValTrp: 0.479 ± 0.236
ValTyr: 2.395 ± 0.404
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.878 ± 0.256
TrpCys: 0.16 ± 0.128
TrpAsp: 0.639 ± 0.221
TrpGlu: 0.878 ± 0.224
TrpPhe: 0.559 ± 0.197
TrpGly: 0.479 ± 0.202
TrpHis: 0.319 ± 0.172
TrpIle: 0.479 ± 0.197
TrpLys: 1.038 ± 0.26
TrpLeu: 1.517 ± 0.261
TrpMet: 0.239 ± 0.142
TrpAsn: 0.718 ± 0.28
TrpPro: 0.319 ± 0.198
TrpGln: 0.479 ± 0.199
TrpArg: 0.718 ± 0.296
TrpSer: 1.117 ± 0.306
TrpThr: 1.357 ± 0.473
TrpVal: 1.117 ± 0.397
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.559 ± 0.205
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.953 ± 0.598
TyrCys: 0.639 ± 0.24
TyrAsp: 2.395 ± 0.458
TyrGlu: 2.554 ± 0.526
TyrPhe: 1.357 ± 0.335
TyrGly: 2.474 ± 0.475
TyrHis: 0.399 ± 0.194
TyrIle: 1.756 ± 0.39
TyrLys: 2.874 ± 0.421
TyrLeu: 3.273 ± 0.56
TyrMet: 0.479 ± 0.207
TyrAsn: 1.836 ± 0.339
TyrPro: 0.958 ± 0.31
TyrGln: 2.634 ± 0.612
TyrArg: 1.836 ± 0.496
TyrSer: 2.395 ± 0.51
TyrThr: 3.193 ± 0.488
TyrVal: 2.395 ± 0.469
TyrTrp: 0.639 ± 0.272
TyrTyr: 1.277 ± 0.429
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 56 proteins (12529 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
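Protein-level bootstrapping can be sketched in Python as follows. The page does not describe the exact procedure, so everything here is an assumption: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those estimates is taken as the error. The function names and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return stdev(estimates)


# Toy proteome (hypothetical sequences, not taken from this phage).
toy = ["AKAKLL", "MAKE", "LLLL", "KAKA"]
error = bootstrap_error(toy, "AK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the set.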

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski