Amino acid dipepetide frequency for Streptococcus phage IPP23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.007AlaAla: 3.007 ± 0.54
0.194AlaCys: 0.194 ± 0.146
4.268AlaAsp: 4.268 ± 0.634
5.529AlaGlu: 5.529 ± 0.685
1.843AlaPhe: 1.843 ± 0.342
4.268AlaGly: 4.268 ± 0.661
1.164AlaHis: 1.164 ± 0.322
4.753AlaIle: 4.753 ± 0.691
4.753AlaLys: 4.753 ± 0.757
4.462AlaLeu: 4.462 ± 0.762
1.358AlaMet: 1.358 ± 0.384
3.88AlaAsn: 3.88 ± 0.538
0.97AlaPro: 0.97 ± 0.267
2.91AlaGln: 2.91 ± 0.486
3.492AlaArg: 3.492 ± 0.48
3.783AlaSer: 3.783 ± 0.633
5.141AlaThr: 5.141 ± 0.858
4.656AlaVal: 4.656 ± 0.735
0.97AlaTrp: 0.97 ± 0.253
2.716AlaTyr: 2.716 ± 0.483
0.0AlaXaa: 0.0 ± 0.0
Cys
0.388CysAla: 0.388 ± 0.282
0.0CysCys: 0.0 ± 0.0
0.388CysAsp: 0.388 ± 0.194
0.776CysGlu: 0.776 ± 0.248
0.097CysPhe: 0.097 ± 0.126
0.388CysGly: 0.388 ± 0.183
0.0CysHis: 0.0 ± 0.0
0.194CysIle: 0.194 ± 0.128
0.485CysLys: 0.485 ± 0.226
0.582CysLeu: 0.582 ± 0.231
0.0CysMet: 0.0 ± 0.0
0.485CysAsn: 0.485 ± 0.2
0.291CysPro: 0.291 ± 0.222
0.291CysGln: 0.291 ± 0.167
0.485CysArg: 0.485 ± 0.268
0.485CysSer: 0.485 ± 0.198
0.291CysThr: 0.291 ± 0.165
0.0CysVal: 0.0 ± 0.0
0.097CysTrp: 0.097 ± 0.091
0.291CysTyr: 0.291 ± 0.211
0.0CysXaa: 0.0 ± 0.0
Asp
3.298AspAla: 3.298 ± 0.426
0.873AspCys: 0.873 ± 0.349
4.074AspAsp: 4.074 ± 0.694
4.656AspGlu: 4.656 ± 0.703
4.268AspPhe: 4.268 ± 0.609
5.141AspGly: 5.141 ± 0.848
0.485AspHis: 0.485 ± 0.188
4.559AspIle: 4.559 ± 0.628
4.85AspLys: 4.85 ± 0.68
5.044AspLeu: 5.044 ± 0.767
1.843AspMet: 1.843 ± 0.339
4.365AspAsn: 4.365 ± 0.606
1.261AspPro: 1.261 ± 0.361
1.358AspGln: 1.358 ± 0.481
2.716AspArg: 2.716 ± 0.642
3.298AspSer: 3.298 ± 0.547
2.716AspThr: 2.716 ± 0.577
3.88AspVal: 3.88 ± 0.919
0.873AspTrp: 0.873 ± 0.23
3.104AspTyr: 3.104 ± 0.622
0.0AspXaa: 0.0 ± 0.0
Glu
5.432GluAla: 5.432 ± 0.731
0.291GluCys: 0.291 ± 0.173
3.977GluAsp: 3.977 ± 0.619
6.984GluGlu: 6.984 ± 1.056
2.522GluPhe: 2.522 ± 0.479
4.171GluGly: 4.171 ± 0.677
0.97GluHis: 0.97 ± 0.379
7.469GluIle: 7.469 ± 1.023
7.566GluLys: 7.566 ± 0.93
8.633GluLeu: 8.633 ± 0.978
1.94GluMet: 1.94 ± 0.546
4.947GluAsn: 4.947 ± 0.626
1.261GluPro: 1.261 ± 0.388
3.589GluGln: 3.589 ± 0.566
3.88GluArg: 3.88 ± 0.782
4.559GluSer: 4.559 ± 0.536
5.432GluThr: 5.432 ± 0.584
4.753GluVal: 4.753 ± 0.836
0.873GluTrp: 0.873 ± 0.329
3.395GluTyr: 3.395 ± 0.6
0.0GluXaa: 0.0 ± 0.0
Phe
2.328PheAla: 2.328 ± 0.415
0.291PheCys: 0.291 ± 0.178
3.298PheAsp: 3.298 ± 0.522
3.88PheGlu: 3.88 ± 0.727
2.619PhePhe: 2.619 ± 0.654
2.522PheGly: 2.522 ± 0.482
0.485PheHis: 0.485 ± 0.2
2.425PheIle: 2.425 ± 0.653
2.91PheLys: 2.91 ± 0.47
3.298PheLeu: 3.298 ± 0.702
0.97PheMet: 0.97 ± 0.32
2.91PheAsn: 2.91 ± 0.514
0.873PhePro: 0.873 ± 0.298
1.746PheGln: 1.746 ± 0.459
1.94PheArg: 1.94 ± 0.616
2.037PheSer: 2.037 ± 0.384
2.813PheThr: 2.813 ± 0.514
3.104PheVal: 3.104 ± 0.486
0.388PheTrp: 0.388 ± 0.173
2.037PheTyr: 2.037 ± 0.447
0.0PheXaa: 0.0 ± 0.0
Gly
3.783GlyAla: 3.783 ± 0.719
0.291GlyCys: 0.291 ± 0.171
3.104GlyAsp: 3.104 ± 0.567
4.365GlyGlu: 4.365 ± 0.659
2.619GlyPhe: 2.619 ± 0.453
3.589GlyGly: 3.589 ± 0.545
1.261GlyHis: 1.261 ± 0.302
4.365GlyIle: 4.365 ± 0.836
5.626GlyLys: 5.626 ± 1.021
4.559GlyLeu: 4.559 ± 0.651
1.94GlyMet: 1.94 ± 0.527
2.425GlyAsn: 2.425 ± 0.45
0.582GlyPro: 0.582 ± 0.24
2.328GlyGln: 2.328 ± 0.443
3.201GlyArg: 3.201 ± 0.563
3.104GlySer: 3.104 ± 0.597
3.686GlyThr: 3.686 ± 0.727
3.104GlyVal: 3.104 ± 0.585
0.776GlyTrp: 0.776 ± 0.323
3.589GlyTyr: 3.589 ± 0.63
0.0GlyXaa: 0.0 ± 0.0
His
0.873HisAla: 0.873 ± 0.278
0.291HisCys: 0.291 ± 0.151
1.261HisAsp: 1.261 ± 0.338
1.261HisGlu: 1.261 ± 0.375
0.776HisPhe: 0.776 ± 0.277
0.97HisGly: 0.97 ± 0.265
0.388HisHis: 0.388 ± 0.187
1.552HisIle: 1.552 ± 0.312
1.164HisLys: 1.164 ± 0.37
1.261HisLeu: 1.261 ± 0.37
0.388HisMet: 0.388 ± 0.177
0.97HisAsn: 0.97 ± 0.262
0.291HisPro: 0.291 ± 0.185
0.388HisGln: 0.388 ± 0.182
0.097HisArg: 0.097 ± 0.092
0.679HisSer: 0.679 ± 0.206
1.067HisThr: 1.067 ± 0.288
0.873HisVal: 0.873 ± 0.251
0.097HisTrp: 0.097 ± 0.091
0.097HisTyr: 0.097 ± 0.108
0.0HisXaa: 0.0 ± 0.0
Ile
5.141IleAla: 5.141 ± 0.759
0.0IleCys: 0.0 ± 0.0
5.335IleAsp: 5.335 ± 0.849
7.178IleGlu: 7.178 ± 0.719
3.104IlePhe: 3.104 ± 0.479
3.783IleGly: 3.783 ± 0.608
1.067IleHis: 1.067 ± 0.23
5.141IleIle: 5.141 ± 0.811
8.439IleLys: 8.439 ± 0.932
6.402IleLeu: 6.402 ± 0.707
1.94IleMet: 1.94 ± 0.503
4.365IleAsn: 4.365 ± 0.641
2.328IlePro: 2.328 ± 0.66
2.813IleGln: 2.813 ± 0.433
2.328IleArg: 2.328 ± 0.493
5.335IleSer: 5.335 ± 0.91
4.365IleThr: 4.365 ± 0.588
3.104IleVal: 3.104 ± 0.598
1.067IleTrp: 1.067 ± 0.38
2.231IleTyr: 2.231 ± 0.489
0.0IleXaa: 0.0 ± 0.0
Lys
6.208LysAla: 6.208 ± 0.961
0.291LysCys: 0.291 ± 0.189
5.238LysAsp: 5.238 ± 0.652
8.051LysGlu: 8.051 ± 0.758
2.425LysPhe: 2.425 ± 0.449
4.947LysGly: 4.947 ± 0.617
0.97LysHis: 0.97 ± 0.324
7.275LysIle: 7.275 ± 0.569
7.954LysLys: 7.954 ± 1.163
8.051LysLeu: 8.051 ± 0.938
2.716LysMet: 2.716 ± 0.511
4.171LysAsn: 4.171 ± 0.614
2.425LysPro: 2.425 ± 0.399
4.365LysGln: 4.365 ± 0.728
3.783LysArg: 3.783 ± 0.584
5.432LysSer: 5.432 ± 0.721
4.656LysThr: 4.656 ± 0.847
4.268LysVal: 4.268 ± 0.648
1.455LysTrp: 1.455 ± 0.385
2.716LysTyr: 2.716 ± 0.643
0.0LysXaa: 0.0 ± 0.0
Leu
6.596LeuAla: 6.596 ± 0.662
0.776LeuCys: 0.776 ± 0.327
6.887LeuAsp: 6.887 ± 0.769
8.245LeuGlu: 8.245 ± 0.91
3.783LeuPhe: 3.783 ± 0.592
4.753LeuGly: 4.753 ± 0.857
1.358LeuHis: 1.358 ± 0.331
5.626LeuIle: 5.626 ± 0.676
8.051LeuLys: 8.051 ± 0.937
7.76LeuLeu: 7.76 ± 0.917
2.425LeuMet: 2.425 ± 0.671
3.492LeuAsn: 3.492 ± 0.601
2.425LeuPro: 2.425 ± 0.553
3.007LeuGln: 3.007 ± 0.577
2.619LeuArg: 2.619 ± 0.554
6.984LeuSer: 6.984 ± 0.663
4.656LeuThr: 4.656 ± 0.716
3.977LeuVal: 3.977 ± 0.605
0.582LeuTrp: 0.582 ± 0.246
1.552LeuTyr: 1.552 ± 0.391
0.0LeuXaa: 0.0 ± 0.0
Met
1.94MetAla: 1.94 ± 0.468
0.097MetCys: 0.097 ± 0.1
1.843MetAsp: 1.843 ± 0.454
1.455MetGlu: 1.455 ± 0.416
0.873MetPhe: 0.873 ± 0.243
1.164MetGly: 1.164 ± 0.285
0.194MetHis: 0.194 ± 0.133
2.328MetIle: 2.328 ± 0.449
1.552MetLys: 1.552 ± 0.424
2.522MetLeu: 2.522 ± 0.423
0.388MetMet: 0.388 ± 0.185
1.552MetAsn: 1.552 ± 0.346
0.873MetPro: 0.873 ± 0.32
1.067MetGln: 1.067 ± 0.287
1.746MetArg: 1.746 ± 0.439
1.552MetSer: 1.552 ± 0.355
2.425MetThr: 2.425 ± 0.42
1.552MetVal: 1.552 ± 0.361
0.291MetTrp: 0.291 ± 0.168
0.582MetTyr: 0.582 ± 0.198
0.0MetXaa: 0.0 ± 0.0
Asn
4.559AsnAla: 4.559 ± 0.665
0.485AsnCys: 0.485 ± 0.244
2.425AsnAsp: 2.425 ± 0.545
5.432AsnGlu: 5.432 ± 0.817
2.425AsnPhe: 2.425 ± 0.459
4.656AsnGly: 4.656 ± 0.728
1.164AsnHis: 1.164 ± 0.323
4.656AsnIle: 4.656 ± 0.595
3.88AsnLys: 3.88 ± 0.525
5.044AsnLeu: 5.044 ± 0.585
0.97AsnMet: 0.97 ± 0.269
2.619AsnAsn: 2.619 ± 0.568
2.328AsnPro: 2.328 ± 0.409
1.358AsnGln: 1.358 ± 0.326
2.522AsnArg: 2.522 ± 0.471
3.395AsnSer: 3.395 ± 0.457
3.395AsnThr: 3.395 ± 0.484
3.201AsnVal: 3.201 ± 0.483
0.97AsnTrp: 0.97 ± 0.315
2.134AsnTyr: 2.134 ± 0.365
0.0AsnXaa: 0.0 ± 0.0
Pro
1.552ProAla: 1.552 ± 0.37
0.194ProCys: 0.194 ± 0.143
1.843ProAsp: 1.843 ± 0.489
2.425ProGlu: 2.425 ± 0.387
0.97ProPhe: 0.97 ± 0.387
1.164ProGly: 1.164 ± 0.328
0.582ProHis: 0.582 ± 0.235
2.231ProIle: 2.231 ± 0.432
2.134ProLys: 2.134 ± 0.476
1.455ProLeu: 1.455 ± 0.467
0.194ProMet: 0.194 ± 0.124
1.261ProAsn: 1.261 ± 0.304
0.679ProPro: 0.679 ± 0.292
0.679ProGln: 0.679 ± 0.272
0.873ProArg: 0.873 ± 0.305
1.843ProSer: 1.843 ± 0.48
1.552ProThr: 1.552 ± 0.35
1.94ProVal: 1.94 ± 0.404
0.097ProTrp: 0.097 ± 0.088
1.067ProTyr: 1.067 ± 0.275
0.0ProXaa: 0.0 ± 0.0
Gln
2.134GlnAla: 2.134 ± 0.46
0.291GlnCys: 0.291 ± 0.178
1.843GlnAsp: 1.843 ± 0.351
2.328GlnGlu: 2.328 ± 0.419
1.746GlnPhe: 1.746 ± 0.454
1.552GlnGly: 1.552 ± 0.294
0.485GlnHis: 0.485 ± 0.211
2.91GlnIle: 2.91 ± 0.451
3.395GlnLys: 3.395 ± 0.633
3.298GlnLeu: 3.298 ± 0.479
0.97GlnMet: 0.97 ± 0.315
2.425GlnAsn: 2.425 ± 0.57
0.776GlnPro: 0.776 ± 0.232
1.261GlnGln: 1.261 ± 0.39
2.134GlnArg: 2.134 ± 0.415
2.134GlnSer: 2.134 ± 0.462
2.716GlnThr: 2.716 ± 0.355
2.619GlnVal: 2.619 ± 0.459
0.485GlnTrp: 0.485 ± 0.215
1.358GlnTyr: 1.358 ± 0.335
0.0GlnXaa: 0.0 ± 0.0
Arg
2.425ArgAla: 2.425 ± 0.417
0.679ArgCys: 0.679 ± 0.246
1.746ArgAsp: 1.746 ± 0.374
3.104ArgGlu: 3.104 ± 0.621
1.455ArgPhe: 1.455 ± 0.406
2.425ArgGly: 2.425 ± 0.496
0.97ArgHis: 0.97 ± 0.33
2.91ArgIle: 2.91 ± 0.576
3.88ArgLys: 3.88 ± 0.655
5.044ArgLeu: 5.044 ± 0.868
1.067ArgMet: 1.067 ± 0.249
2.716ArgAsn: 2.716 ± 0.558
0.97ArgPro: 0.97 ± 0.306
1.455ArgGln: 1.455 ± 0.436
2.037ArgArg: 2.037 ± 0.526
1.746ArgSer: 1.746 ± 0.458
3.104ArgThr: 3.104 ± 0.515
2.522ArgVal: 2.522 ± 0.538
0.97ArgTrp: 0.97 ± 0.356
1.455ArgTyr: 1.455 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
3.395SerAla: 3.395 ± 0.532
0.0SerCys: 0.0 ± 0.0
4.656SerAsp: 4.656 ± 0.66
4.268SerGlu: 4.268 ± 0.649
2.037SerPhe: 2.037 ± 0.542
2.522SerGly: 2.522 ± 0.508
1.261SerHis: 1.261 ± 0.334
5.335SerIle: 5.335 ± 0.79
5.82SerLys: 5.82 ± 0.88
5.723SerLeu: 5.723 ± 0.809
1.94SerMet: 1.94 ± 0.392
3.589SerAsn: 3.589 ± 0.516
1.455SerPro: 1.455 ± 0.38
2.328SerGln: 2.328 ± 0.442
2.037SerArg: 2.037 ± 0.364
5.432SerSer: 5.432 ± 0.802
3.492SerThr: 3.492 ± 0.728
3.298SerVal: 3.298 ± 0.804
0.873SerTrp: 0.873 ± 0.336
2.716SerTyr: 2.716 ± 0.433
0.0SerXaa: 0.0 ± 0.0
Thr
4.462ThrAla: 4.462 ± 0.629
0.291ThrCys: 0.291 ± 0.166
3.783ThrAsp: 3.783 ± 0.663
4.559ThrGlu: 4.559 ± 0.609
2.813ThrPhe: 2.813 ± 0.814
4.656ThrGly: 4.656 ± 0.845
0.776ThrHis: 0.776 ± 0.335
4.559ThrIle: 4.559 ± 0.684
4.559ThrLys: 4.559 ± 0.546
4.85ThrLeu: 4.85 ± 0.648
1.552ThrMet: 1.552 ± 0.354
3.783ThrAsn: 3.783 ± 0.488
1.843ThrPro: 1.843 ± 0.416
2.134ThrGln: 2.134 ± 0.572
1.649ThrArg: 1.649 ± 0.324
3.298ThrSer: 3.298 ± 0.763
4.559ThrThr: 4.559 ± 0.758
5.044ThrVal: 5.044 ± 0.783
0.776ThrTrp: 0.776 ± 0.223
2.037ThrTyr: 2.037 ± 0.427
0.0ThrXaa: 0.0 ± 0.0
Val
3.201ValAla: 3.201 ± 0.56
0.194ValCys: 0.194 ± 0.131
3.589ValAsp: 3.589 ± 0.46
4.268ValGlu: 4.268 ± 0.758
3.007ValPhe: 3.007 ± 0.388
3.201ValGly: 3.201 ± 0.61
0.679ValHis: 0.679 ± 0.23
3.977ValIle: 3.977 ± 0.683
5.723ValLys: 5.723 ± 0.672
2.813ValLeu: 2.813 ± 0.546
1.843ValMet: 1.843 ± 0.381
5.238ValAsn: 5.238 ± 0.828
1.649ValPro: 1.649 ± 0.302
2.037ValGln: 2.037 ± 0.448
2.522ValArg: 2.522 ± 0.614
4.074ValSer: 4.074 ± 0.574
4.171ValThr: 4.171 ± 0.694
3.783ValVal: 3.783 ± 0.781
0.582ValTrp: 0.582 ± 0.264
1.649ValTyr: 1.649 ± 0.454
0.0ValXaa: 0.0 ± 0.0
Trp
0.776TrpAla: 0.776 ± 0.269
0.0TrpCys: 0.0 ± 0.0
0.582TrpAsp: 0.582 ± 0.249
0.97TrpGlu: 0.97 ± 0.289
1.164TrpPhe: 1.164 ± 0.352
0.388TrpGly: 0.388 ± 0.179
0.194TrpHis: 0.194 ± 0.145
0.679TrpIle: 0.679 ± 0.298
0.776TrpLys: 0.776 ± 0.283
1.455TrpLeu: 1.455 ± 0.458
0.388TrpMet: 0.388 ± 0.188
0.776TrpAsn: 0.776 ± 0.382
0.291TrpPro: 0.291 ± 0.146
0.485TrpGln: 0.485 ± 0.194
0.776TrpArg: 0.776 ± 0.246
0.873TrpSer: 0.873 ± 0.352
0.679TrpThr: 0.679 ± 0.225
0.776TrpVal: 0.776 ± 0.25
0.097TrpTrp: 0.097 ± 0.088
0.582TrpTyr: 0.582 ± 0.301
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.619TyrAla: 2.619 ± 0.437
0.485TyrCys: 0.485 ± 0.187
2.522TyrAsp: 2.522 ± 0.548
2.91TyrGlu: 2.91 ± 0.681
2.328TyrPhe: 2.328 ± 0.42
2.037TyrGly: 2.037 ± 0.385
0.291TyrHis: 0.291 ± 0.169
2.522TyrIle: 2.522 ± 0.498
3.88TyrLys: 3.88 ± 0.566
3.298TyrLeu: 3.298 ± 0.435
1.067TyrMet: 1.067 ± 0.381
1.649TyrAsn: 1.649 ± 0.338
1.164TyrPro: 1.164 ± 0.394
1.261TyrGln: 1.261 ± 0.376
1.843TyrArg: 1.843 ± 0.555
2.134TyrSer: 2.134 ± 0.528
1.067TyrThr: 1.067 ± 0.336
1.746TyrVal: 1.746 ± 0.371
0.485TyrTrp: 0.485 ± 0.215
1.94TyrTyr: 1.94 ± 0.694
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (10310 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski