Amino acid dipeptide frequency for Streptococcus phage Javan107

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally common, each dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
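The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `classify`, the use of one-letter codes, and the choice to normalize by the total number of overlapping pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25 %).
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: frequencies are normalized by the total number of
    overlapping pairs, and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKTAYIAKQR", "MAAAC"])  # toy sequences
    print(len(freqs), classify(freqs["AA"]))
```

With the values from the table, `classify(4.46)` (AlaAla) gives "over" and `classify(0.686)` (AlaCys) gives "under".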

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
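As a worked example of this conversion, the short Python sketch below turns the AlaAla value from the table into a fraction and a percentage. The helper name `permille_to_fraction` is hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_permille = 4.46                        # AlaAla entry from the table
ala_ala_fraction = permille_to_fraction(ala_ala_permille)
ala_ala_percent = ala_ala_fraction * 100       # fraction expressed in percent

# Rounded for display: 0.00446 as a fraction, 0.446 in percent.
print(round(ala_ala_fraction, 5), round(ala_ala_percent, 3))
```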

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.46 ± 0.922
AlaCys: 0.686 ± 0.277
AlaAsp: 3.86 ± 0.608
AlaGlu: 3.688 ± 0.619
AlaPhe: 2.316 ± 0.407
AlaGly: 5.06 ± 0.839
AlaHis: 1.029 ± 0.305
AlaIle: 4.717 ± 0.771
AlaLys: 5.661 ± 0.453
AlaLeu: 6.09 ± 1.05
AlaMet: 2.402 ± 0.45
AlaAsn: 3.345 ± 0.515
AlaPro: 1.372 ± 0.322
AlaGln: 3.345 ± 0.484
AlaArg: 3.688 ± 0.523
AlaSer: 4.289 ± 0.7
AlaThr: 5.232 ± 0.784
AlaVal: 4.889 ± 0.733
AlaTrp: 0.6 ± 0.203
AlaTyr: 3.688 ± 0.48
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.257 ± 0.156
CysCys: 0.172 ± 0.128
CysAsp: 0.429 ± 0.214
CysGlu: 0.686 ± 0.302
CysPhe: 0.086 ± 0.093
CysGly: 1.029 ± 0.398
CysHis: 0.257 ± 0.156
CysIle: 0.429 ± 0.162
CysLys: 0.343 ± 0.242
CysLeu: 0.943 ± 0.31
CysMet: 0.086 ± 0.079
CysAsn: 0.257 ± 0.147
CysPro: 0.429 ± 0.172
CysGln: 0.686 ± 0.214
CysArg: 0.858 ± 0.284
CysSer: 0.6 ± 0.212
CysThr: 0.257 ± 0.127
CysVal: 0.429 ± 0.173
CysTrp: 0.0 ± 0.0
CysTyr: 0.686 ± 0.231
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.088 ± 0.461
AspCys: 0.6 ± 0.178
AspAsp: 3.517 ± 0.924
AspGlu: 4.632 ± 0.639
AspPhe: 3.002 ± 0.433
AspGly: 4.717 ± 0.751
AspHis: 1.63 ± 0.329
AspIle: 3.688 ± 0.476
AspLys: 4.46 ± 0.463
AspLeu: 6.519 ± 0.816
AspMet: 1.973 ± 0.319
AspAsn: 2.144 ± 0.463
AspPro: 1.458 ± 0.495
AspGln: 1.887 ± 0.494
AspArg: 2.058 ± 0.542
AspSer: 2.916 ± 0.384
AspThr: 3.088 ± 0.428
AspVal: 3.174 ± 0.592
AspTrp: 0.858 ± 0.236
AspTyr: 3.345 ± 0.757
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.832 ± 0.797
GluCys: 0.686 ± 0.281
GluAsp: 4.117 ± 0.548
GluGlu: 6.862 ± 0.851
GluPhe: 2.402 ± 0.495
GluGly: 5.489 ± 0.488
GluHis: 1.115 ± 0.247
GluIle: 3.259 ± 0.622
GluLys: 5.918 ± 0.789
GluLeu: 8.148 ± 0.818
GluMet: 2.058 ± 0.636
GluAsn: 3.345 ± 0.609
GluPro: 1.973 ± 0.428
GluGln: 3.945 ± 0.673
GluArg: 2.23 ± 0.454
GluSer: 3.259 ± 0.494
GluThr: 4.632 ± 0.773
GluVal: 4.546 ± 0.618
GluTrp: 0.772 ± 0.339
GluTyr: 1.287 ± 0.283
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.715 ± 0.509
PheCys: 0.686 ± 0.252
PheAsp: 2.745 ± 0.413
PheGlu: 2.573 ± 0.499
PhePhe: 1.372 ± 0.449
PheGly: 3.088 ± 0.436
PheHis: 0.772 ± 0.298
PheIle: 1.887 ± 0.396
PheLys: 3.088 ± 0.715
PheLeu: 2.573 ± 0.514
PheMet: 0.858 ± 0.291
PheAsn: 1.801 ± 0.258
PhePro: 0.6 ± 0.271
PheGln: 1.801 ± 0.423
PheArg: 1.458 ± 0.317
PheSer: 2.487 ± 0.475
PheThr: 2.316 ± 0.309
PheVal: 1.63 ± 0.405
PheTrp: 0.6 ± 0.208
PheTyr: 2.058 ± 0.388
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.517 ± 0.581
GlyCys: 0.086 ± 0.102
GlyAsp: 4.374 ± 0.665
GlyGlu: 4.289 ± 0.57
GlyPhe: 2.402 ± 0.426
GlyGly: 5.318 ± 0.901
GlyHis: 2.144 ± 0.49
GlyIle: 5.06 ± 0.713
GlyLys: 6.004 ± 0.76
GlyLeu: 6.09 ± 0.731
GlyMet: 1.887 ± 0.362
GlyAsn: 3.431 ± 0.597
GlyPro: 0.772 ± 0.218
GlyGln: 3.002 ± 0.489
GlyArg: 3.86 ± 0.723
GlySer: 4.46 ± 0.988
GlyThr: 4.632 ± 0.736
GlyVal: 4.889 ± 0.816
GlyTrp: 0.686 ± 0.231
GlyTyr: 2.573 ± 0.468
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.029 ± 0.229
HisCys: 0.172 ± 0.109
HisAsp: 1.372 ± 0.362
HisGlu: 0.858 ± 0.251
HisPhe: 1.201 ± 0.358
HisGly: 1.115 ± 0.339
HisHis: 0.772 ± 0.232
HisIle: 1.287 ± 0.282
HisLys: 1.115 ± 0.246
HisLeu: 2.316 ± 0.374
HisMet: 0.429 ± 0.217
HisAsn: 1.201 ± 0.231
HisPro: 1.115 ± 0.377
HisGln: 0.772 ± 0.287
HisArg: 1.372 ± 0.32
HisSer: 1.201 ± 0.268
HisThr: 0.858 ± 0.336
HisVal: 1.287 ± 0.282
HisTrp: 0.172 ± 0.118
HisTyr: 0.6 ± 0.268
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.975 ± 0.424
IleCys: 0.515 ± 0.23
IleAsp: 4.803 ± 0.524
IleGlu: 3.517 ± 0.518
IlePhe: 1.372 ± 0.481
IleGly: 3.945 ± 0.708
IleHis: 0.858 ± 0.235
IleIle: 2.916 ± 0.508
IleLys: 4.975 ± 0.823
IleLeu: 5.318 ± 0.554
IleMet: 1.201 ± 0.362
IleAsn: 2.058 ± 0.417
IlePro: 1.801 ± 0.376
IleGln: 2.316 ± 0.436
IleArg: 2.487 ± 0.501
IleSer: 4.289 ± 0.684
IleThr: 4.117 ± 0.711
IleVal: 3.174 ± 0.628
IleTrp: 1.287 ± 0.351
IleTyr: 2.058 ± 0.384
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.291 ± 0.745
LysCys: 0.6 ± 0.198
LysAsp: 3.86 ± 0.614
LysGlu: 5.146 ± 0.531
LysPhe: 1.973 ± 0.306
LysGly: 5.661 ± 0.533
LysHis: 1.887 ± 0.354
LysIle: 4.117 ± 0.514
LysLys: 4.717 ± 0.687
LysLeu: 5.747 ± 0.865
LysMet: 1.715 ± 0.407
LysAsn: 2.573 ± 0.414
LysPro: 2.573 ± 0.423
LysGln: 3.517 ± 0.489
LysArg: 5.232 ± 0.775
LysSer: 4.203 ± 0.517
LysThr: 4.717 ± 0.745
LysVal: 4.975 ± 0.765
LysTrp: 0.943 ± 0.271
LysTyr: 1.973 ± 0.465
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.862 ± 0.758
LeuCys: 0.6 ± 0.229
LeuAsp: 4.889 ± 0.513
LeuGlu: 7.805 ± 0.768
LeuPhe: 2.745 ± 0.489
LeuGly: 6.175 ± 0.577
LeuHis: 1.201 ± 0.308
LeuIle: 4.374 ± 0.657
LeuLys: 7.033 ± 0.764
LeuLeu: 7.891 ± 0.876
LeuMet: 2.23 ± 0.48
LeuAsn: 3.86 ± 0.611
LeuPro: 3.774 ± 0.731
LeuGln: 3.774 ± 0.505
LeuArg: 3.774 ± 0.644
LeuSer: 7.548 ± 0.71
LeuThr: 7.205 ± 0.717
LeuVal: 6.09 ± 0.673
LeuTrp: 0.772 ± 0.223
LeuTyr: 3.86 ± 0.798
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.487 ± 0.453
MetCys: 0.086 ± 0.075
MetAsp: 1.372 ± 0.462
MetGlu: 1.458 ± 0.473
MetPhe: 0.858 ± 0.256
MetGly: 1.887 ± 0.476
MetHis: 0.0 ± 0.0
MetIle: 1.287 ± 0.32
MetLys: 1.63 ± 0.338
MetLeu: 1.372 ± 0.409
MetMet: 0.686 ± 0.25
MetAsn: 0.6 ± 0.208
MetPro: 0.515 ± 0.191
MetGln: 0.515 ± 0.188
MetArg: 1.287 ± 0.311
MetSer: 2.144 ± 0.423
MetThr: 3.088 ± 0.475
MetVal: 1.544 ± 0.345
MetTrp: 0.086 ± 0.078
MetTyr: 0.429 ± 0.19
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.774 ± 0.552
AsnCys: 0.343 ± 0.242
AsnAsp: 1.63 ± 0.322
AsnGlu: 2.745 ± 0.522
AsnPhe: 1.715 ± 0.5
AsnGly: 4.46 ± 0.735
AsnHis: 0.772 ± 0.287
AsnIle: 1.973 ± 0.342
AsnLys: 3.002 ± 0.448
AsnLeu: 3.259 ± 0.445
AsnMet: 0.858 ± 0.284
AsnAsn: 1.715 ± 0.546
AsnPro: 1.973 ± 0.343
AsnGln: 2.23 ± 0.407
AsnArg: 1.887 ± 0.312
AsnSer: 3.431 ± 0.425
AsnThr: 2.745 ± 0.612
AsnVal: 2.23 ± 0.445
AsnTrp: 1.029 ± 0.344
AsnTyr: 0.943 ± 0.258
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.201 ± 0.299
ProCys: 0.257 ± 0.131
ProAsp: 2.058 ± 0.509
ProGlu: 2.23 ± 0.499
ProPhe: 1.115 ± 0.351
ProGly: 1.201 ± 0.366
ProHis: 0.943 ± 0.239
ProIle: 1.887 ± 0.374
ProLys: 2.487 ± 0.388
ProLeu: 2.659 ± 0.287
ProMet: 0.343 ± 0.159
ProAsn: 1.801 ± 0.447
ProPro: 0.943 ± 0.334
ProGln: 1.458 ± 0.31
ProArg: 1.115 ± 0.27
ProSer: 2.659 ± 0.564
ProThr: 2.23 ± 0.47
ProVal: 1.973 ± 0.449
ProTrp: 0.429 ± 0.184
ProTyr: 1.63 ± 0.349
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.86 ± 0.697
GlnCys: 0.343 ± 0.158
GlnAsp: 1.973 ± 0.387
GlnGlu: 3.774 ± 0.77
GlnPhe: 1.801 ± 0.416
GlnGly: 2.402 ± 0.443
GlnHis: 0.686 ± 0.198
GlnIle: 2.573 ± 0.372
GlnLys: 3.345 ± 0.402
GlnLeu: 4.889 ± 0.551
GlnMet: 1.201 ± 0.305
GlnAsn: 2.058 ± 0.445
GlnPro: 1.544 ± 0.394
GlnGln: 1.973 ± 0.491
GlnArg: 1.544 ± 0.417
GlnSer: 2.745 ± 0.408
GlnThr: 3.174 ± 0.398
GlnVal: 3.174 ± 0.563
GlnTrp: 0.686 ± 0.29
GlnTyr: 0.6 ± 0.188
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.659 ± 0.572
ArgCys: 0.686 ± 0.37
ArgAsp: 2.23 ± 0.533
ArgGlu: 3.602 ± 0.546
ArgPhe: 1.715 ± 0.515
ArgGly: 2.316 ± 0.408
ArgHis: 1.115 ± 0.332
ArgIle: 1.973 ± 0.451
ArgLys: 3.945 ± 0.844
ArgLeu: 5.06 ± 0.604
ArgMet: 0.6 ± 0.189
ArgAsn: 2.144 ± 0.434
ArgPro: 1.458 ± 0.336
ArgGln: 3.517 ± 0.466
ArgArg: 2.745 ± 0.647
ArgSer: 2.916 ± 0.454
ArgThr: 1.973 ± 0.405
ArgVal: 3.517 ± 0.601
ArgTrp: 0.858 ± 0.3
ArgTyr: 1.715 ± 0.355
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.318 ± 0.688
SerCys: 0.343 ± 0.161
SerAsp: 4.374 ± 0.72
SerGlu: 4.632 ± 0.707
SerPhe: 2.659 ± 0.448
SerGly: 4.889 ± 0.715
SerHis: 1.887 ± 0.378
SerIle: 4.289 ± 0.744
SerLys: 4.374 ± 0.726
SerLeu: 5.918 ± 0.698
SerMet: 1.287 ± 0.372
SerAsn: 2.745 ± 0.532
SerPro: 1.973 ± 0.374
SerGln: 2.487 ± 0.489
SerArg: 3.002 ± 0.487
SerSer: 5.318 ± 0.86
SerThr: 4.203 ± 0.758
SerVal: 4.546 ± 0.543
SerTrp: 1.544 ± 0.289
SerTyr: 2.402 ± 0.379
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.975 ± 0.693
ThrCys: 0.429 ± 0.204
ThrAsp: 3.602 ± 0.493
ThrGlu: 4.289 ± 0.523
ThrPhe: 3.259 ± 0.841
ThrGly: 3.774 ± 0.566
ThrHis: 0.858 ± 0.228
ThrIle: 5.489 ± 1.07
ThrLys: 5.06 ± 0.529
ThrLeu: 6.004 ± 0.899
ThrMet: 1.029 ± 0.271
ThrAsn: 2.745 ± 0.395
ThrPro: 3.088 ± 0.718
ThrGln: 2.487 ± 0.589
ThrArg: 2.144 ± 0.452
ThrSer: 5.575 ± 0.76
ThrThr: 5.232 ± 0.637
ThrVal: 5.918 ± 0.718
ThrTrp: 1.029 ± 0.252
ThrTyr: 2.316 ± 0.407
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.431 ± 0.458
ValCys: 0.515 ± 0.185
ValAsp: 3.602 ± 0.628
ValGlu: 5.404 ± 0.705
ValPhe: 2.144 ± 0.466
ValGly: 3.774 ± 0.592
ValHis: 1.029 ± 0.213
ValIle: 4.46 ± 0.673
ValLys: 3.602 ± 0.44
ValLeu: 7.205 ± 0.717
ValMet: 1.201 ± 0.372
ValAsn: 2.23 ± 0.383
ValPro: 2.058 ± 0.424
ValGln: 1.801 ± 0.282
ValArg: 3.259 ± 0.581
ValSer: 5.06 ± 0.7
ValThr: 6.004 ± 0.784
ValVal: 3.86 ± 0.681
ValTrp: 1.458 ± 0.371
ValTyr: 2.659 ± 0.57
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.858 ± 0.236
TrpCys: 0.257 ± 0.151
TrpAsp: 0.858 ± 0.281
TrpGlu: 1.115 ± 0.253
TrpPhe: 0.858 ± 0.306
TrpGly: 0.6 ± 0.191
TrpHis: 0.257 ± 0.114
TrpIle: 0.686 ± 0.225
TrpLys: 0.6 ± 0.281
TrpLeu: 1.287 ± 0.202
TrpMet: 0.429 ± 0.165
TrpAsn: 1.287 ± 0.393
TrpPro: 0.086 ± 0.074
TrpGln: 0.772 ± 0.212
TrpArg: 0.943 ± 0.304
TrpSer: 1.115 ± 0.297
TrpThr: 0.943 ± 0.305
TrpVal: 0.943 ± 0.249
TrpTrp: 0.257 ± 0.133
TrpTyr: 0.172 ± 0.107
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.83 ± 0.406
TyrCys: 0.858 ± 0.389
TyrAsp: 3.174 ± 0.583
TyrGlu: 2.573 ± 0.419
TyrPhe: 1.115 ± 0.3
TyrGly: 2.573 ± 0.465
TyrHis: 1.029 ± 0.332
TyrIle: 1.63 ± 0.422
TyrLys: 2.058 ± 0.37
TyrLeu: 3.174 ± 0.796
TyrMet: 0.772 ± 0.26
TyrAsn: 1.372 ± 0.323
TyrPro: 1.115 ± 0.249
TyrGln: 2.058 ± 0.439
TyrArg: 1.801 ± 0.293
TyrSer: 1.973 ± 0.476
TyrThr: 2.659 ± 0.521
TyrVal: 1.973 ± 0.316
TyrTrp: 0.257 ± 0.139
TyrTyr: 1.029 ± 0.317
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (11660 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
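A protein-level bootstrap of this kind might look like the Python sketch below. It is an illustration under stated assumptions, not the procedure actually used by the database: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken to be the standard deviation of those estimates. The function names, the one-letter sequence encoding and the fixed seed are all hypothetical.

```python
import random
import statistics


def single_dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the reported error is the standard deviation of the
    frequency over the bootstrap resamples.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(single_dipeptide_permille(resample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKTAYIAKQR", "MAAAC", "MLLKAAG"]  # made-up sequences
    mean = single_dipeptide_permille(toy_proteome, "AA")
    err = bootstrap_error(toy_proteome, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps each protein's internal composition intact, so the spread reflects variation between proteins.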

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski