Amino acid dipeptide frequency for Escherichia phage vB_EcoP_S523

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
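The comparison against the uniform expectation can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names, the mapping of every non-standard letter to `X`, the counting of overlapping pairs within each protein, and the normalisation by the total pair count are all assumptions.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"
```

With this threshold, AlaAla (9.401‰) would be labelled "over" and AlaCys (0.839‰) "under".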

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain a fraction.
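As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 9.401 per mille.
ala_ala_fraction = per_mille_to_fraction(9.401)  # about 0.009401
ala_ala_percent = per_mille_to_percent(9.401)    # about 0.9401
```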

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.401 ± 1.559
AlaCys: 0.839 ± 0.235
AlaAsp: 5.96 ± 1.128
AlaGlu: 5.876 ± 0.768
AlaPhe: 3.106 ± 0.457
AlaGly: 8.058 ± 1.023
AlaHis: 1.511 ± 0.298
AlaIle: 4.869 ± 0.641
AlaLys: 7.471 ± 0.928
AlaLeu: 6.967 ± 0.966
AlaMet: 2.854 ± 0.549
AlaAsn: 4.869 ± 0.852
AlaPro: 2.77 ± 0.423
AlaGln: 4.029 ± 0.498
AlaArg: 4.701 ± 0.527
AlaSer: 4.365 ± 0.763
AlaThr: 3.358 ± 0.482
AlaVal: 5.456 ± 0.688
AlaTrp: 1.343 ± 0.306
AlaTyr: 3.442 ± 0.655
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.755 ± 0.286
CysCys: 0.084 ± 0.088
CysAsp: 0.588 ± 0.307
CysGlu: 0.588 ± 0.192
CysPhe: 0.588 ± 0.2
CysGly: 0.504 ± 0.24
CysHis: 0.252 ± 0.141
CysIle: 0.42 ± 0.221
CysLys: 0.252 ± 0.139
CysLeu: 1.343 ± 0.392
CysMet: 0.0 ± 0.0
CysAsn: 0.42 ± 0.166
CysPro: 0.42 ± 0.179
CysGln: 0.42 ± 0.171
CysArg: 0.672 ± 0.282
CysSer: 0.588 ± 0.207
CysThr: 0.252 ± 0.152
CysVal: 0.504 ± 0.216
CysTrp: 0.168 ± 0.142
CysTyr: 0.42 ± 0.199
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.869 ± 0.731
AspCys: 0.504 ± 0.21
AspAsp: 4.281 ± 0.826
AspGlu: 4.029 ± 0.597
AspPhe: 2.434 ± 0.385
AspGly: 7.219 ± 0.98
AspHis: 0.923 ± 0.268
AspIle: 2.854 ± 0.489
AspLys: 3.693 ± 0.612
AspLeu: 3.358 ± 0.644
AspMet: 2.938 ± 0.578
AspAsn: 1.931 ± 0.297
AspPro: 3.442 ± 0.643
AspGln: 2.015 ± 0.471
AspArg: 3.442 ± 0.854
AspSer: 4.029 ± 0.548
AspThr: 3.358 ± 0.42
AspVal: 4.281 ± 0.509
AspTrp: 0.839 ± 0.352
AspTyr: 1.763 ± 0.343
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.891 ± 1.047
GluCys: 0.672 ± 0.304
GluAsp: 4.617 ± 0.617
GluGlu: 5.792 ± 1.12
GluPhe: 2.854 ± 0.481
GluGly: 4.869 ± 0.699
GluHis: 1.763 ± 0.444
GluIle: 2.77 ± 0.383
GluLys: 3.274 ± 0.626
GluLeu: 6.464 ± 0.687
GluMet: 2.518 ± 0.426
GluAsn: 3.274 ± 0.648
GluPro: 1.931 ± 0.423
GluGln: 3.945 ± 0.495
GluArg: 4.701 ± 0.618
GluSer: 3.19 ± 0.606
GluThr: 3.861 ± 0.451
GluVal: 4.617 ± 0.753
GluTrp: 1.259 ± 0.335
GluTyr: 3.106 ± 0.51
GluXaa: 0.084 ± 0.086
Phe
PheAla: 2.35 ± 0.555
PheCys: 0.42 ± 0.194
PheAsp: 2.099 ± 0.491
PheGlu: 2.099 ± 0.423
PhePhe: 1.007 ± 0.267
PheGly: 2.938 ± 0.491
PheHis: 1.091 ± 0.291
PheIle: 1.679 ± 0.44
PheLys: 2.518 ± 0.474
PheLeu: 3.022 ± 0.57
PheMet: 1.343 ± 0.34
PheAsn: 1.679 ± 0.426
PhePro: 1.847 ± 0.442
PheGln: 1.343 ± 0.313
PheArg: 2.099 ± 0.337
PheSer: 1.931 ± 0.468
PheThr: 3.442 ± 0.624
PheVal: 1.847 ± 0.354
PheTrp: 0.252 ± 0.161
PheTyr: 1.343 ± 0.324
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.715 ± 0.759
GlyCys: 0.839 ± 0.359
GlyAsp: 5.792 ± 0.743
GlyGlu: 5.54 ± 0.621
GlyPhe: 2.602 ± 0.36
GlyGly: 6.044 ± 0.846
GlyHis: 0.923 ± 0.255
GlyIle: 3.945 ± 0.587
GlyLys: 7.219 ± 1.1
GlyLeu: 6.715 ± 0.983
GlyMet: 2.518 ± 0.435
GlyAsn: 3.526 ± 0.568
GlyPro: 0.336 ± 0.129
GlyGln: 2.686 ± 0.475
GlyArg: 4.785 ± 0.604
GlySer: 5.12 ± 0.692
GlyThr: 3.777 ± 0.632
GlyVal: 4.533 ± 0.656
GlyTrp: 2.266 ± 0.594
GlyTyr: 3.106 ± 0.629
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.931 ± 0.406
HisCys: 0.252 ± 0.125
HisAsp: 1.343 ± 0.325
HisGlu: 1.259 ± 0.367
HisPhe: 0.923 ± 0.251
HisGly: 1.595 ± 0.366
HisHis: 0.42 ± 0.193
HisIle: 0.923 ± 0.335
HisLys: 1.175 ± 0.34
HisLeu: 1.679 ± 0.412
HisMet: 0.504 ± 0.19
HisAsn: 0.839 ± 0.293
HisPro: 0.252 ± 0.162
HisGln: 0.252 ± 0.138
HisArg: 0.588 ± 0.211
HisSer: 1.259 ± 0.301
HisThr: 1.175 ± 0.311
HisVal: 1.595 ± 0.3
HisTrp: 0.42 ± 0.167
HisTyr: 0.42 ± 0.15
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.029 ± 0.608
IleCys: 0.504 ± 0.21
IleAsp: 4.113 ± 0.52
IleGlu: 3.274 ± 0.483
IlePhe: 0.923 ± 0.243
IleGly: 3.526 ± 0.415
IleHis: 0.923 ± 0.393
IleIle: 2.518 ± 0.529
IleLys: 4.029 ± 0.459
IleLeu: 3.19 ± 0.422
IleMet: 1.007 ± 0.335
IleAsn: 1.931 ± 0.618
IlePro: 2.434 ± 0.487
IleGln: 2.015 ± 0.568
IleArg: 3.61 ± 0.512
IleSer: 2.602 ± 0.531
IleThr: 2.686 ± 0.437
IleVal: 2.854 ± 0.413
IleTrp: 0.839 ± 0.281
IleTyr: 1.427 ± 0.341
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.226 ± 1.009
LysCys: 0.588 ± 0.206
LysAsp: 3.945 ± 0.49
LysGlu: 6.128 ± 0.633
LysPhe: 2.015 ± 0.499
LysGly: 5.372 ± 0.7
LysHis: 1.511 ± 0.395
LysIle: 2.266 ± 0.411
LysLys: 5.624 ± 1.032
LysLeu: 5.456 ± 0.777
LysMet: 2.35 ± 0.402
LysAsn: 2.434 ± 0.45
LysPro: 2.854 ± 0.601
LysGln: 2.854 ± 0.537
LysArg: 3.945 ± 0.713
LysSer: 3.777 ± 0.729
LysThr: 2.854 ± 0.444
LysVal: 5.624 ± 0.692
LysTrp: 1.007 ± 0.353
LysTyr: 2.434 ± 0.442
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.394 ± 0.85
LeuCys: 0.252 ± 0.133
LeuAsp: 4.197 ± 0.555
LeuGlu: 6.128 ± 1.015
LeuPhe: 2.434 ± 0.412
LeuGly: 3.861 ± 0.529
LeuHis: 1.175 ± 0.266
LeuIle: 4.113 ± 0.61
LeuLys: 6.715 ± 0.748
LeuLeu: 4.197 ± 0.672
LeuMet: 2.518 ± 0.493
LeuAsn: 4.365 ± 0.783
LeuPro: 2.77 ± 0.487
LeuGln: 3.61 ± 0.546
LeuArg: 5.792 ± 0.565
LeuSer: 4.029 ± 0.603
LeuThr: 4.197 ± 0.598
LeuVal: 5.037 ± 0.66
LeuTrp: 1.091 ± 0.327
LeuTyr: 2.518 ± 0.469
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.434 ± 0.44
MetCys: 0.252 ± 0.17
MetAsp: 1.763 ± 0.431
MetGlu: 2.182 ± 0.447
MetPhe: 1.427 ± 0.312
MetGly: 2.686 ± 0.563
MetHis: 0.336 ± 0.166
MetIle: 1.259 ± 0.338
MetLys: 1.763 ± 0.395
MetLeu: 3.106 ± 0.51
MetMet: 0.588 ± 0.213
MetAsn: 1.763 ± 0.437
MetPro: 1.343 ± 0.33
MetGln: 1.595 ± 0.482
MetArg: 0.839 ± 0.25
MetSer: 2.182 ± 0.303
MetThr: 1.847 ± 0.411
MetVal: 2.35 ± 0.58
MetTrp: 0.084 ± 0.075
MetTyr: 0.672 ± 0.232
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.61 ± 0.641
AsnCys: 0.42 ± 0.157
AsnAsp: 2.77 ± 0.409
AsnGlu: 3.442 ± 0.452
AsnPhe: 2.015 ± 0.247
AsnGly: 4.869 ± 0.788
AsnHis: 1.091 ± 0.285
AsnIle: 2.434 ± 0.363
AsnLys: 3.274 ± 0.483
AsnLeu: 3.274 ± 0.43
AsnMet: 1.175 ± 0.381
AsnAsn: 1.511 ± 0.553
AsnPro: 2.099 ± 0.304
AsnGln: 1.595 ± 0.327
AsnArg: 2.854 ± 0.751
AsnSer: 2.686 ± 0.622
AsnThr: 2.266 ± 0.513
AsnVal: 2.938 ± 0.488
AsnTrp: 0.504 ± 0.221
AsnTyr: 1.763 ± 0.41
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.77 ± 0.421
ProCys: 0.336 ± 0.18
ProAsp: 2.434 ± 0.419
ProGlu: 3.693 ± 0.827
ProPhe: 1.343 ± 0.338
ProGly: 1.343 ± 0.223
ProHis: 0.672 ± 0.237
ProIle: 1.343 ± 0.272
ProLys: 3.022 ± 0.557
ProLeu: 2.434 ± 0.432
ProMet: 1.007 ± 0.412
ProAsn: 2.182 ± 0.547
ProPro: 0.672 ± 0.196
ProGln: 0.755 ± 0.24
ProArg: 1.343 ± 0.35
ProSer: 2.015 ± 0.348
ProThr: 2.182 ± 0.386
ProVal: 3.19 ± 0.43
ProTrp: 0.588 ± 0.189
ProTyr: 1.259 ± 0.35
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.785 ± 0.555
GlnCys: 0.168 ± 0.13
GlnAsp: 1.763 ± 0.334
GlnGlu: 2.518 ± 0.545
GlnPhe: 1.931 ± 0.278
GlnGly: 3.022 ± 0.613
GlnHis: 0.504 ± 0.243
GlnIle: 2.266 ± 0.435
GlnLys: 1.511 ± 0.475
GlnLeu: 3.777 ± 0.564
GlnMet: 1.091 ± 0.237
GlnAsn: 1.175 ± 0.258
GlnPro: 1.679 ± 0.357
GlnGln: 2.015 ± 0.35
GlnArg: 1.679 ± 0.305
GlnSer: 2.182 ± 0.527
GlnThr: 2.015 ± 0.527
GlnVal: 2.266 ± 0.448
GlnTrp: 0.839 ± 0.235
GlnTyr: 1.763 ± 0.426
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.953 ± 0.786
ArgCys: 0.755 ± 0.291
ArgAsp: 3.19 ± 0.499
ArgGlu: 4.197 ± 0.598
ArgPhe: 2.434 ± 0.471
ArgGly: 3.861 ± 0.457
ArgHis: 0.923 ± 0.284
ArgIle: 2.35 ± 0.491
ArgLys: 4.449 ± 0.802
ArgLeu: 4.785 ± 0.616
ArgMet: 1.595 ± 0.385
ArgAsn: 3.777 ± 0.65
ArgPro: 1.595 ± 0.35
ArgGln: 1.679 ± 0.359
ArgArg: 2.099 ± 0.283
ArgSer: 4.113 ± 0.682
ArgThr: 2.686 ± 0.336
ArgVal: 4.197 ± 0.484
ArgTrp: 0.839 ± 0.338
ArgTyr: 1.343 ± 0.208
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.617 ± 0.661
SerCys: 0.672 ± 0.259
SerAsp: 3.861 ± 0.594
SerGlu: 3.61 ± 0.675
SerPhe: 2.35 ± 0.405
SerGly: 5.288 ± 0.654
SerHis: 1.511 ± 0.341
SerIle: 3.19 ± 0.492
SerLys: 4.365 ± 0.718
SerLeu: 3.358 ± 0.496
SerMet: 1.847 ± 0.5
SerAsn: 2.686 ± 0.473
SerPro: 2.182 ± 0.464
SerGln: 1.931 ± 0.447
SerArg: 3.274 ± 0.498
SerSer: 2.434 ± 0.482
SerThr: 3.19 ± 0.421
SerVal: 3.61 ± 0.439
SerTrp: 0.504 ± 0.207
SerTyr: 2.266 ± 0.675
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.61 ± 0.628
ThrCys: 0.42 ± 0.192
ThrAsp: 3.19 ± 0.467
ThrGlu: 4.701 ± 0.637
ThrPhe: 1.931 ± 0.415
ThrGly: 5.037 ± 0.701
ThrHis: 1.007 ± 0.248
ThrIle: 3.61 ± 0.553
ThrLys: 3.358 ± 0.523
ThrLeu: 5.372 ± 0.718
ThrMet: 1.259 ± 0.303
ThrAsn: 2.015 ± 0.44
ThrPro: 2.099 ± 0.371
ThrGln: 2.015 ± 0.604
ThrArg: 2.686 ± 0.386
ThrSer: 3.61 ± 0.619
ThrThr: 2.938 ± 0.444
ThrVal: 3.358 ± 0.456
ThrTrp: 0.672 ± 0.244
ThrTyr: 1.091 ± 0.312
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.876 ± 0.662
ValCys: 0.504 ± 0.211
ValAsp: 3.442 ± 0.364
ValGlu: 4.953 ± 0.671
ValPhe: 2.602 ± 0.636
ValGly: 4.869 ± 0.773
ValHis: 1.427 ± 0.297
ValIle: 3.19 ± 0.552
ValLys: 4.365 ± 0.718
ValLeu: 5.037 ± 0.733
ValMet: 1.847 ± 0.404
ValAsn: 3.442 ± 0.785
ValPro: 2.434 ± 0.422
ValGln: 2.099 ± 0.413
ValArg: 4.197 ± 0.605
ValSer: 4.197 ± 0.68
ValThr: 4.785 ± 0.513
ValVal: 4.029 ± 0.732
ValTrp: 1.007 ± 0.379
ValTyr: 2.099 ± 0.522
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.755 ± 0.288
TrpCys: 0.252 ± 0.161
TrpAsp: 0.252 ± 0.162
TrpGlu: 1.343 ± 0.287
TrpPhe: 0.42 ± 0.193
TrpGly: 0.923 ± 0.282
TrpHis: 0.336 ± 0.195
TrpIle: 0.839 ± 0.308
TrpLys: 1.679 ± 0.395
TrpLeu: 1.595 ± 0.411
TrpMet: 0.42 ± 0.166
TrpAsn: 0.923 ± 0.275
TrpPro: 0.252 ± 0.125
TrpGln: 0.672 ± 0.221
TrpArg: 0.755 ± 0.27
TrpSer: 0.923 ± 0.314
TrpThr: 1.175 ± 0.316
TrpVal: 1.343 ± 0.364
TrpTrp: 0.168 ± 0.119
TrpTyr: 0.336 ± 0.18
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.274 ± 0.471
TyrCys: 0.504 ± 0.198
TyrAsp: 2.602 ± 0.387
TyrGlu: 2.099 ± 0.445
TyrPhe: 1.007 ± 0.298
TyrGly: 3.106 ± 0.329
TyrHis: 0.504 ± 0.215
TyrIle: 1.595 ± 0.449
TyrLys: 1.511 ± 0.413
TyrLeu: 2.266 ± 0.422
TyrMet: 0.923 ± 0.27
TyrAsn: 1.847 ± 0.33
TyrPro: 1.175 ± 0.324
TyrGln: 1.427 ± 0.435
TyrArg: 1.763 ± 0.29
TyrSer: 1.595 ± 0.436
TyrThr: 1.931 ± 0.488
TyrVal: 2.77 ± 0.504
TyrTrp: 0.672 ± 0.198
TyrTyr: 1.091 ± 0.333
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.084 ± 0.087
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (11914 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
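A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the sample standard deviation across replicates is taken as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_freqs(proteins):
    """Dipeptide frequencies in per mille, counting overlapping pairs per protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation over n_rounds
    replicates, each drawn with replacement from the list of proteins.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    samples = []
    for _ in range(n_rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_freqs(resampled).get(dipeptide, 0.0))
    return statistics.stdev(samples)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are in the proteome.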

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski