Amino acid dipeptide frequency for Methanothermobacter phage psiM100

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
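The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build the database: the function names, the toy sequences, and the choice to normalize by the number of counted dipeptides are assumptions (the database may use a different denominator), and the over/under labels simply compare each value with the uniform expectation of 0.25% mentioned in the text.

```python
from collections import Counter
from itertools import product

# Standard 20 amino acids in one-letter code, in the same order as the
# table (Ala, Cys, Asp, ...). Xaa is left out of this sketch.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins, alphabet=AMINO_ACIDS):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            if pair[0] in alphabet and pair[1] in alphabet:
                counts[pair] += 1
    total = sum(counts.values())  # assumed denominator: counted dipeptides
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freqs):
    """Label each dipeptide relative to the uniform expectation."""
    expected = 1000.0 / len(freqs)  # 2.5 per mille for 400 dipeptides
    return {
        pair: "over" if f > expected else "under" if f < expected else "expected"
        for pair, f in freqs.items()
    }


toy_proteins = ["MKLLEE", "MEELKK"]  # made-up sequences, not from psiM100
freqs = dipeptide_per_mille(toy_proteins)
labels = classify(freqs)
```

In this toy input, "EE" occurs in 2 of the 10 counted pairs, so it would be labeled "over", while any dipeptide that never occurs is labeled "under".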

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
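As a small worked example of the unit conversion, the sketch below converts one table value (GluGlu, 10.111‰) to a fraction and a percentage, and compares it with the uniform expectation of 0.25%. The helper names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


# Uniform expectation for 400 dipeptides: 0.25 % expressed in per mille.
uniform_per_mille = 1000.0 / 400

glu_glu = 10.111  # GluGlu value taken from the table
fraction = per_mille_to_fraction(glu_glu)
percent = per_mille_to_percent(glu_glu)
fold = glu_glu / uniform_per_mille  # ratio to the uniform expectation
```

Under the uniform baseline, GluGlu would thus be roughly four times more frequent than expected.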

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Second residues are listed in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.269 ± 0.885
AlaCys: 0.674 ± 0.294
AlaAsp: 2.921 ± 0.617
AlaGlu: 5.28 ± 0.88
AlaPhe: 1.91 ± 0.616
AlaGly: 3.258 ± 0.455
AlaHis: 1.011 ± 0.294
AlaIle: 3.595 ± 0.496
AlaLys: 1.685 ± 0.386
AlaLeu: 6.404 ± 0.856
AlaMet: 1.348 ± 0.437
AlaAsn: 1.236 ± 0.355
AlaPro: 1.798 ± 0.511
AlaGln: 0.899 ± 0.336
AlaArg: 4.044 ± 0.623
AlaSer: 3.37 ± 0.568
AlaThr: 2.921 ± 0.506
AlaVal: 3.595 ± 0.9
AlaTrp: 1.348 ± 0.503
AlaTyr: 2.247 ± 0.59
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.112 ± 0.101
CysCys: 0.112 ± 0.109
CysAsp: 0.337 ± 0.252
CysGlu: 0.674 ± 0.31
CysPhe: 0.674 ± 0.228
CysGly: 0.674 ± 0.274
CysHis: 0.112 ± 0.116
CysIle: 0.337 ± 0.225
CysLys: 0.562 ± 0.243
CysLeu: 0.112 ± 0.11
CysMet: 0.225 ± 0.142
CysAsn: 0.337 ± 0.185
CysPro: 0.112 ± 0.113
CysGln: 0.337 ± 0.185
CysArg: 0.449 ± 0.198
CysSer: 0.449 ± 0.3
CysThr: 0.562 ± 0.27
CysVal: 0.674 ± 0.314
CysTrp: 0.225 ± 0.17
CysTyr: 0.112 ± 0.107
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.932 ± 0.731
AspCys: 0.225 ± 0.149
AspAsp: 5.73 ± 0.999
AspGlu: 7.19 ± 1.181
AspPhe: 3.258 ± 0.602
AspGly: 4.044 ± 0.84
AspHis: 1.236 ± 0.329
AspIle: 4.269 ± 0.855
AspLys: 3.707 ± 0.501
AspLeu: 4.606 ± 0.729
AspMet: 2.247 ± 0.5
AspAsn: 1.91 ± 0.398
AspPro: 2.696 ± 0.49
AspGln: 2.135 ± 0.4
AspArg: 2.809 ± 0.539
AspSer: 2.359 ± 0.568
AspThr: 1.91 ± 0.388
AspVal: 3.595 ± 0.686
AspTrp: 1.123 ± 0.462
AspTyr: 3.37 ± 0.721
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.168 ± 1.05
GluCys: 0.562 ± 0.283
GluAsp: 7.64 ± 1.067
GluGlu: 10.111 ± 1.381
GluPhe: 4.157 ± 0.718
GluGly: 6.516 ± 1.164
GluHis: 1.461 ± 0.55
GluIle: 5.73 ± 0.735
GluLys: 6.404 ± 0.829
GluLeu: 7.752 ± 0.939
GluMet: 1.798 ± 0.452
GluAsn: 4.044 ± 0.682
GluPro: 3.37 ± 0.675
GluGln: 1.461 ± 0.383
GluArg: 3.483 ± 0.688
GluSer: 4.382 ± 0.692
GluThr: 3.82 ± 0.878
GluVal: 6.516 ± 0.883
GluTrp: 1.798 ± 0.597
GluTyr: 5.505 ± 0.805
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.011 ± 0.309
PheCys: 0.337 ± 0.199
PheAsp: 2.022 ± 0.486
PheGlu: 2.472 ± 0.621
PhePhe: 1.236 ± 0.308
PheGly: 1.236 ± 0.226
PheHis: 1.011 ± 0.353
PheIle: 3.033 ± 0.847
PheLys: 4.044 ± 1.078
PheLeu: 3.483 ± 0.649
PheMet: 2.247 ± 0.566
PheAsn: 1.798 ± 0.384
PhePro: 1.011 ± 0.337
PheGln: 0.899 ± 0.275
PheArg: 2.359 ± 0.682
PheSer: 3.258 ± 0.497
PheThr: 1.573 ± 0.321
PheVal: 1.685 ± 0.398
PheTrp: 0.337 ± 0.177
PheTyr: 1.348 ± 0.436
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.696 ± 0.537
GlyCys: 0.337 ± 0.206
GlyAsp: 4.494 ± 0.848
GlyGlu: 5.842 ± 0.794
GlyPhe: 2.359 ± 0.423
GlyGly: 6.179 ± 1.187
GlyHis: 1.123 ± 0.392
GlyIle: 4.157 ± 0.715
GlyLys: 3.932 ± 0.621
GlyLeu: 4.943 ± 0.793
GlyMet: 2.022 ± 0.549
GlyAsn: 2.022 ± 0.348
GlyPro: 2.472 ± 0.564
GlyGln: 1.123 ± 0.484
GlyArg: 4.606 ± 0.663
GlySer: 3.595 ± 0.653
GlyThr: 3.146 ± 0.621
GlyVal: 6.179 ± 0.618
GlyTrp: 1.461 ± 0.43
GlyTyr: 3.37 ± 0.501
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.011 ± 0.376
HisCys: 0.112 ± 0.113
HisAsp: 1.461 ± 0.479
HisGlu: 2.247 ± 0.428
HisPhe: 0.337 ± 0.18
HisGly: 0.674 ± 0.349
HisHis: 0.112 ± 0.131
HisIle: 1.798 ± 0.586
HisLys: 1.348 ± 0.325
HisLeu: 1.91 ± 0.432
HisMet: 0.449 ± 0.224
HisAsn: 0.899 ± 0.297
HisPro: 1.573 ± 0.535
HisGln: 0.112 ± 0.11
HisArg: 0.674 ± 0.235
HisSer: 1.123 ± 0.325
HisThr: 0.562 ± 0.288
HisVal: 1.573 ± 0.381
HisTrp: 0.112 ± 0.109
HisTyr: 1.011 ± 0.371
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.247 ± 0.415
IleCys: 0.562 ± 0.24
IleAsp: 3.707 ± 0.705
IleGlu: 5.28 ± 0.834
IlePhe: 2.022 ± 0.423
IleGly: 2.696 ± 0.49
IleHis: 1.573 ± 0.374
IleIle: 4.943 ± 0.968
IleLys: 6.179 ± 0.942
IleLeu: 6.966 ± 0.893
IleMet: 2.247 ± 0.534
IleAsn: 2.921 ± 0.671
IlePro: 4.382 ± 0.757
IleGln: 2.809 ± 0.536
IleArg: 3.595 ± 0.501
IleSer: 4.382 ± 0.719
IleThr: 5.73 ± 0.732
IleVal: 2.921 ± 0.452
IleTrp: 1.236 ± 0.318
IleTyr: 3.033 ± 0.618
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.28 ± 0.696
LysCys: 0.786 ± 0.291
LysAsp: 3.483 ± 0.788
LysGlu: 7.078 ± 0.901
LysPhe: 1.685 ± 0.465
LysGly: 4.157 ± 0.79
LysHis: 0.786 ± 0.233
LysIle: 5.73 ± 0.831
LysLys: 5.505 ± 1.08
LysLeu: 5.617 ± 0.843
LysMet: 0.899 ± 0.307
LysAsn: 3.146 ± 0.597
LysPro: 2.921 ± 0.66
LysGln: 1.348 ± 0.369
LysArg: 4.494 ± 0.962
LysSer: 3.82 ± 0.619
LysThr: 3.37 ± 0.751
LysVal: 5.056 ± 0.681
LysTrp: 1.011 ± 0.554
LysTyr: 2.809 ± 0.629
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.595 ± 0.553
LeuCys: 0.337 ± 0.183
LeuAsp: 4.943 ± 0.705
LeuGlu: 9.437 ± 0.87
LeuPhe: 2.472 ± 0.544
LeuGly: 6.628 ± 0.887
LeuHis: 1.798 ± 0.529
LeuIle: 5.73 ± 1.054
LeuLys: 7.752 ± 0.801
LeuLeu: 8.426 ± 0.907
LeuMet: 2.247 ± 0.506
LeuAsn: 5.28 ± 0.579
LeuPro: 4.494 ± 1.207
LeuGln: 4.269 ± 0.666
LeuArg: 7.415 ± 1.207
LeuSer: 5.28 ± 0.837
LeuThr: 5.505 ± 1.07
LeuVal: 3.595 ± 0.535
LeuTrp: 0.562 ± 0.275
LeuTyr: 2.359 ± 0.561
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.461 ± 0.35
MetCys: 0.112 ± 0.094
MetAsp: 1.798 ± 0.441
MetGlu: 2.809 ± 0.518
MetPhe: 0.674 ± 0.342
MetGly: 1.798 ± 0.307
MetHis: 0.562 ± 0.2
MetIle: 1.91 ± 0.496
MetLys: 3.033 ± 0.468
MetLeu: 1.91 ± 0.572
MetMet: 0.899 ± 0.252
MetAsn: 1.348 ± 0.381
MetPro: 0.337 ± 0.256
MetGln: 0.674 ± 0.321
MetArg: 1.798 ± 0.396
MetSer: 1.348 ± 0.308
MetThr: 1.011 ± 0.469
MetVal: 2.135 ± 0.492
MetTrp: 0.225 ± 0.146
MetTyr: 0.674 ± 0.274
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.461 ± 0.5
AsnCys: 0.337 ± 0.178
AsnAsp: 3.146 ± 0.505
AsnGlu: 2.809 ± 0.435
AsnPhe: 1.348 ± 0.374
AsnGly: 3.033 ± 0.827
AsnHis: 0.674 ± 0.258
AsnIle: 3.707 ± 0.646
AsnLys: 2.696 ± 0.6
AsnLeu: 4.044 ± 0.925
AsnMet: 1.011 ± 0.277
AsnAsn: 1.011 ± 0.387
AsnPro: 3.033 ± 0.757
AsnGln: 1.123 ± 0.3
AsnArg: 2.921 ± 0.605
AsnSer: 1.798 ± 0.48
AsnThr: 1.573 ± 0.369
AsnVal: 1.91 ± 0.443
AsnTrp: 0.449 ± 0.206
AsnTyr: 2.921 ± 0.703
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.798 ± 0.379
ProCys: 0.337 ± 0.179
ProAsp: 2.472 ± 0.538
ProGlu: 3.483 ± 0.595
ProPhe: 1.798 ± 0.43
ProGly: 2.696 ± 0.61
ProHis: 1.123 ± 0.428
ProIle: 2.247 ± 0.396
ProLys: 2.584 ± 0.55
ProLeu: 4.606 ± 0.806
ProMet: 0.562 ± 0.257
ProAsn: 2.247 ± 0.696
ProPro: 2.359 ± 0.473
ProGln: 1.123 ± 0.441
ProArg: 2.135 ± 0.391
ProSer: 4.719 ± 0.963
ProThr: 1.573 ± 0.55
ProVal: 3.82 ± 0.685
ProTrp: 0.225 ± 0.136
ProTyr: 1.348 ± 0.326
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.91 ± 0.487
GlnCys: 0.449 ± 0.188
GlnAsp: 1.011 ± 0.427
GlnGlu: 2.247 ± 0.441
GlnPhe: 0.562 ± 0.266
GlnGly: 2.584 ± 0.594
GlnHis: 1.011 ± 0.392
GlnIle: 1.573 ± 0.252
GlnLys: 1.573 ± 0.439
GlnLeu: 2.022 ± 0.349
GlnMet: 0.674 ± 0.314
GlnAsn: 1.011 ± 0.283
GlnPro: 1.123 ± 0.507
GlnGln: 0.112 ± 0.114
GlnArg: 1.573 ± 0.57
GlnSer: 1.123 ± 0.382
GlnThr: 1.461 ± 0.36
GlnVal: 1.91 ± 0.449
GlnTrp: 0.112 ± 0.108
GlnTyr: 1.348 ± 0.434
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.921 ± 0.641
ArgCys: 0.449 ± 0.271
ArgAsp: 3.483 ± 0.562
ArgGlu: 6.516 ± 0.92
ArgPhe: 2.696 ± 0.617
ArgGly: 3.483 ± 0.57
ArgHis: 0.449 ± 0.232
ArgIle: 4.606 ± 0.825
ArgLys: 4.269 ± 0.871
ArgLeu: 5.168 ± 0.832
ArgMet: 2.135 ± 0.503
ArgAsn: 2.135 ± 0.493
ArgPro: 1.685 ± 0.488
ArgGln: 1.011 ± 0.229
ArgArg: 5.056 ± 0.767
ArgSer: 2.472 ± 0.616
ArgThr: 2.247 ± 0.492
ArgVal: 5.056 ± 0.804
ArgTrp: 0.674 ± 0.233
ArgTyr: 2.809 ± 0.681
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.932 ± 0.863
SerCys: 0.562 ± 0.299
SerAsp: 3.37 ± 0.579
SerGlu: 4.831 ± 0.957
SerPhe: 3.033 ± 0.766
SerGly: 3.932 ± 0.794
SerHis: 1.236 ± 0.328
SerIle: 5.168 ± 1.047
SerLys: 2.921 ± 0.76
SerLeu: 6.628 ± 0.652
SerMet: 1.011 ± 0.293
SerAsn: 1.685 ± 0.413
SerPro: 2.584 ± 0.42
SerGln: 2.135 ± 0.49
SerArg: 2.696 ± 0.552
SerSer: 4.606 ± 0.75
SerThr: 2.809 ± 0.499
SerVal: 3.258 ± 0.669
SerTrp: 0.674 ± 0.369
SerTyr: 3.146 ± 0.616
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.707 ± 0.734
ThrCys: 0.225 ± 0.134
ThrAsp: 3.146 ± 0.492
ThrGlu: 3.146 ± 0.597
ThrPhe: 1.91 ± 0.566
ThrGly: 4.943 ± 0.549
ThrHis: 1.236 ± 0.356
ThrIle: 4.494 ± 0.608
ThrLys: 2.247 ± 0.546
ThrLeu: 5.73 ± 0.854
ThrMet: 1.011 ± 0.357
ThrAsn: 1.798 ± 0.407
ThrPro: 2.247 ± 0.541
ThrGln: 1.011 ± 0.354
ThrArg: 2.472 ± 0.693
ThrSer: 3.258 ± 0.594
ThrThr: 3.483 ± 0.737
ThrVal: 4.382 ± 0.886
ThrTrp: 0.562 ± 0.272
ThrTyr: 2.584 ± 0.511
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.943 ± 1.0
ValCys: 0.337 ± 0.175
ValAsp: 4.269 ± 0.647
ValGlu: 5.393 ± 0.84
ValPhe: 2.135 ± 0.465
ValGly: 3.37 ± 0.697
ValHis: 1.461 ± 0.379
ValIle: 3.146 ± 0.534
ValLys: 5.393 ± 0.743
ValLeu: 5.056 ± 0.731
ValMet: 1.798 ± 0.42
ValAsn: 3.37 ± 0.757
ValPro: 2.472 ± 0.583
ValGln: 1.573 ± 0.376
ValArg: 3.258 ± 0.745
ValSer: 5.28 ± 0.853
ValThr: 5.842 ± 0.874
ValVal: 3.146 ± 0.615
ValTrp: 0.674 ± 0.202
ValTyr: 2.472 ± 0.482
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.337 ± 0.203
TrpCys: 0.0 ± 0.0
TrpAsp: 1.011 ± 0.332
TrpGlu: 1.461 ± 0.447
TrpPhe: 0.112 ± 0.117
TrpGly: 0.786 ± 0.227
TrpHis: 0.112 ± 0.108
TrpIle: 0.449 ± 0.26
TrpLys: 0.899 ± 0.24
TrpLeu: 1.123 ± 0.325
TrpMet: 0.786 ± 0.335
TrpAsn: 0.899 ± 0.261
TrpPro: 1.011 ± 0.478
TrpGln: 0.112 ± 0.111
TrpArg: 1.123 ± 0.292
TrpSer: 0.786 ± 0.332
TrpThr: 0.674 ± 0.333
TrpVal: 1.348 ± 0.408
TrpTrp: 0.112 ± 0.116
TrpTyr: 0.337 ± 0.171
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.022 ± 0.549
TyrCys: 0.337 ± 0.166
TyrAsp: 1.91 ± 0.492
TyrGlu: 3.37 ± 0.718
TyrPhe: 2.022 ± 0.478
TyrGly: 3.37 ± 0.587
TyrHis: 1.011 ± 0.342
TyrIle: 2.809 ± 0.644
TyrLys: 2.247 ± 0.576
TyrLeu: 5.617 ± 0.707
TyrMet: 0.786 ± 0.339
TyrAsn: 1.91 ± 0.408
TyrPro: 1.573 ± 0.512
TyrGln: 1.123 ± 0.331
TyrArg: 2.359 ± 0.703
TyrSer: 2.809 ± 0.502
TyrThr: 3.82 ± 0.624
TyrVal: 3.146 ± 0.403
TyrTrp: 0.562 ± 0.266
TyrTyr: 1.91 ± 0.485
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 36 proteins (8902 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
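A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values is taken as the error. This is a minimal sketch under stated assumptions, not the database's implementation: the function names, the toy sequences, the use of the standard deviation as the error measure, and the normalization by the number of dipeptides are all assumptions; only the 100 replicates and the protein-level resampling come from the note above.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy_proteins = ["MKLLEE", "MEELKK", "MAAAGG", "MEEEEK"]  # made-up sequences
point = pair_per_mille(toy_proteins, "EE")
error = bootstrap_error(toy_proteins, "EE")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much a frequency depends on which proteins happen to be in the proteome.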

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski