Amino acid dipepetide frequency for Deep-sea thermophilic phage D6E

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.685AlaAla: 5.685 ± 0.714
0.28AlaCys: 0.28 ± 0.19
5.405AlaAsp: 5.405 ± 0.602
6.244AlaGlu: 6.244 ± 0.852
2.889AlaPhe: 2.889 ± 0.616
3.914AlaGly: 3.914 ± 0.614
1.678AlaHis: 1.678 ± 0.394
4.939AlaIle: 4.939 ± 0.704
5.499AlaLys: 5.499 ± 0.628
5.871AlaLeu: 5.871 ± 0.663
1.957AlaMet: 1.957 ± 0.354
3.262AlaAsn: 3.262 ± 0.617
1.118AlaPro: 1.118 ± 0.441
3.541AlaGln: 3.541 ± 0.608
3.075AlaArg: 3.075 ± 0.476
4.101AlaSer: 4.101 ± 0.715
3.728AlaThr: 3.728 ± 0.714
6.058AlaVal: 6.058 ± 0.793
1.025AlaTrp: 1.025 ± 0.248
2.796AlaTyr: 2.796 ± 0.528
0.0AlaXaa: 0.0 ± 0.0
Cys
0.28CysAla: 0.28 ± 0.224
0.28CysCys: 0.28 ± 0.142
0.746CysAsp: 0.746 ± 0.282
0.652CysGlu: 0.652 ± 0.256
0.186CysPhe: 0.186 ± 0.108
0.466CysGly: 0.466 ± 0.21
0.373CysHis: 0.373 ± 0.219
0.466CysIle: 0.466 ± 0.24
0.466CysLys: 0.466 ± 0.166
0.559CysLeu: 0.559 ± 0.196
0.466CysMet: 0.466 ± 0.214
0.186CysAsn: 0.186 ± 0.13
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.373CysArg: 0.373 ± 0.172
0.373CysSer: 0.373 ± 0.192
0.466CysThr: 0.466 ± 0.17
0.28CysVal: 0.28 ± 0.206
0.093CysTrp: 0.093 ± 0.092
0.28CysTyr: 0.28 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
4.007AspAla: 4.007 ± 0.461
0.932AspCys: 0.932 ± 0.324
4.287AspAsp: 4.287 ± 0.785
5.778AspGlu: 5.778 ± 0.73
2.237AspPhe: 2.237 ± 0.485
5.685AspGly: 5.685 ± 0.943
0.746AspHis: 0.746 ± 0.252
4.287AspIle: 4.287 ± 0.568
3.914AspLys: 3.914 ± 0.505
4.753AspLeu: 4.753 ± 0.776
2.516AspMet: 2.516 ± 0.542
2.144AspAsn: 2.144 ± 0.387
2.144AspPro: 2.144 ± 0.377
1.957AspGln: 1.957 ± 0.359
3.728AspArg: 3.728 ± 0.568
2.33AspSer: 2.33 ± 0.416
3.075AspThr: 3.075 ± 0.621
4.38AspVal: 4.38 ± 0.673
0.746AspTrp: 0.746 ± 0.291
3.075AspTyr: 3.075 ± 0.502
0.0AspXaa: 0.0 ± 0.0
Glu
6.244GluAla: 6.244 ± 0.892
0.373GluCys: 0.373 ± 0.214
3.355GluAsp: 3.355 ± 0.532
6.244GluGlu: 6.244 ± 1.076
1.771GluPhe: 1.771 ± 0.43
3.914GluGly: 3.914 ± 0.6
1.584GluHis: 1.584 ± 0.341
5.126GluIle: 5.126 ± 0.647
7.456GluLys: 7.456 ± 0.994
6.244GluLeu: 6.244 ± 0.774
2.237GluMet: 2.237 ± 0.435
4.567GluAsn: 4.567 ± 0.536
2.144GluPro: 2.144 ± 0.539
4.939GluGln: 4.939 ± 0.616
5.871GluArg: 5.871 ± 0.785
2.703GluSer: 2.703 ± 0.512
4.939GluThr: 4.939 ± 0.643
5.312GluVal: 5.312 ± 0.611
1.305GluTrp: 1.305 ± 0.369
2.796GluTyr: 2.796 ± 0.492
0.0GluXaa: 0.0 ± 0.0
Phe
2.05PheAla: 2.05 ± 0.388
0.186PheCys: 0.186 ± 0.154
2.237PheAsp: 2.237 ± 0.37
2.796PheGlu: 2.796 ± 0.506
2.144PhePhe: 2.144 ± 0.436
3.355PheGly: 3.355 ± 0.492
0.373PheHis: 0.373 ± 0.218
2.33PheIle: 2.33 ± 0.52
2.516PheLys: 2.516 ± 0.446
3.914PheLeu: 3.914 ± 0.642
1.118PheMet: 1.118 ± 0.324
1.584PheAsn: 1.584 ± 0.445
1.118PhePro: 1.118 ± 0.32
1.305PheGln: 1.305 ± 0.33
1.398PheArg: 1.398 ± 0.366
2.61PheSer: 2.61 ± 0.644
2.144PheThr: 2.144 ± 0.448
2.33PheVal: 2.33 ± 0.435
0.373PheTrp: 0.373 ± 0.188
1.491PheTyr: 1.491 ± 0.398
0.0PheXaa: 0.0 ± 0.0
Gly
3.914GlyAla: 3.914 ± 0.634
0.28GlyCys: 0.28 ± 0.17
4.473GlyAsp: 4.473 ± 0.716
5.499GlyGlu: 5.499 ± 0.923
2.703GlyPhe: 2.703 ± 0.453
5.033GlyGly: 5.033 ± 0.696
0.839GlyHis: 0.839 ± 0.248
4.66GlyIle: 4.66 ± 0.714
4.939GlyLys: 4.939 ± 0.749
5.778GlyLeu: 5.778 ± 0.861
1.678GlyMet: 1.678 ± 0.391
2.423GlyAsn: 2.423 ± 0.634
0.839GlyPro: 0.839 ± 0.281
2.05GlyGln: 2.05 ± 0.479
3.635GlyArg: 3.635 ± 0.74
3.262GlySer: 3.262 ± 0.43
4.101GlyThr: 4.101 ± 0.845
4.939GlyVal: 4.939 ± 0.617
0.559GlyTrp: 0.559 ± 0.19
2.889GlyTyr: 2.889 ± 0.484
0.0GlyXaa: 0.0 ± 0.0
His
1.305HisAla: 1.305 ± 0.32
0.28HisCys: 0.28 ± 0.148
1.212HisAsp: 1.212 ± 0.393
1.118HisGlu: 1.118 ± 0.354
0.559HisPhe: 0.559 ± 0.205
0.932HisGly: 0.932 ± 0.286
0.373HisHis: 0.373 ± 0.199
1.491HisIle: 1.491 ± 0.326
1.305HisLys: 1.305 ± 0.261
1.212HisLeu: 1.212 ± 0.297
0.373HisMet: 0.373 ± 0.206
0.746HisAsn: 0.746 ± 0.282
1.025HisPro: 1.025 ± 0.282
0.186HisGln: 0.186 ± 0.133
1.398HisArg: 1.398 ± 0.379
0.466HisSer: 0.466 ± 0.212
1.025HisThr: 1.025 ± 0.294
0.746HisVal: 0.746 ± 0.267
0.093HisTrp: 0.093 ± 0.096
1.678HisTyr: 1.678 ± 0.413
0.0HisXaa: 0.0 ± 0.0
Ile
5.778IleAla: 5.778 ± 0.573
0.559IleCys: 0.559 ± 0.232
4.567IleAsp: 4.567 ± 0.798
7.735IleGlu: 7.735 ± 0.935
1.491IlePhe: 1.491 ± 0.369
4.007IleGly: 4.007 ± 0.624
1.212IleHis: 1.212 ± 0.248
4.939IleIle: 4.939 ± 0.757
5.219IleLys: 5.219 ± 0.938
4.007IleLeu: 4.007 ± 0.643
1.584IleMet: 1.584 ± 0.475
2.703IleAsn: 2.703 ± 0.494
3.169IlePro: 3.169 ± 0.679
2.61IleGln: 2.61 ± 0.462
4.66IleArg: 4.66 ± 0.62
3.169IleSer: 3.169 ± 0.618
3.728IleThr: 3.728 ± 0.551
3.635IleVal: 3.635 ± 0.703
0.559IleTrp: 0.559 ± 0.227
1.864IleTyr: 1.864 ± 0.46
0.0IleXaa: 0.0 ± 0.0
Lys
5.965LysAla: 5.965 ± 0.919
0.186LysCys: 0.186 ± 0.125
4.753LysAsp: 4.753 ± 0.788
5.965LysGlu: 5.965 ± 1.011
2.144LysPhe: 2.144 ± 0.481
5.685LysGly: 5.685 ± 0.653
0.839LysHis: 0.839 ± 0.205
4.194LysIle: 4.194 ± 0.666
6.897LysLys: 6.897 ± 1.101
7.735LysLeu: 7.735 ± 1.026
2.61LysMet: 2.61 ± 0.496
3.728LysAsn: 3.728 ± 0.502
2.61LysPro: 2.61 ± 0.514
4.101LysGln: 4.101 ± 0.753
4.66LysArg: 4.66 ± 0.613
2.982LysSer: 2.982 ± 0.618
5.033LysThr: 5.033 ± 0.551
5.033LysVal: 5.033 ± 0.668
1.118LysTrp: 1.118 ± 0.294
2.889LysTyr: 2.889 ± 0.596
0.0LysXaa: 0.0 ± 0.0
Leu
6.431LeuAla: 6.431 ± 0.76
0.559LeuCys: 0.559 ± 0.233
6.058LeuAsp: 6.058 ± 0.613
5.405LeuGlu: 5.405 ± 0.724
3.075LeuPhe: 3.075 ± 0.545
4.101LeuGly: 4.101 ± 0.648
1.212LeuHis: 1.212 ± 0.286
5.033LeuIle: 5.033 ± 0.6
7.176LeuLys: 7.176 ± 0.889
6.337LeuLeu: 6.337 ± 0.759
1.584LeuMet: 1.584 ± 0.348
3.262LeuAsn: 3.262 ± 0.647
3.262LeuPro: 3.262 ± 0.663
3.541LeuGln: 3.541 ± 0.666
5.219LeuArg: 5.219 ± 1.033
5.219LeuSer: 5.219 ± 0.508
5.778LeuThr: 5.778 ± 0.775
3.728LeuVal: 3.728 ± 0.893
0.652LeuTrp: 0.652 ± 0.309
4.007LeuTyr: 4.007 ± 0.584
0.0LeuXaa: 0.0 ± 0.0
Met
1.957MetAla: 1.957 ± 0.37
0.093MetCys: 0.093 ± 0.092
1.584MetAsp: 1.584 ± 0.292
1.212MetGlu: 1.212 ± 0.327
1.305MetPhe: 1.305 ± 0.422
1.584MetGly: 1.584 ± 0.353
0.559MetHis: 0.559 ± 0.201
1.491MetIle: 1.491 ± 0.333
2.982MetLys: 2.982 ± 0.429
1.398MetLeu: 1.398 ± 0.214
0.559MetMet: 0.559 ± 0.229
1.305MetAsn: 1.305 ± 0.342
1.305MetPro: 1.305 ± 0.311
1.212MetGln: 1.212 ± 0.374
1.678MetArg: 1.678 ± 0.364
1.771MetSer: 1.771 ± 0.411
1.212MetThr: 1.212 ± 0.298
1.864MetVal: 1.864 ± 0.336
0.466MetTrp: 0.466 ± 0.244
0.932MetTyr: 0.932 ± 0.264
0.0MetXaa: 0.0 ± 0.0
Asn
2.982AsnAla: 2.982 ± 0.619
0.373AsnCys: 0.373 ± 0.16
3.262AsnAsp: 3.262 ± 0.584
2.61AsnGlu: 2.61 ± 0.427
1.491AsnPhe: 1.491 ± 0.329
4.753AsnGly: 4.753 ± 0.544
1.118AsnHis: 1.118 ± 0.351
2.889AsnIle: 2.889 ± 0.482
2.144AsnLys: 2.144 ± 0.508
3.169AsnLeu: 3.169 ± 0.447
1.025AsnMet: 1.025 ± 0.314
2.889AsnAsn: 2.889 ± 0.718
2.05AsnPro: 2.05 ± 0.343
1.584AsnGln: 1.584 ± 0.297
3.075AsnArg: 3.075 ± 0.543
2.144AsnSer: 2.144 ± 0.522
2.423AsnThr: 2.423 ± 0.4
4.101AsnVal: 4.101 ± 0.504
0.559AsnTrp: 0.559 ± 0.181
1.305AsnTyr: 1.305 ± 0.315
0.0AsnXaa: 0.0 ± 0.0
Pro
3.169ProAla: 3.169 ± 0.582
0.186ProCys: 0.186 ± 0.111
1.957ProAsp: 1.957 ± 0.37
2.423ProGlu: 2.423 ± 0.499
1.678ProPhe: 1.678 ± 0.512
1.491ProGly: 1.491 ± 0.399
1.025ProHis: 1.025 ± 0.338
2.237ProIle: 2.237 ± 0.423
3.075ProLys: 3.075 ± 0.62
2.05ProLeu: 2.05 ± 0.5
0.559ProMet: 0.559 ± 0.2
2.237ProAsn: 2.237 ± 0.408
1.398ProPro: 1.398 ± 0.353
1.305ProGln: 1.305 ± 0.316
1.584ProArg: 1.584 ± 0.337
2.144ProSer: 2.144 ± 0.547
2.33ProThr: 2.33 ± 0.526
2.516ProVal: 2.516 ± 0.535
0.186ProTrp: 0.186 ± 0.136
1.305ProTyr: 1.305 ± 0.346
0.0ProXaa: 0.0 ± 0.0
Gln
2.889GlnAla: 2.889 ± 0.448
0.186GlnCys: 0.186 ± 0.137
1.957GlnAsp: 1.957 ± 0.484
2.982GlnGlu: 2.982 ± 0.567
1.864GlnPhe: 1.864 ± 0.499
2.33GlnGly: 2.33 ± 0.536
0.559GlnHis: 0.559 ± 0.194
2.982GlnIle: 2.982 ± 0.621
3.821GlnLys: 3.821 ± 0.642
3.914GlnLeu: 3.914 ± 0.555
0.932GlnMet: 0.932 ± 0.327
1.957GlnAsn: 1.957 ± 0.375
1.771GlnPro: 1.771 ± 0.447
2.144GlnGln: 2.144 ± 0.576
2.237GlnArg: 2.237 ± 0.512
1.678GlnSer: 1.678 ± 0.43
2.516GlnThr: 2.516 ± 0.499
2.982GlnVal: 2.982 ± 0.567
0.559GlnTrp: 0.559 ± 0.195
0.932GlnTyr: 0.932 ± 0.281
0.0GlnXaa: 0.0 ± 0.0
Arg
4.38ArgAla: 4.38 ± 0.575
0.746ArgCys: 0.746 ± 0.262
3.262ArgAsp: 3.262 ± 0.55
4.473ArgGlu: 4.473 ± 0.717
2.889ArgPhe: 2.889 ± 0.493
2.144ArgGly: 2.144 ± 0.401
0.932ArgHis: 0.932 ± 0.284
3.075ArgIle: 3.075 ± 0.586
5.312ArgLys: 5.312 ± 0.726
6.058ArgLeu: 6.058 ± 0.801
1.305ArgMet: 1.305 ± 0.366
2.423ArgAsn: 2.423 ± 0.415
2.423ArgPro: 2.423 ± 0.461
2.237ArgGln: 2.237 ± 0.443
3.169ArgArg: 3.169 ± 0.595
2.423ArgSer: 2.423 ± 0.484
2.237ArgThr: 2.237 ± 0.526
4.846ArgVal: 4.846 ± 0.617
1.212ArgTrp: 1.212 ± 0.298
2.05ArgTyr: 2.05 ± 0.471
0.0ArgXaa: 0.0 ± 0.0
Ser
3.821SerAla: 3.821 ± 0.654
0.373SerCys: 0.373 ± 0.197
2.423SerAsp: 2.423 ± 0.399
3.262SerGlu: 3.262 ± 0.713
1.678SerPhe: 1.678 ± 0.409
4.194SerGly: 4.194 ± 0.576
0.746SerHis: 0.746 ± 0.258
3.448SerIle: 3.448 ± 0.596
2.889SerLys: 2.889 ± 0.491
3.355SerLeu: 3.355 ± 0.553
1.305SerMet: 1.305 ± 0.454
1.584SerAsn: 1.584 ± 0.32
2.423SerPro: 2.423 ± 0.387
1.398SerGln: 1.398 ± 0.311
3.169SerArg: 3.169 ± 0.562
1.771SerSer: 1.771 ± 0.41
2.516SerThr: 2.516 ± 0.574
4.567SerVal: 4.567 ± 0.659
0.746SerTrp: 0.746 ± 0.225
1.864SerTyr: 1.864 ± 0.352
0.0SerXaa: 0.0 ± 0.0
Thr
3.355ThrAla: 3.355 ± 0.67
0.466ThrCys: 0.466 ± 0.227
3.728ThrAsp: 3.728 ± 0.75
4.194ThrGlu: 4.194 ± 0.708
2.423ThrPhe: 2.423 ± 0.561
3.448ThrGly: 3.448 ± 0.558
1.025ThrHis: 1.025 ± 0.329
5.685ThrIle: 5.685 ± 0.727
4.939ThrLys: 4.939 ± 0.547
6.524ThrLeu: 6.524 ± 0.982
1.305ThrMet: 1.305 ± 0.298
2.05ThrAsn: 2.05 ± 0.466
1.957ThrPro: 1.957 ± 0.41
2.05ThrGln: 2.05 ± 0.531
2.33ThrArg: 2.33 ± 0.474
2.61ThrSer: 2.61 ± 0.747
3.355ThrThr: 3.355 ± 0.797
4.101ThrVal: 4.101 ± 0.584
0.839ThrTrp: 0.839 ± 0.275
1.398ThrTyr: 1.398 ± 0.489
0.0ThrXaa: 0.0 ± 0.0
Val
5.405ValAla: 5.405 ± 0.686
0.186ValCys: 0.186 ± 0.115
4.007ValAsp: 4.007 ± 0.638
5.499ValGlu: 5.499 ± 0.544
2.237ValPhe: 2.237 ± 0.447
4.38ValGly: 4.38 ± 0.647
1.212ValHis: 1.212 ± 0.333
4.939ValIle: 4.939 ± 0.773
4.473ValLys: 4.473 ± 0.841
5.126ValLeu: 5.126 ± 0.731
2.05ValMet: 2.05 ± 0.378
4.567ValAsn: 4.567 ± 0.574
3.075ValPro: 3.075 ± 0.552
2.982ValGln: 2.982 ± 0.697
2.703ValArg: 2.703 ± 0.402
3.821ValSer: 3.821 ± 0.651
4.101ValThr: 4.101 ± 0.617
4.007ValVal: 4.007 ± 0.836
0.746ValTrp: 0.746 ± 0.277
3.262ValTyr: 3.262 ± 0.588
0.0ValXaa: 0.0 ± 0.0
Trp
0.466TrpAla: 0.466 ± 0.191
0.186TrpCys: 0.186 ± 0.131
0.839TrpAsp: 0.839 ± 0.223
1.025TrpGlu: 1.025 ± 0.281
1.025TrpPhe: 1.025 ± 0.315
0.466TrpGly: 0.466 ± 0.19
0.093TrpHis: 0.093 ± 0.09
0.373TrpIle: 0.373 ± 0.181
1.491TrpLys: 1.491 ± 0.373
0.746TrpLeu: 0.746 ± 0.284
0.28TrpMet: 0.28 ± 0.14
0.839TrpAsn: 0.839 ± 0.269
0.093TrpPro: 0.093 ± 0.092
0.466TrpGln: 0.466 ± 0.179
1.491TrpArg: 1.491 ± 0.387
0.466TrpSer: 0.466 ± 0.255
0.559TrpThr: 0.559 ± 0.263
0.746TrpVal: 0.746 ± 0.256
0.186TrpTrp: 0.186 ± 0.136
0.373TrpTyr: 0.373 ± 0.182
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.075TyrAla: 3.075 ± 0.635
0.28TyrCys: 0.28 ± 0.154
2.61TyrAsp: 2.61 ± 0.502
4.007TyrGlu: 4.007 ± 0.674
1.584TyrPhe: 1.584 ± 0.533
2.61TyrGly: 2.61 ± 0.506
1.025TyrHis: 1.025 ± 0.369
2.61TyrIle: 2.61 ± 0.557
2.61TyrLys: 2.61 ± 0.485
2.889TyrLeu: 2.889 ± 0.478
0.932TyrMet: 0.932 ± 0.244
1.491TyrAsn: 1.491 ± 0.372
0.932TyrPro: 0.932 ± 0.299
1.584TyrGln: 1.584 ± 0.378
2.144TyrArg: 2.144 ± 0.514
1.491TyrSer: 1.491 ± 0.339
2.516TyrThr: 2.516 ± 0.493
2.61TyrVal: 2.61 ± 0.467
0.186TyrTrp: 0.186 ± 0.137
1.212TyrTyr: 1.212 ± 0.354
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (10731 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski