Amino acid dipeptide frequency for Podoviridae sp. ctbh1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
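The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of overlapping adjacent pairs, the choice of the total pair count as denominator, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions made for the example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumptions (not confirmed by the source): pairs overlap, pairs never
    span two proteins, and the denominator is the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        # Collapse anything outside the 20 standard letters into X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count every overlapping pair of adjacent residues.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: two short made-up sequences, the second with a non-standard letter.
freqs = dipeptide_permille(["MKKLA", "AKKB"])
```

In the toy input, `B` is not a standard residue, so the last pair of the second sequence is counted as `KX`.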

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
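As a small worked example of the unit conversion and of the comparison against the uniform expectation, the sketch below converts a per mille value to a fraction and labels a dipeptide as over- or underrepresented. The helper names are hypothetical, and using the uniform expectation (1000/400 = 2.5‰, or 1000/441 ≈ 2.27‰ when Xaa is included) as the colouring threshold is an assumption based on the description above.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2

def classify(value_permille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    expected = expected_permille(n_residue_types)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"

# Values taken from the table: AlaAla = 8.168, CysCys = 0.076 (both in per mille).
ala_ala = classify(8.168)
cys_cys = classify(0.076)
```

With the values from the table, AlaAla (8.168‰) lies above the 2.5‰ expectation and CysCys (0.076‰) lies below it.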

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 8.168 ± 1.623 | 0.832 ± 0.37 | 5.067 ± 0.653 | 4.008 ± 0.818 | 2.798 ± 0.418 | 4.613 ± 0.731 | 1.513 ± 0.426 | 5.596 ± 0.761 | 6.731 ± 0.829 | 7.184 ± 0.717 | 2.496 ± 0.439 | 4.159 ± 0.565 | 2.723 ± 0.505 | 3.554 ± 0.607 | 3.479 ± 0.627 | 5.823 ± 1.694 | 5.748 ± 1.5 | 4.84 ± 0.587 | 1.134 ± 0.236 | 1.664 ± 0.42 | 0.0 ± 0.0 |
| Cys | 0.681 ± 0.246 | 0.076 ± 0.081 | 0.529 ± 0.169 | 0.983 ± 0.27 | 0.681 ± 0.292 | 0.908 ± 0.352 | 0.151 ± 0.11 | 0.605 ± 0.205 | 0.454 ± 0.185 | 0.454 ± 0.177 | 0.151 ± 0.099 | 0.227 ± 0.118 | 0.227 ± 0.139 | 0.151 ± 0.117 | 0.529 ± 0.164 | 0.832 ± 0.313 | 0.529 ± 0.198 | 0.529 ± 0.182 | 0.151 ± 0.111 | 0.076 ± 0.082 | 0.0 ± 0.0 |
| Asp | 4.916 ± 0.788 | 0.378 ± 0.171 | 2.647 ± 0.555 | 3.63 ± 0.665 | 2.798 ± 0.39 | 4.84 ± 0.614 | 0.983 ± 0.321 | 3.706 ± 0.515 | 3.176 ± 0.538 | 5.143 ± 0.574 | 2.118 ± 0.461 | 2.042 ± 0.492 | 2.042 ± 0.35 | 2.798 ± 0.49 | 1.588 ± 0.345 | 4.084 ± 0.543 | 3.479 ± 0.589 | 4.235 ± 0.539 | 0.908 ± 0.367 | 2.269 ± 0.368 | 0.0 ± 0.0 |
| Glu | 4.613 ± 0.592 | 0.303 ± 0.161 | 2.647 ± 0.706 | 2.269 ± 0.505 | 3.101 ± 0.539 | 1.966 ± 0.376 | 1.513 ± 0.369 | 4.311 ± 0.642 | 3.403 ± 0.633 | 7.109 ± 0.949 | 1.815 ± 0.373 | 3.554 ± 0.526 | 1.437 ± 0.395 | 4.159 ± 0.726 | 2.949 ± 0.419 | 3.252 ± 0.454 | 3.025 ± 0.463 | 3.63 ± 0.512 | 0.756 ± 0.198 | 2.269 ± 0.528 | 0.0 ± 0.0 |
| Phe | 2.647 ± 0.424 | 0.454 ± 0.227 | 2.571 ± 0.525 | 2.723 ± 0.383 | 1.437 ± 0.394 | 2.874 ± 0.593 | 0.454 ± 0.174 | 3.63 ± 0.551 | 2.723 ± 0.395 | 2.798 ± 0.47 | 0.378 ± 0.155 | 3.403 ± 0.551 | 1.059 ± 0.332 | 1.361 ± 0.372 | 1.588 ± 0.405 | 2.798 ± 0.402 | 3.328 ± 0.579 | 2.193 ± 0.468 | 0.303 ± 0.138 | 1.437 ± 0.405 | 0.0 ± 0.0 |
| Gly | 5.521 ± 1.533 | 0.605 ± 0.222 | 4.764 ± 0.906 | 4.235 ± 0.994 | 2.949 ± 0.426 | 4.386 ± 0.725 | 1.21 ± 0.391 | 4.008 ± 0.724 | 4.764 ± 0.762 | 5.445 ± 0.718 | 1.513 ± 0.35 | 4.159 ± 0.761 | 0.378 ± 0.159 | 2.571 ± 0.57 | 2.723 ± 0.463 | 4.916 ± 0.941 | 4.613 ± 0.971 | 4.311 ± 0.783 | 1.059 ± 0.267 | 2.798 ± 0.601 | 0.0 ± 0.0 |
| His | 0.832 ± 0.327 | 0.227 ± 0.14 | 1.437 ± 0.424 | 0.908 ± 0.22 | 0.605 ± 0.241 | 0.908 ± 0.264 | 0.908 ± 0.219 | 1.21 ± 0.315 | 0.908 ± 0.26 | 1.739 ± 0.373 | 0.227 ± 0.13 | 1.286 ± 0.277 | 0.908 ± 0.26 | 0.681 ± 0.288 | 0.756 ± 0.211 | 1.134 ± 0.306 | 0.529 ± 0.187 | 1.286 ± 0.289 | 0.454 ± 0.184 | 0.303 ± 0.143 | 0.0 ± 0.0 |
| Ile | 5.974 ± 0.823 | 0.832 ± 0.268 | 5.369 ± 0.638 | 4.462 ± 0.575 | 2.193 ± 0.363 | 4.311 ± 0.645 | 1.134 ± 0.356 | 3.025 ± 0.53 | 4.159 ± 0.606 | 4.991 ± 0.535 | 1.664 ± 0.303 | 4.084 ± 0.533 | 2.723 ± 0.522 | 3.025 ± 0.462 | 2.571 ± 0.382 | 6.277 ± 0.669 | 4.84 ± 0.576 | 3.176 ± 0.459 | 0.832 ± 0.297 | 1.513 ± 0.277 | 0.0 ± 0.0 |
| Lys | 6.731 ± 1.092 | 0.605 ± 0.217 | 3.554 ± 0.564 | 3.101 ± 0.456 | 2.42 ± 0.328 | 3.781 ± 0.572 | 1.739 ± 0.407 | 4.311 ± 0.586 | 5.143 ± 0.691 | 6.05 ± 0.561 | 1.134 ± 0.384 | 3.706 ± 0.421 | 3.176 ± 0.503 | 4.235 ± 0.813 | 3.176 ± 0.496 | 3.857 ± 0.643 | 4.159 ± 0.527 | 4.159 ± 0.638 | 0.983 ± 0.33 | 2.269 ± 0.554 | 0.0 ± 0.0 |
| Leu | 7.563 ± 1.001 | 1.134 ± 0.335 | 5.823 ± 0.561 | 5.445 ± 0.873 | 2.874 ± 0.519 | 5.974 ± 0.916 | 1.134 ± 0.291 | 6.201 ± 0.631 | 7.487 ± 0.726 | 6.882 ± 0.758 | 2.118 ± 0.404 | 4.689 ± 0.582 | 2.949 ± 0.554 | 3.857 ± 0.468 | 5.143 ± 1.345 | 5.369 ± 0.756 | 5.369 ± 0.655 | 5.974 ± 0.811 | 0.378 ± 0.151 | 2.42 ± 0.414 | 0.0 ± 0.0 |
| Met | 2.193 ± 0.395 | 0.076 ± 0.07 | 1.286 ± 0.306 | 0.832 ± 0.26 | 0.908 ± 0.287 | 1.664 ± 0.342 | 0.529 ± 0.191 | 0.983 ± 0.208 | 1.059 ± 0.272 | 1.966 ± 0.513 | 0.832 ± 0.218 | 1.513 ± 0.391 | 1.513 ± 0.324 | 1.134 ± 0.278 | 1.059 ± 0.332 | 1.891 ± 0.403 | 1.513 ± 0.287 | 1.588 ± 0.323 | 0.0 ± 0.0 | 0.681 ± 0.221 | 0.0 ± 0.0 |
| Asn | 4.311 ± 0.655 | 0.151 ± 0.131 | 2.571 ± 0.514 | 2.723 ± 0.413 | 2.118 ± 0.375 | 5.294 ± 0.635 | 0.681 ± 0.264 | 3.328 ± 0.458 | 4.084 ± 0.59 | 4.613 ± 0.6 | 0.681 ± 0.189 | 3.63 ± 0.797 | 2.269 ± 0.442 | 3.025 ± 0.494 | 1.739 ± 0.343 | 4.689 ± 0.722 | 3.176 ± 0.506 | 3.554 ± 0.523 | 0.908 ± 0.295 | 1.815 ± 0.886 | 0.0 ± 0.0 |
| Pro | 2.344 ± 0.533 | 0.227 ± 0.147 | 2.344 ± 0.427 | 2.723 ± 0.455 | 1.513 ± 0.335 | 1.286 ± 0.272 | 0.378 ± 0.179 | 3.781 ± 0.761 | 2.42 ± 0.517 | 2.874 ± 0.455 | 0.681 ± 0.254 | 1.286 ± 0.294 | 1.059 ± 0.266 | 1.361 ± 0.29 | 0.832 ± 0.23 | 2.723 ± 0.478 | 2.647 ± 0.414 | 2.723 ± 0.453 | 0.454 ± 0.173 | 0.681 ± 0.206 | 0.0 ± 0.0 |
| Gln | 5.596 ± 0.986 | 0.605 ± 0.244 | 1.739 ± 0.294 | 2.723 ± 0.551 | 1.437 ± 0.418 | 2.723 ± 0.619 | 0.529 ± 0.257 | 3.176 ± 0.645 | 2.949 ± 0.694 | 5.445 ± 1.951 | 0.681 ± 0.243 | 2.042 ± 0.295 | 2.042 ± 0.346 | 3.554 ± 0.702 | 1.966 ± 0.374 | 3.706 ± 0.473 | 2.344 ± 0.363 | 3.63 ± 0.615 | 0.681 ± 0.255 | 1.815 ± 0.311 | 0.0 ± 0.0 |
| Arg | 2.269 ± 0.58 | 0.605 ± 0.251 | 2.193 ± 0.423 | 2.647 ± 0.375 | 2.042 ± 0.365 | 2.118 ± 0.425 | 0.378 ± 0.152 | 3.176 ± 0.571 | 2.496 ± 0.372 | 4.916 ± 0.698 | 1.134 ± 0.301 | 2.118 ± 0.4 | 1.437 ± 0.357 | 2.949 ± 1.364 | 1.437 ± 0.344 | 3.176 ± 0.544 | 2.269 ± 0.352 | 2.193 ± 0.345 | 0.378 ± 0.165 | 1.739 ± 0.423 | 0.0 ± 0.0 |
| Ser | 5.899 ± 1.531 | 0.756 ± 0.235 | 4.386 ± 0.613 | 4.538 ± 0.571 | 3.101 ± 0.569 | 4.991 ± 0.768 | 1.21 ± 0.373 | 5.294 ± 0.558 | 5.596 ± 0.742 | 7.26 ± 0.695 | 1.891 ± 0.346 | 3.63 ± 0.604 | 1.588 ± 0.338 | 2.571 ± 0.498 | 2.269 ± 0.616 | 4.159 ± 0.655 | 3.554 ± 0.487 | 3.857 ± 0.689 | 0.681 ± 0.215 | 2.42 ± 0.418 | 0.0 ± 0.0 |
| Thr | 4.764 ± 0.601 | 0.303 ± 0.163 | 2.798 ± 0.502 | 3.252 ± 0.545 | 2.42 ± 0.37 | 6.655 ± 1.601 | 1.21 ± 0.286 | 4.008 ± 0.695 | 3.101 ± 0.645 | 5.369 ± 0.599 | 1.134 ± 0.279 | 3.101 ± 0.549 | 3.479 ± 0.67 | 2.193 ± 0.383 | 2.496 ± 0.403 | 3.706 ± 0.481 | 5.143 ± 0.944 | 5.067 ± 0.826 | 0.832 ± 0.233 | 2.042 ± 0.332 | 0.0 ± 0.0 |
| Val | 4.159 ± 0.569 | 0.378 ± 0.191 | 3.479 ± 0.443 | 4.538 ± 0.675 | 2.874 ± 0.439 | 4.462 ± 0.829 | 0.605 ± 0.201 | 3.554 ± 0.534 | 5.369 ± 0.667 | 4.613 ± 0.525 | 1.437 ± 0.378 | 4.764 ± 0.556 | 1.815 ± 0.367 | 4.159 ± 1.249 | 2.798 ± 0.512 | 3.933 ± 0.737 | 4.462 ± 0.871 | 4.084 ± 0.507 | 0.756 ± 0.314 | 1.361 ± 0.298 | 0.0 ± 0.0 |
| Trp | 0.756 ± 0.24 | 0.303 ± 0.183 | 0.378 ± 0.237 | 0.756 ± 0.226 | 0.529 ± 0.193 | 0.529 ± 0.187 | 0.227 ± 0.137 | 1.134 ± 0.339 | 0.605 ± 0.203 | 0.983 ± 0.267 | 0.378 ± 0.153 | 0.454 ± 0.181 | 0.227 ± 0.182 | 0.378 ± 0.143 | 0.832 ± 0.274 | 1.437 ± 0.404 | 0.454 ± 0.175 | 1.134 ± 0.348 | 0.378 ± 0.195 | 0.681 ± 0.217 | 0.0 ± 0.0 |
| Tyr | 2.193 ± 0.477 | 0.076 ± 0.077 | 2.193 ± 0.45 | 1.588 ± 0.368 | 1.361 ± 0.358 | 2.647 ± 0.756 | 0.756 ± 0.248 | 2.118 ± 0.446 | 1.664 ± 0.373 | 2.949 ± 0.632 | 0.681 ± 0.26 | 1.513 ± 0.339 | 1.286 ± 0.311 | 1.664 ± 0.344 | 1.739 ± 0.478 | 1.891 ± 0.502 | 1.891 ± 0.383 | 1.513 ± 0.236 | 0.529 ± 0.172 | 1.437 ± 0.333 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 54 proteins (13224 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
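A protein-level bootstrap of this kind can be sketched as follows. The source does not describe the exact procedure, so this is only one plausible reading: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the sample standard deviation over replicates. The function names, the fixed seed, and the toy sequences are assumptions made for the illustration.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    replicates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)

# Toy proteome of three made-up sequences.
toy = ["MKKLA", "AKKLAA", "MAAKK"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.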

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski