Amino acid dipeptide frequency for Podoviridae sp. ctdc61

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
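The expected value mentioned above can be checked with a few lines of Python. This is a minimal sketch: the `classify` helper is hypothetical, and it assumes that the red/blue marking simply compares each table value with the uniform expectation, which the page does not state explicitly.

```python
# Uniform expectation for dipeptides built from the 20 standard amino acids.
N_STANDARD = 20
N_WITH_XAA = 21

n_dipeptides = N_STANDARD ** 2           # 400 possible dipeptides
n_dipeptides_xaa = N_WITH_XAA ** 2       # 441 if Xaa is included

expected_fraction = 1 / n_dipeptides     # 0.0025 as a fraction
expected_percent = expected_fraction * 100
expected_per_mille = expected_fraction * 1000


def classify(value_per_mille, expected=expected_per_mille):
    """Label a table value relative to the uniform expectation.

    Assumption: a plain comparison with the expected value; the page
    does not say which threshold drives the colouring.
    """
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    print(n_dipeptides, n_dipeptides_xaa)
    print(classify(9.901))  # LeuSer value from the table
    print(classify(0.254))  # AlaHis value from the table
```

Under this assumption, LeuSer (9.901‰) would count as overrepresented and AlaHis (0.254‰) as underrepresented.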

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
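As an illustration of how such per mille values could be obtained, the sketch below counts dipeptides in a list of protein sequences. The function names are hypothetical, and the counting convention (overlapping windows of two residues that never span two proteins, with any non-standard letter mapped to Xaa) is an assumption; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns; X stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return a dict mapping each of the 441 dipeptides to its frequency in per mille."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, counted within each protein only.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAC", "CA"])  # toy sequences, not real data
    print(len(freqs), freqs["AA"], per_mille_to_fraction(freqs["AA"]))
```

With the toy input, the three observed dipeptides (AA, AC, CA) each account for one third of the total, and all other entries are zero.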

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.777AlaAla: 1.777 ± 0.7
0.0AlaCys: 0.0 ± 0.0
2.793AlaAsp: 2.793 ± 0.957
4.57AlaGlu: 4.57 ± 1.455
3.3AlaPhe: 3.3 ± 1.252
1.015AlaGly: 1.015 ± 0.409
0.254AlaHis: 0.254 ± 0.251
3.3AlaIle: 3.3 ± 1.043
4.062AlaLys: 4.062 ± 0.705
4.824AlaLeu: 4.824 ± 1.154
0.508AlaMet: 0.508 ± 0.278
3.046AlaAsn: 3.046 ± 0.761
1.269AlaPro: 1.269 ± 0.56
1.269AlaGln: 1.269 ± 0.665
1.523AlaArg: 1.523 ± 0.967
2.793AlaSer: 2.793 ± 0.798
4.062AlaThr: 4.062 ± 1.215
5.331AlaVal: 5.331 ± 0.971
0.508AlaTrp: 0.508 ± 0.264
3.046AlaTyr: 3.046 ± 0.626
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.508CysCys: 0.508 ± 0.483
0.762CysAsp: 0.762 ± 0.36
0.762CysGlu: 0.762 ± 0.579
0.0CysPhe: 0.0 ± 0.0
0.508CysGly: 0.508 ± 0.483
0.254CysHis: 0.254 ± 0.242
0.254CysIle: 0.254 ± 0.242
0.762CysLys: 0.762 ± 0.591
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.762CysAsn: 0.762 ± 0.488
0.508CysPro: 0.508 ± 0.342
0.0CysGln: 0.0 ± 0.0
0.508CysArg: 0.508 ± 0.342
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.254CysVal: 0.254 ± 0.242
0.0CysTrp: 0.0 ± 0.0
0.508CysTyr: 0.508 ± 0.483
0.0CysXaa: 0.0 ± 0.0
Asp
2.539AspAla: 2.539 ± 0.736
0.254AspCys: 0.254 ± 0.242
6.093AspAsp: 6.093 ± 3.302
6.601AspGlu: 6.601 ± 2.086
2.285AspPhe: 2.285 ± 0.873
6.601AspGly: 6.601 ± 1.248
0.762AspHis: 0.762 ± 0.419
9.139AspIle: 9.139 ± 1.775
4.316AspLys: 4.316 ± 1.31
4.062AspLeu: 4.062 ± 0.74
1.269AspMet: 1.269 ± 0.382
5.839AspAsn: 5.839 ± 0.853
4.57AspPro: 4.57 ± 2.19
0.762AspGln: 0.762 ± 0.674
1.523AspArg: 1.523 ± 0.873
4.316AspSer: 4.316 ± 0.668
5.585AspThr: 5.585 ± 0.911
6.347AspVal: 6.347 ± 0.96
0.254AspTrp: 0.254 ± 0.192
4.57AspTyr: 4.57 ± 0.963
0.0AspXaa: 0.0 ± 0.0
Glu
2.793GluAla: 2.793 ± 0.548
1.015GluCys: 1.015 ± 0.804
4.824GluAsp: 4.824 ± 1.416
1.523GluGlu: 1.523 ± 0.754
3.3GluPhe: 3.3 ± 1.246
1.777GluGly: 1.777 ± 0.426
0.762GluHis: 0.762 ± 0.386
6.093GluIle: 6.093 ± 1.336
3.554GluLys: 3.554 ± 0.919
7.87GluLeu: 7.87 ± 1.759
1.523GluMet: 1.523 ± 0.592
3.046GluAsn: 3.046 ± 1.085
2.539GluPro: 2.539 ± 1.055
2.031GluGln: 2.031 ± 0.661
2.539GluArg: 2.539 ± 1.05
3.808GluSer: 3.808 ± 0.844
5.331GluThr: 5.331 ± 0.85
4.57GluVal: 4.57 ± 1.666
0.762GluTrp: 0.762 ± 0.431
2.793GluTyr: 2.793 ± 0.584
0.0GluXaa: 0.0 ± 0.0
Phe
2.031PheAla: 2.031 ± 1.204
0.0PheCys: 0.0 ± 0.0
3.808PheAsp: 3.808 ± 1.406
2.285PheGlu: 2.285 ± 0.707
1.015PhePhe: 1.015 ± 0.747
2.539PheGly: 2.539 ± 0.988
0.762PheHis: 0.762 ± 0.326
3.3PheIle: 3.3 ± 0.556
2.285PheLys: 2.285 ± 0.907
2.285PheLeu: 2.285 ± 0.592
0.762PheMet: 0.762 ± 0.325
4.57PheAsn: 4.57 ± 0.851
1.269PhePro: 1.269 ± 0.432
0.0PheGln: 0.0 ± 0.0
1.523PheArg: 1.523 ± 0.65
2.539PheSer: 2.539 ± 0.674
3.3PheThr: 3.3 ± 0.83
2.031PheVal: 2.031 ± 0.702
0.254PheTrp: 0.254 ± 0.333
3.554PheTyr: 3.554 ± 1.168
0.0PheXaa: 0.0 ± 0.0
Gly
1.269GlyAla: 1.269 ± 0.635
0.0GlyCys: 0.0 ± 0.0
3.808GlyAsp: 3.808 ± 0.966
4.062GlyGlu: 4.062 ± 0.504
4.316GlyPhe: 4.316 ± 1.25
3.808GlyGly: 3.808 ± 1.077
0.762GlyHis: 0.762 ± 0.28
4.57GlyIle: 4.57 ± 1.04
5.331GlyLys: 5.331 ± 1.213
5.839GlyLeu: 5.839 ± 1.272
1.777GlyMet: 1.777 ± 0.615
3.808GlyAsn: 3.808 ± 0.894
0.0GlyPro: 0.0 ± 0.0
0.254GlyGln: 0.254 ± 0.192
1.523GlyArg: 1.523 ± 0.568
3.3GlySer: 3.3 ± 1.078
5.077GlyThr: 5.077 ± 1.185
6.855GlyVal: 6.855 ± 1.018
0.254GlyTrp: 0.254 ± 0.31
2.539GlyTyr: 2.539 ± 0.765
0.0GlyXaa: 0.0 ± 0.0
His
0.762HisAla: 0.762 ± 0.315
0.0HisCys: 0.0 ± 0.0
0.254HisAsp: 0.254 ± 0.258
0.254HisGlu: 0.254 ± 0.251
0.254HisPhe: 0.254 ± 0.242
1.269HisGly: 1.269 ± 1.022
0.508HisHis: 0.508 ± 0.331
0.508HisIle: 0.508 ± 0.414
1.015HisLys: 1.015 ± 0.531
1.523HisLeu: 1.523 ± 0.6
0.254HisMet: 0.254 ± 0.242
1.015HisAsn: 1.015 ± 0.693
0.508HisPro: 0.508 ± 0.363
0.508HisGln: 0.508 ± 0.361
0.0HisArg: 0.0 ± 0.0
0.508HisSer: 0.508 ± 0.352
0.508HisThr: 0.508 ± 0.382
1.015HisVal: 1.015 ± 0.512
0.0HisTrp: 0.0 ± 0.0
1.015HisTyr: 1.015 ± 0.611
0.0HisXaa: 0.0 ± 0.0
Ile
2.285IleAla: 2.285 ± 0.736
0.254IleCys: 0.254 ± 0.242
8.632IleAsp: 8.632 ± 1.256
5.839IleGlu: 5.839 ± 1.57
1.777IlePhe: 1.777 ± 0.452
5.331IleGly: 5.331 ± 1.523
1.777IleHis: 1.777 ± 0.914
5.077IleIle: 5.077 ± 0.792
5.585IleLys: 5.585 ± 1.162
5.585IleLeu: 5.585 ± 0.68
0.762IleMet: 0.762 ± 0.349
5.585IleAsn: 5.585 ± 1.06
3.554IlePro: 3.554 ± 0.595
2.539IleGln: 2.539 ± 0.512
1.523IleArg: 1.523 ± 0.409
6.093IleSer: 6.093 ± 1.131
7.362IleThr: 7.362 ± 1.382
4.824IleVal: 4.824 ± 1.144
0.508IleTrp: 0.508 ± 0.23
2.793IleTyr: 2.793 ± 1.552
0.0IleXaa: 0.0 ± 0.0
Lys
4.316LysAla: 4.316 ± 1.01
0.508LysCys: 0.508 ± 0.535
4.316LysAsp: 4.316 ± 0.474
4.062LysGlu: 4.062 ± 0.715
3.046LysPhe: 3.046 ± 1.142
3.046LysGly: 3.046 ± 1.057
1.015LysHis: 1.015 ± 0.459
4.062LysIle: 4.062 ± 0.894
2.539LysLys: 2.539 ± 0.916
6.093LysLeu: 6.093 ± 1.381
1.269LysMet: 1.269 ± 0.541
3.808LysAsn: 3.808 ± 1.124
2.285LysPro: 2.285 ± 0.667
2.539LysGln: 2.539 ± 0.892
3.808LysArg: 3.808 ± 1.337
3.046LysSer: 3.046 ± 0.675
8.124LysThr: 8.124 ± 1.238
6.601LysVal: 6.601 ± 1.697
0.508LysTrp: 0.508 ± 0.313
2.539LysTyr: 2.539 ± 0.685
0.0LysXaa: 0.0 ± 0.0
Leu
5.839LeuAla: 5.839 ± 1.57
0.254LeuCys: 0.254 ± 0.242
6.093LeuAsp: 6.093 ± 1.09
3.808LeuGlu: 3.808 ± 1.144
2.793LeuPhe: 2.793 ± 0.906
6.601LeuGly: 6.601 ± 1.089
1.777LeuHis: 1.777 ± 0.593
4.316LeuIle: 4.316 ± 1.13
4.062LeuLys: 4.062 ± 1.392
3.808LeuLeu: 3.808 ± 1.113
2.285LeuMet: 2.285 ± 0.497
5.331LeuAsn: 5.331 ± 1.258
3.3LeuPro: 3.3 ± 0.675
1.269LeuGln: 1.269 ± 0.756
2.031LeuArg: 2.031 ± 0.722
9.901LeuSer: 9.901 ± 1.317
5.585LeuThr: 5.585 ± 1.633
5.077LeuVal: 5.077 ± 1.064
1.015LeuTrp: 1.015 ± 0.33
4.062LeuTyr: 4.062 ± 0.877
0.0LeuXaa: 0.0 ± 0.0
Met
1.523MetAla: 1.523 ± 1.078
0.508MetCys: 0.508 ± 0.483
0.762MetAsp: 0.762 ± 0.438
1.015MetGlu: 1.015 ± 0.364
2.031MetPhe: 2.031 ± 0.526
1.269MetGly: 1.269 ± 0.503
0.508MetHis: 0.508 ± 0.308
1.523MetIle: 1.523 ± 0.602
2.539MetLys: 2.539 ± 0.482
3.046MetLeu: 3.046 ± 0.843
0.254MetMet: 0.254 ± 0.268
1.015MetAsn: 1.015 ± 0.482
0.254MetPro: 0.254 ± 0.192
0.762MetGln: 0.762 ± 0.481
0.508MetArg: 0.508 ± 0.331
0.762MetSer: 0.762 ± 0.351
1.269MetThr: 1.269 ± 0.593
0.508MetVal: 0.508 ± 0.271
0.254MetTrp: 0.254 ± 0.303
2.031MetTyr: 2.031 ± 0.848
0.0MetXaa: 0.0 ± 0.0
Asn
3.046AsnAla: 3.046 ± 0.886
0.0AsnCys: 0.0 ± 0.0
4.062AsnAsp: 4.062 ± 1.39
6.347AsnGlu: 6.347 ± 0.656
1.269AsnPhe: 1.269 ± 0.36
5.077AsnGly: 5.077 ± 1.759
0.762AsnHis: 0.762 ± 0.519
6.093AsnIle: 6.093 ± 1.171
6.601AsnLys: 6.601 ± 1.641
3.808AsnLeu: 3.808 ± 0.848
2.285AsnMet: 2.285 ± 1.217
5.331AsnAsn: 5.331 ± 1.244
2.031AsnPro: 2.031 ± 0.631
1.269AsnGln: 1.269 ± 0.545
1.523AsnArg: 1.523 ± 0.582
3.3AsnSer: 3.3 ± 0.917
6.347AsnThr: 6.347 ± 1.236
5.077AsnVal: 5.077 ± 0.927
0.254AsnTrp: 0.254 ± 0.242
3.3AsnTyr: 3.3 ± 0.988
0.0AsnXaa: 0.0 ± 0.0
Pro
2.031ProAla: 2.031 ± 0.565
0.254ProCys: 0.254 ± 0.268
3.3ProAsp: 3.3 ± 1.626
3.554ProGlu: 3.554 ± 0.953
1.269ProPhe: 1.269 ± 0.721
0.0ProGly: 0.0 ± 0.0
0.254ProHis: 0.254 ± 0.251
2.285ProIle: 2.285 ± 0.689
2.793ProLys: 2.793 ± 0.873
2.539ProLeu: 2.539 ± 0.909
0.762ProMet: 0.762 ± 0.313
3.808ProAsn: 3.808 ± 1.789
2.793ProPro: 2.793 ± 1.372
0.254ProGln: 0.254 ± 0.258
0.254ProArg: 0.254 ± 0.251
2.793ProSer: 2.793 ± 0.629
2.539ProThr: 2.539 ± 0.814
3.554ProVal: 3.554 ± 1.385
0.254ProTrp: 0.254 ± 0.242
2.285ProTyr: 2.285 ± 0.442
0.0ProXaa: 0.0 ± 0.0
Gln
1.015GlnAla: 1.015 ± 1.056
0.254GlnCys: 0.254 ± 0.242
1.269GlnAsp: 1.269 ± 0.503
1.269GlnGlu: 1.269 ± 0.56
0.762GlnPhe: 0.762 ± 0.725
1.777GlnGly: 1.777 ± 0.525
0.254GlnHis: 0.254 ± 0.258
1.523GlnIle: 1.523 ± 0.413
0.762GlnLys: 0.762 ± 0.457
2.539GlnLeu: 2.539 ± 1.002
0.254GlnMet: 0.254 ± 0.292
0.508GlnAsn: 0.508 ± 0.402
1.523GlnPro: 1.523 ± 0.73
1.015GlnGln: 1.015 ± 0.975
1.523GlnArg: 1.523 ± 0.678
1.523GlnSer: 1.523 ± 0.519
1.269GlnThr: 1.269 ± 0.613
0.762GlnVal: 0.762 ± 0.351
0.254GlnTrp: 0.254 ± 0.31
1.523GlnTyr: 1.523 ± 0.599
0.0GlnXaa: 0.0 ± 0.0
Arg
1.523ArgAla: 1.523 ± 0.688
0.254ArgCys: 0.254 ± 0.242
2.285ArgAsp: 2.285 ± 0.573
1.777ArgGlu: 1.777 ± 1.007
1.015ArgPhe: 1.015 ± 0.517
1.015ArgGly: 1.015 ± 0.436
0.254ArgHis: 0.254 ± 0.242
2.285ArgIle: 2.285 ± 0.556
0.762ArgLys: 0.762 ± 0.347
4.57ArgLeu: 4.57 ± 1.206
1.015ArgMet: 1.015 ± 0.559
1.015ArgAsn: 1.015 ± 0.822
1.269ArgPro: 1.269 ± 0.691
0.762ArgGln: 0.762 ± 0.237
2.539ArgArg: 2.539 ± 0.986
2.031ArgSer: 2.031 ± 1.126
3.554ArgThr: 3.554 ± 0.63
3.3ArgVal: 3.3 ± 0.937
0.762ArgTrp: 0.762 ± 0.323
1.269ArgTyr: 1.269 ± 0.459
0.0ArgXaa: 0.0 ± 0.0
Ser
4.062SerAla: 4.062 ± 0.938
0.254SerCys: 0.254 ± 0.303
5.839SerAsp: 5.839 ± 1.142
3.046SerGlu: 3.046 ± 0.786
3.046SerPhe: 3.046 ± 0.608
5.585SerGly: 5.585 ± 0.964
0.508SerHis: 0.508 ± 0.23
4.57SerIle: 4.57 ± 0.948
7.108SerLys: 7.108 ± 1.201
4.824SerLeu: 4.824 ± 1.269
1.015SerMet: 1.015 ± 0.519
5.331SerAsn: 5.331 ± 1.169
2.539SerPro: 2.539 ± 0.839
0.762SerGln: 0.762 ± 0.44
1.269SerArg: 1.269 ± 0.532
4.316SerSer: 4.316 ± 0.914
5.585SerThr: 5.585 ± 1.49
5.839SerVal: 5.839 ± 0.947
0.508SerTrp: 0.508 ± 0.516
1.777SerTyr: 1.777 ± 1.074
0.0SerXaa: 0.0 ± 0.0
Thr
4.062ThrAla: 4.062 ± 1.098
0.762ThrCys: 0.762 ± 0.579
7.362ThrAsp: 7.362 ± 0.82
4.062ThrGlu: 4.062 ± 1.58
2.285ThrPhe: 2.285 ± 0.882
3.808ThrGly: 3.808 ± 0.759
0.254ThrHis: 0.254 ± 0.242
7.108ThrIle: 7.108 ± 1.022
3.046ThrLys: 3.046 ± 1.423
7.108ThrLeu: 7.108 ± 0.716
2.285ThrMet: 2.285 ± 0.496
5.839ThrAsn: 5.839 ± 1.463
2.031ThrPro: 2.031 ± 0.451
1.523ThrGln: 1.523 ± 0.525
2.031ThrArg: 2.031 ± 0.567
6.347ThrSer: 6.347 ± 1.298
3.554ThrThr: 3.554 ± 0.828
8.378ThrVal: 8.378 ± 1.246
1.269ThrTrp: 1.269 ± 0.459
5.839ThrTyr: 5.839 ± 0.936
0.0ThrXaa: 0.0 ± 0.0
Val
5.331ValAla: 5.331 ± 1.121
0.0ValCys: 0.0 ± 0.0
7.362ValAsp: 7.362 ± 1.822
4.57ValGlu: 4.57 ± 0.993
2.285ValPhe: 2.285 ± 0.994
4.062ValGly: 4.062 ± 1.704
0.0ValHis: 0.0 ± 0.0
6.601ValIle: 6.601 ± 1.023
5.331ValLys: 5.331 ± 1.179
4.57ValLeu: 4.57 ± 1.135
1.523ValMet: 1.523 ± 0.677
5.331ValAsn: 5.331 ± 1.353
4.062ValPro: 4.062 ± 1.246
1.777ValGln: 1.777 ± 0.633
4.824ValArg: 4.824 ± 0.837
7.108ValSer: 7.108 ± 1.361
6.347ValThr: 6.347 ± 1.37
5.585ValVal: 5.585 ± 1.403
1.269ValTrp: 1.269 ± 0.443
2.285ValTyr: 2.285 ± 0.667
0.0ValXaa: 0.0 ± 0.0
Trp
0.254TrpAla: 0.254 ± 0.251
0.254TrpCys: 0.254 ± 0.242
0.508TrpAsp: 0.508 ± 0.483
0.762TrpGlu: 0.762 ± 0.454
1.777TrpPhe: 1.777 ± 0.599
0.254TrpGly: 0.254 ± 0.242
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.254TrpLys: 0.254 ± 0.251
0.762TrpLeu: 0.762 ± 0.426
0.254TrpMet: 0.254 ± 0.258
0.508TrpAsn: 0.508 ± 0.379
0.0TrpPro: 0.0 ± 0.0
1.015TrpGln: 1.015 ± 0.442
0.254TrpArg: 0.254 ± 0.303
1.269TrpSer: 1.269 ± 0.557
0.254TrpThr: 0.254 ± 0.192
0.254TrpVal: 0.254 ± 0.192
0.254TrpTrp: 0.254 ± 0.192
0.508TrpTyr: 0.508 ± 0.264
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.793TyrAla: 2.793 ± 0.794
1.015TyrCys: 1.015 ± 0.717
3.554TyrAsp: 3.554 ± 0.665
2.285TyrGlu: 2.285 ± 1.364
2.285TyrPhe: 2.285 ± 0.972
3.808TyrGly: 3.808 ± 0.712
0.254TyrHis: 0.254 ± 0.268
5.077TyrIle: 5.077 ± 1.71
5.077TyrLys: 5.077 ± 0.734
3.554TyrLeu: 3.554 ± 1.009
2.031TyrMet: 2.031 ± 0.396
2.285TyrAsn: 2.285 ± 0.771
1.269TyrPro: 1.269 ± 0.484
1.269TyrGln: 1.269 ± 0.327
2.031TyrArg: 2.031 ± 0.69
2.031TyrSer: 2.031 ± 0.762
3.3TyrThr: 3.3 ± 1.046
4.062TyrVal: 4.062 ± 0.651
0.254TyrTrp: 0.254 ± 0.258
1.777TyrTyr: 1.777 ± 0.55
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3940 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
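A protein-level bootstrap of this kind could look like the sketch below. It is not the database's actual code: the function names are hypothetical, and it assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation of the resampled frequencies, neither of which is spelled out on the page.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumptions: proteins are resampled with replacement n_boot times and
    the error is the population standard deviation of the estimates.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC", "AACC"]  # toy sequences, not real data
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides of each protein together, so a small proteome such as this one (10 proteins) naturally gives fairly wide error estimates.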

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski