Amino acid dipeptide frequency for Podoviridae sp. ctjc_2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
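As a rough illustration of what such a calculation can look like, the sketch below counts overlapping adjacent residue pairs within each protein and normalises by the total number of pairs. The function names, the one-letter input format and the choice of denominator are assumptions made for this example; the exact procedure used to build the table is not stated on this page.

```python
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs, never spanning two proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_frequencies(proteins):
    """Return each observed dipeptide's share of all counted dipeptides.

    Assumption: the denominator is the total number of dipeptides.
    """
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input in one-letter code (hypothetical sequences, not from this proteome).
toy_proteins = ["MAAK", "AAG"]
toy_frequencies = dipeptide_frequencies(toy_proteins)
```

With the toy input, the pairs are MA, AA, AK from the first sequence and AA, AG from the second, so AA accounts for two of the five counted dipeptides.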

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
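The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the function names are hypothetical, and the rule assumed here (strictly above or below 1/400) may differ from the rule the page actually uses for colouring.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected dipeptide frequency in per mille if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed colouring rule)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # marked in red in the table
    if observed_per_mille < expected:
        return "under"  # marked in blue in the table
    return "expected"


# Two values taken from the table: AlaGlu and CysCys.
ala_glu = classify(10.112)
cys_cys = classify(0.361)
```

With 20 amino acids the expectation is 2.5‰, so AlaGlu (10.112‰) falls above it and CysCys (0.361‰) below it. Passing `alphabet_size=21` gives the expectation for the 441-dipeptide case.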

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
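A minimal sketch of the unit conversion, using hypothetical helper names:

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value_per_mille * 0.1


# AlaAla is listed as 5.598 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(5.598)
ala_ala_percent = per_mille_to_percent(5.598)
```

For AlaAla this gives a fraction of about 0.005598, or about 0.56%.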

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.598 ± 1.502
AlaCys: 1.083 ± 0.452
AlaAsp: 3.25 ± 0.731
AlaGlu: 10.112 ± 5.47
AlaPhe: 1.986 ± 0.859
AlaGly: 3.611 ± 1.008
AlaHis: 1.445 ± 0.401
AlaIle: 2.167 ± 0.465
AlaLys: 3.973 ± 0.967
AlaLeu: 5.959 ± 1.095
AlaMet: 1.625 ± 0.509
AlaAsn: 5.417 ± 0.867
AlaPro: 2.709 ± 0.806
AlaGln: 2.889 ± 0.55
AlaArg: 1.806 ± 0.616
AlaSer: 3.973 ± 0.87
AlaThr: 4.153 ± 1.191
AlaVal: 4.695 ± 1.062
AlaTrp: 1.083 ± 0.381
AlaTyr: 3.07 ± 0.723
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.903 ± 0.32
CysCys: 0.361 ± 0.191
CysAsp: 0.542 ± 0.243
CysGlu: 0.722 ± 0.405
CysPhe: 0.903 ± 0.312
CysGly: 0.542 ± 0.266
CysHis: 0.181 ± 0.135
CysIle: 0.361 ± 0.192
CysLys: 0.542 ± 0.373
CysLeu: 1.264 ± 0.524
CysMet: 0.722 ± 0.322
CysAsn: 0.722 ± 0.454
CysPro: 0.722 ± 0.389
CysGln: 0.181 ± 0.217
CysArg: 0.361 ± 0.273
CysSer: 0.722 ± 0.257
CysThr: 0.0 ± 0.0
CysVal: 0.542 ± 0.255
CysTrp: 0.181 ± 0.135
CysTyr: 0.542 ± 0.379
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.153 ± 1.028
AspCys: 0.722 ± 0.376
AspAsp: 1.986 ± 0.78
AspGlu: 3.973 ± 0.882
AspPhe: 2.167 ± 0.901
AspGly: 4.153 ± 0.982
AspHis: 0.542 ± 0.214
AspIle: 5.056 ± 1.507
AspLys: 4.334 ± 0.633
AspLeu: 3.792 ± 0.513
AspMet: 1.083 ± 0.456
AspAsn: 3.792 ± 1.25
AspPro: 0.903 ± 0.292
AspGln: 0.903 ± 0.405
AspArg: 1.264 ± 0.428
AspSer: 5.237 ± 0.66
AspThr: 5.417 ± 0.915
AspVal: 3.07 ± 0.811
AspTrp: 0.181 ± 0.174
AspTyr: 2.709 ± 0.752
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.598 ± 2.116
GluCys: 0.542 ± 0.366
GluAsp: 3.792 ± 0.804
GluGlu: 3.431 ± 0.887
GluPhe: 3.431 ± 0.947
GluGly: 4.334 ± 0.765
GluHis: 1.445 ± 0.311
GluIle: 6.501 ± 1.331
GluLys: 4.875 ± 0.816
GluLeu: 4.875 ± 1.018
GluMet: 2.528 ± 1.238
GluAsn: 3.973 ± 0.812
GluPro: 2.167 ± 0.645
GluGln: 4.514 ± 1.162
GluArg: 3.792 ± 0.862
GluSer: 4.514 ± 1.075
GluThr: 5.056 ± 1.314
GluVal: 3.611 ± 0.827
GluTrp: 1.264 ± 0.464
GluTyr: 2.709 ± 0.895
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.445 ± 0.624
PheCys: 0.181 ± 0.135
PheAsp: 3.25 ± 0.843
PheGlu: 2.709 ± 0.864
PhePhe: 1.445 ± 0.673
PheGly: 1.625 ± 0.819
PheHis: 1.083 ± 0.577
PheIle: 2.347 ± 1.015
PheLys: 2.709 ± 0.567
PheLeu: 2.528 ± 0.589
PheMet: 1.445 ± 0.46
PheAsn: 4.153 ± 0.872
PhePro: 1.083 ± 0.361
PheGln: 2.889 ± 0.932
PheArg: 1.264 ± 0.478
PheSer: 2.889 ± 0.732
PheThr: 2.167 ± 0.552
PheVal: 3.07 ± 1.099
PheTrp: 0.542 ± 0.248
PheTyr: 1.445 ± 0.546
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.709 ± 0.757
GlyCys: 0.903 ± 0.465
GlyAsp: 2.167 ± 0.663
GlyGlu: 3.611 ± 0.867
GlyPhe: 3.611 ± 0.604
GlyGly: 3.792 ± 0.622
GlyHis: 1.264 ± 0.642
GlyIle: 3.611 ± 1.039
GlyLys: 3.973 ± 0.978
GlyLeu: 3.07 ± 0.826
GlyMet: 2.347 ± 0.716
GlyAsn: 3.611 ± 0.54
GlyPro: 1.445 ± 0.636
GlyGln: 1.625 ± 0.521
GlyArg: 1.083 ± 0.434
GlySer: 4.153 ± 0.732
GlyThr: 4.875 ± 1.159
GlyVal: 3.431 ± 0.849
GlyTrp: 1.264 ± 0.45
GlyTyr: 4.334 ± 0.895
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.903 ± 0.353
HisCys: 0.361 ± 0.285
HisAsp: 1.083 ± 0.433
HisGlu: 0.722 ± 0.435
HisPhe: 0.542 ± 0.316
HisGly: 0.542 ± 0.266
HisHis: 0.542 ± 0.251
HisIle: 1.445 ± 0.456
HisLys: 1.445 ± 0.512
HisLeu: 1.264 ± 0.537
HisMet: 0.181 ± 0.135
HisAsn: 1.625 ± 0.334
HisPro: 0.361 ± 0.273
HisGln: 0.722 ± 0.406
HisArg: 0.542 ± 0.217
HisSer: 1.445 ± 0.374
HisThr: 0.722 ± 0.434
HisVal: 0.0 ± 0.0
HisTrp: 0.181 ± 0.218
HisTyr: 1.625 ± 0.417
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.528 ± 0.526
IleCys: 0.361 ± 0.272
IleAsp: 4.514 ± 0.991
IleGlu: 5.237 ± 0.932
IlePhe: 1.806 ± 0.534
IleGly: 3.07 ± 0.784
IleHis: 1.445 ± 0.534
IleIle: 4.514 ± 0.922
IleLys: 5.778 ± 1.009
IleLeu: 4.875 ± 1.382
IleMet: 2.528 ± 0.585
IleAsn: 3.611 ± 0.901
IlePro: 2.889 ± 0.695
IleGln: 2.167 ± 0.745
IleArg: 1.625 ± 0.821
IleSer: 5.417 ± 1.052
IleThr: 5.778 ± 1.016
IleVal: 3.792 ± 0.889
IleTrp: 0.542 ± 0.351
IleTyr: 1.806 ± 0.678
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.875 ± 0.467
LysCys: 0.722 ± 0.284
LysAsp: 3.792 ± 0.582
LysGlu: 4.514 ± 1.07
LysPhe: 2.347 ± 0.847
LysGly: 5.056 ± 1.062
LysHis: 1.445 ± 0.534
LysIle: 4.334 ± 0.942
LysLys: 4.514 ± 1.405
LysLeu: 6.862 ± 0.818
LysMet: 1.445 ± 0.592
LysAsn: 3.973 ± 1.028
LysPro: 1.445 ± 0.504
LysGln: 2.709 ± 0.515
LysArg: 3.611 ± 0.905
LysSer: 3.431 ± 0.801
LysThr: 5.778 ± 0.481
LysVal: 3.792 ± 0.702
LysTrp: 0.903 ± 0.384
LysTyr: 1.986 ± 1.037
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.334 ± 0.907
LeuCys: 0.722 ± 0.304
LeuAsp: 5.417 ± 0.837
LeuGlu: 5.237 ± 1.191
LeuPhe: 4.153 ± 1.3
LeuGly: 1.445 ± 0.321
LeuHis: 2.167 ± 0.654
LeuIle: 6.501 ± 0.906
LeuLys: 6.501 ± 1.056
LeuLeu: 4.875 ± 1.133
LeuMet: 1.264 ± 0.588
LeuAsn: 6.32 ± 1.185
LeuPro: 4.695 ± 0.858
LeuGln: 3.973 ± 1.91
LeuArg: 2.709 ± 0.519
LeuSer: 6.139 ± 1.021
LeuThr: 6.32 ± 1.49
LeuVal: 3.973 ± 0.656
LeuTrp: 0.903 ± 0.35
LeuTyr: 4.514 ± 0.745
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.445 ± 0.426
MetCys: 0.181 ± 0.177
MetAsp: 1.806 ± 0.549
MetGlu: 0.903 ± 0.519
MetPhe: 1.083 ± 0.571
MetGly: 1.625 ± 0.549
MetHis: 0.361 ± 0.236
MetIle: 1.083 ± 0.459
MetLys: 2.528 ± 0.681
MetLeu: 2.709 ± 0.377
MetMet: 0.361 ± 0.24
MetAsn: 1.445 ± 0.556
MetPro: 0.903 ± 0.397
MetGln: 0.722 ± 0.394
MetArg: 1.083 ± 0.372
MetSer: 2.167 ± 0.739
MetThr: 2.528 ± 0.654
MetVal: 1.264 ± 0.644
MetTrp: 0.0 ± 0.0
MetTyr: 0.722 ± 0.295
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.32 ± 1.406
AsnCys: 0.542 ± 0.256
AsnAsp: 1.986 ± 0.529
AsnGlu: 5.417 ± 1.144
AsnPhe: 1.625 ± 0.525
AsnGly: 3.611 ± 0.6
AsnHis: 0.542 ± 0.259
AsnIle: 5.237 ± 0.99
AsnLys: 2.889 ± 0.997
AsnLeu: 5.778 ± 0.812
AsnMet: 1.625 ± 0.478
AsnAsn: 3.611 ± 0.601
AsnPro: 2.889 ± 1.016
AsnGln: 3.25 ± 0.873
AsnArg: 1.986 ± 0.545
AsnSer: 4.334 ± 1.099
AsnThr: 3.431 ± 1.042
AsnVal: 3.792 ± 0.831
AsnTrp: 0.542 ± 0.331
AsnTyr: 3.792 ± 0.954
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.986 ± 0.587
ProCys: 0.722 ± 0.434
ProAsp: 2.167 ± 0.67
ProGlu: 3.611 ± 1.052
ProPhe: 2.347 ± 0.558
ProGly: 2.167 ± 0.569
ProHis: 0.361 ± 0.239
ProIle: 1.625 ± 0.278
ProLys: 1.264 ± 0.437
ProLeu: 4.153 ± 1.092
ProMet: 0.542 ± 0.283
ProAsn: 1.083 ± 0.336
ProPro: 2.167 ± 1.276
ProGln: 1.083 ± 0.494
ProArg: 0.722 ± 0.54
ProSer: 2.167 ± 0.725
ProThr: 1.625 ± 0.63
ProVal: 3.07 ± 0.844
ProTrp: 0.181 ± 0.182
ProTyr: 2.167 ± 0.412
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.695 ± 1.65
GlnCys: 0.361 ± 0.364
GlnAsp: 2.528 ± 0.746
GlnGlu: 3.431 ± 0.862
GlnPhe: 0.722 ± 0.299
GlnGly: 3.07 ± 0.668
GlnHis: 0.181 ± 0.177
GlnIle: 1.625 ± 0.458
GlnLys: 3.07 ± 0.871
GlnLeu: 3.973 ± 0.917
GlnMet: 1.083 ± 0.319
GlnAsn: 2.709 ± 0.461
GlnPro: 1.083 ± 0.433
GlnGln: 2.889 ± 1.654
GlnArg: 1.625 ± 0.482
GlnSer: 2.889 ± 1.146
GlnThr: 3.25 ± 0.837
GlnVal: 2.528 ± 0.913
GlnTrp: 0.181 ± 0.182
GlnTyr: 1.625 ± 0.59
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.514 ± 1.954
ArgCys: 0.542 ± 0.327
ArgAsp: 1.806 ± 0.489
ArgGlu: 2.528 ± 0.76
ArgPhe: 1.625 ± 0.651
ArgGly: 1.986 ± 0.763
ArgHis: 0.181 ± 0.198
ArgIle: 1.986 ± 0.569
ArgLys: 1.986 ± 0.463
ArgLeu: 2.167 ± 0.851
ArgMet: 1.083 ± 0.476
ArgAsn: 1.445 ± 0.473
ArgPro: 0.361 ± 0.242
ArgGln: 1.445 ± 0.608
ArgArg: 0.903 ± 0.333
ArgSer: 1.986 ± 0.441
ArgThr: 1.986 ± 0.673
ArgVal: 3.431 ± 0.908
ArgTrp: 0.542 ± 0.216
ArgTyr: 1.264 ± 0.446
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.417 ± 1.152
SerCys: 0.903 ± 0.332
SerAsp: 4.334 ± 0.7
SerGlu: 4.695 ± 0.741
SerPhe: 2.167 ± 0.715
SerGly: 5.598 ± 1.096
SerHis: 0.722 ± 0.267
SerIle: 4.153 ± 0.744
SerLys: 3.973 ± 0.617
SerLeu: 7.403 ± 1.1
SerMet: 0.903 ± 0.316
SerAsn: 3.25 ± 0.781
SerPro: 2.528 ± 0.718
SerGln: 2.889 ± 0.895
SerArg: 2.167 ± 0.554
SerSer: 8.306 ± 1.527
SerThr: 5.417 ± 1.344
SerVal: 3.611 ± 0.694
SerTrp: 0.722 ± 0.352
SerTyr: 3.611 ± 0.702
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.778 ± 0.971
ThrCys: 0.542 ± 0.29
ThrAsp: 5.237 ± 1.236
ThrGlu: 5.598 ± 0.614
ThrPhe: 2.167 ± 0.582
ThrGly: 5.056 ± 1.45
ThrHis: 0.542 ± 0.465
ThrIle: 5.417 ± 1.4
ThrLys: 4.514 ± 0.717
ThrLeu: 5.959 ± 1.16
ThrMet: 0.903 ± 0.399
ThrAsn: 4.153 ± 0.964
ThrPro: 1.986 ± 0.351
ThrGln: 3.973 ± 0.79
ThrArg: 2.889 ± 0.822
ThrSer: 3.973 ± 0.916
ThrThr: 6.32 ± 2.912
ThrVal: 5.056 ± 1.759
ThrTrp: 0.181 ± 0.214
ThrTyr: 2.167 ± 0.757
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.514 ± 0.554
ValCys: 0.542 ± 0.259
ValAsp: 3.25 ± 0.965
ValGlu: 3.611 ± 0.725
ValPhe: 1.806 ± 0.541
ValGly: 1.625 ± 0.747
ValHis: 0.542 ± 0.376
ValIle: 3.25 ± 1.21
ValLys: 4.153 ± 1.04
ValLeu: 5.237 ± 0.761
ValMet: 2.528 ± 0.586
ValAsn: 4.153 ± 1.014
ValPro: 2.889 ± 0.546
ValGln: 2.528 ± 0.686
ValArg: 1.625 ± 0.647
ValSer: 5.598 ± 0.912
ValThr: 4.514 ± 1.08
ValVal: 1.986 ± 0.584
ValTrp: 0.722 ± 0.417
ValTyr: 3.25 ± 0.666
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.542 ± 0.326
TrpCys: 0.181 ± 0.177
TrpAsp: 0.542 ± 0.295
TrpGlu: 0.361 ± 0.285
TrpPhe: 0.903 ± 0.358
TrpGly: 0.722 ± 0.404
TrpHis: 0.361 ± 0.272
TrpIle: 0.542 ± 0.27
TrpLys: 1.445 ± 0.339
TrpLeu: 0.722 ± 0.31
TrpMet: 0.0 ± 0.0
TrpAsn: 0.722 ± 0.305
TrpPro: 0.0 ± 0.0
TrpGln: 0.542 ± 0.29
TrpArg: 0.542 ± 0.356
TrpSer: 0.542 ± 0.306
TrpThr: 0.361 ± 0.192
TrpVal: 0.722 ± 0.354
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.903 ± 0.325
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.528 ± 0.948
TyrCys: 0.542 ± 0.306
TyrAsp: 1.986 ± 0.408
TyrGlu: 2.528 ± 0.842
TyrPhe: 3.431 ± 1.155
TyrGly: 3.25 ± 0.763
TyrHis: 0.903 ± 0.35
TyrIle: 2.528 ± 0.672
TyrLys: 3.07 ± 0.833
TyrLeu: 4.695 ± 0.671
TyrMet: 0.542 ± 0.353
TyrAsn: 3.431 ± 0.592
TyrPro: 1.986 ± 0.717
TyrGln: 1.625 ± 0.662
TyrArg: 2.167 ± 0.513
TyrSer: 2.889 ± 0.718
TyrThr: 2.709 ± 0.371
TyrVal: 2.889 ± 0.867
TyrTrp: 0.542 ± 0.404
TyrTyr: 3.611 ± 0.782
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (5539 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
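Protein-level bootstrapping can be sketched roughly as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicate values is reported as the error. The function names, the use of the sample standard deviation as the error measure, and the normalisation by total dipeptide count are assumptions for this illustration; the page does not specify these details.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide among all adjacent residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        resampled = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(resampled, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["MAAK", "AAG", "MKKL", "GAAA"]
toy_error = bootstrap_error(toy_proteome, "AA")
```

A dipeptide that never occurs in any protein has a frequency of zero in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.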

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski