Amino acid dipepetide frequency for Streptococcus satellite phage Javan301

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.001AlaAla: 1.001 ± 0.608
0.0AlaCys: 0.0 ± 0.0
1.668AlaAsp: 1.668 ± 0.607
4.003AlaGlu: 4.003 ± 1.374
2.335AlaPhe: 2.335 ± 0.664
1.668AlaGly: 1.668 ± 0.591
0.0AlaHis: 0.0 ± 0.0
1.334AlaIle: 1.334 ± 0.802
4.67AlaLys: 4.67 ± 1.427
4.003AlaLeu: 4.003 ± 1.04
1.334AlaMet: 1.334 ± 0.813
4.003AlaAsn: 4.003 ± 0.977
1.001AlaPro: 1.001 ± 0.5
1.668AlaGln: 1.668 ± 0.95
3.002AlaArg: 3.002 ± 0.855
1.334AlaSer: 1.334 ± 0.798
3.002AlaThr: 3.002 ± 0.868
2.335AlaVal: 2.335 ± 1.018
0.0AlaTrp: 0.0 ± 0.0
4.003AlaTyr: 4.003 ± 0.841
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.334CysAsp: 0.334 ± 0.293
0.334CysGlu: 0.334 ± 0.305
0.0CysPhe: 0.0 ± 0.0
1.001CysGly: 1.001 ± 0.521
0.0CysHis: 0.0 ± 0.0
0.334CysIle: 0.334 ± 0.348
0.334CysLys: 0.334 ± 0.394
0.667CysLeu: 0.667 ± 0.414
0.334CysMet: 0.334 ± 0.313
0.0CysAsn: 0.0 ± 0.0
0.667CysPro: 0.667 ± 0.456
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.667CysThr: 0.667 ± 0.49
0.334CysVal: 0.334 ± 0.293
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.335AspAla: 2.335 ± 1.036
0.667AspCys: 0.667 ± 0.399
6.004AspAsp: 6.004 ± 1.356
2.668AspGlu: 2.668 ± 0.795
5.67AspPhe: 5.67 ± 1.237
3.669AspGly: 3.669 ± 1.093
0.334AspHis: 0.334 ± 0.327
6.338AspIle: 6.338 ± 1.606
8.005AspLys: 8.005 ± 1.438
8.339AspLeu: 8.339 ± 1.258
0.667AspMet: 0.667 ± 0.501
3.336AspAsn: 3.336 ± 0.934
0.334AspPro: 0.334 ± 0.394
1.001AspGln: 1.001 ± 0.457
3.336AspArg: 3.336 ± 1.287
6.338AspSer: 6.338 ± 1.091
2.335AspThr: 2.335 ± 1.223
3.002AspVal: 3.002 ± 0.764
0.334AspTrp: 0.334 ± 0.315
3.336AspTyr: 3.336 ± 1.04
0.0AspXaa: 0.0 ± 0.0
Glu
3.002GluAla: 3.002 ± 0.836
1.001GluCys: 1.001 ± 0.548
6.004GluAsp: 6.004 ± 1.406
3.669GluGlu: 3.669 ± 1.047
3.669GluPhe: 3.669 ± 1.023
2.668GluGly: 2.668 ± 0.939
1.001GluHis: 1.001 ± 0.441
7.005GluIle: 7.005 ± 1.33
10.34GluLys: 10.34 ± 1.888
7.005GluLeu: 7.005 ± 1.765
0.334GluMet: 0.334 ± 0.345
5.003GluAsn: 5.003 ± 1.458
1.334GluPro: 1.334 ± 0.647
4.336GluGln: 4.336 ± 0.965
4.003GluArg: 4.003 ± 1.032
4.336GluSer: 4.336 ± 1.005
5.67GluThr: 5.67 ± 1.192
2.335GluVal: 2.335 ± 0.866
1.334GluTrp: 1.334 ± 0.613
4.336GluTyr: 4.336 ± 1.016
0.0GluXaa: 0.0 ± 0.0
Phe
0.334PheAla: 0.334 ± 0.265
0.334PheCys: 0.334 ± 0.293
5.337PheAsp: 5.337 ± 1.066
4.336PheGlu: 4.336 ± 1.204
3.669PhePhe: 3.669 ± 0.902
2.668PheGly: 2.668 ± 0.672
0.334PheHis: 0.334 ± 0.327
2.335PheIle: 2.335 ± 0.846
6.004PheLys: 6.004 ± 1.302
4.67PheLeu: 4.67 ± 1.128
0.667PheMet: 0.667 ± 0.505
1.001PheAsn: 1.001 ± 0.486
1.668PhePro: 1.668 ± 0.945
1.334PheGln: 1.334 ± 0.495
0.667PheArg: 0.667 ± 0.414
4.67PheSer: 4.67 ± 1.338
2.001PheThr: 2.001 ± 0.863
2.335PheVal: 2.335 ± 0.872
1.001PheTrp: 1.001 ± 0.457
0.334PheTyr: 0.334 ± 0.333
0.0PheXaa: 0.0 ± 0.0
Gly
2.001GlyAla: 2.001 ± 0.713
0.334GlyCys: 0.334 ± 0.34
3.002GlyAsp: 3.002 ± 1.005
1.668GlyGlu: 1.668 ± 0.582
2.335GlyPhe: 2.335 ± 0.694
3.336GlyGly: 3.336 ± 1.287
1.334GlyHis: 1.334 ± 0.798
3.002GlyIle: 3.002 ± 0.964
7.005GlyLys: 7.005 ± 1.488
5.003GlyLeu: 5.003 ± 1.323
0.334GlyMet: 0.334 ± 0.296
2.668GlyAsn: 2.668 ± 1.0
0.334GlyPro: 0.334 ± 0.265
2.001GlyGln: 2.001 ± 0.608
1.001GlyArg: 1.001 ± 0.456
5.003GlySer: 5.003 ± 1.332
3.002GlyThr: 3.002 ± 0.718
2.335GlyVal: 2.335 ± 0.742
1.001GlyTrp: 1.001 ± 0.526
4.67GlyTyr: 4.67 ± 1.358
0.0GlyXaa: 0.0 ± 0.0
His
2.668HisAla: 2.668 ± 1.092
0.0HisCys: 0.0 ± 0.0
0.334HisAsp: 0.334 ± 0.265
0.334HisGlu: 0.334 ± 0.373
0.334HisPhe: 0.334 ± 0.265
0.0HisGly: 0.0 ± 0.0
0.334HisHis: 0.334 ± 0.345
1.001HisIle: 1.001 ± 0.473
1.001HisLys: 1.001 ± 0.755
2.335HisLeu: 2.335 ± 0.849
0.334HisMet: 0.334 ± 0.34
0.667HisAsn: 0.667 ± 0.53
0.334HisPro: 0.334 ± 0.348
1.001HisGln: 1.001 ± 0.487
0.334HisArg: 0.334 ± 0.296
0.0HisSer: 0.0 ± 0.0
2.335HisThr: 2.335 ± 0.687
0.334HisVal: 0.334 ± 0.296
0.0HisTrp: 0.0 ± 0.0
0.667HisTyr: 0.667 ± 0.415
0.0HisXaa: 0.0 ± 0.0
Ile
1.668IleAla: 1.668 ± 0.681
0.0IleCys: 0.0 ± 0.0
5.67IleAsp: 5.67 ± 1.401
5.003IleGlu: 5.003 ± 1.141
2.668IlePhe: 2.668 ± 0.868
3.669IleGly: 3.669 ± 1.121
0.667IleHis: 0.667 ± 0.43
4.67IleIle: 4.67 ± 1.343
8.672IleLys: 8.672 ± 1.388
8.672IleLeu: 8.672 ± 1.422
1.001IleMet: 1.001 ± 0.47
5.337IleAsn: 5.337 ± 1.332
2.335IlePro: 2.335 ± 0.828
4.003IleGln: 4.003 ± 0.877
2.335IleArg: 2.335 ± 0.775
5.67IleSer: 5.67 ± 1.12
3.669IleThr: 3.669 ± 1.382
2.335IleVal: 2.335 ± 0.758
0.334IleTrp: 0.334 ± 0.265
2.335IleTyr: 2.335 ± 0.768
0.0IleXaa: 0.0 ± 0.0
Lys
5.67LysAla: 5.67 ± 1.409
0.667LysCys: 0.667 ± 0.482
7.005LysAsp: 7.005 ± 1.228
10.34LysGlu: 10.34 ± 1.681
3.336LysPhe: 3.336 ± 0.901
7.005LysGly: 7.005 ± 1.206
1.334LysHis: 1.334 ± 0.796
7.672LysIle: 7.672 ± 1.11
10.674LysLys: 10.674 ± 1.926
6.338LysLeu: 6.338 ± 1.196
4.003LysMet: 4.003 ± 1.198
6.671LysAsn: 6.671 ± 1.687
2.335LysPro: 2.335 ± 1.044
5.003LysGln: 5.003 ± 1.278
8.005LysArg: 8.005 ± 1.351
7.005LysSer: 7.005 ± 1.749
7.005LysThr: 7.005 ± 1.48
6.338LysVal: 6.338 ± 1.31
0.667LysTrp: 0.667 ± 0.493
4.67LysTyr: 4.67 ± 1.512
0.0LysXaa: 0.0 ± 0.0
Leu
5.337LeuAla: 5.337 ± 1.347
0.334LeuCys: 0.334 ± 0.313
7.005LeuAsp: 7.005 ± 1.363
10.007LeuGlu: 10.007 ± 1.509
4.336LeuPhe: 4.336 ± 1.15
6.338LeuGly: 6.338 ± 1.206
1.668LeuHis: 1.668 ± 0.95
7.005LeuIle: 7.005 ± 1.63
8.672LeuLys: 8.672 ± 1.656
7.338LeuLeu: 7.338 ± 2.085
2.335LeuMet: 2.335 ± 0.773
5.003LeuAsn: 5.003 ± 1.21
4.003LeuPro: 4.003 ± 1.446
3.669LeuGln: 3.669 ± 1.039
3.669LeuArg: 3.669 ± 0.742
5.67LeuSer: 5.67 ± 1.254
6.671LeuThr: 6.671 ± 1.583
5.003LeuVal: 5.003 ± 1.207
0.0LeuTrp: 0.0 ± 0.0
2.001LeuTyr: 2.001 ± 1.003
0.0LeuXaa: 0.0 ± 0.0
Met
1.334MetAla: 1.334 ± 0.814
0.0MetCys: 0.0 ± 0.0
2.335MetAsp: 2.335 ± 0.694
2.668MetGlu: 2.668 ± 1.066
0.334MetPhe: 0.334 ± 0.373
0.334MetGly: 0.334 ± 0.369
0.334MetHis: 0.334 ± 0.265
1.001MetIle: 1.001 ± 0.582
2.335MetLys: 2.335 ± 0.907
1.668MetLeu: 1.668 ± 0.805
0.0MetMet: 0.0 ± 0.0
2.335MetAsn: 2.335 ± 0.637
0.334MetPro: 0.334 ± 0.348
0.334MetGln: 0.334 ± 0.36
2.001MetArg: 2.001 ± 0.649
0.334MetSer: 0.334 ± 0.394
2.335MetThr: 2.335 ± 1.297
1.334MetVal: 1.334 ± 0.598
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.668AsnAla: 2.668 ± 0.953
0.334AsnCys: 0.334 ± 0.327
2.001AsnAsp: 2.001 ± 0.702
2.001AsnGlu: 2.001 ± 0.754
2.001AsnPhe: 2.001 ± 0.639
4.67AsnGly: 4.67 ± 1.118
1.334AsnHis: 1.334 ± 0.567
4.003AsnIle: 4.003 ± 1.557
4.336AsnLys: 4.336 ± 1.255
6.671AsnLeu: 6.671 ± 1.284
2.335AsnMet: 2.335 ± 0.938
2.001AsnAsn: 2.001 ± 0.825
3.336AsnPro: 3.336 ± 1.438
2.001AsnGln: 2.001 ± 0.984
2.001AsnArg: 2.001 ± 0.734
4.336AsnSer: 4.336 ± 0.999
3.336AsnThr: 3.336 ± 1.29
1.334AsnVal: 1.334 ± 0.873
1.001AsnTrp: 1.001 ± 0.736
3.336AsnTyr: 3.336 ± 0.914
0.0AsnXaa: 0.0 ± 0.0
Pro
0.667ProAla: 0.667 ± 0.431
0.0ProCys: 0.0 ± 0.0
2.668ProAsp: 2.668 ± 0.856
3.002ProGlu: 3.002 ± 1.072
1.668ProPhe: 1.668 ± 0.79
1.001ProGly: 1.001 ± 0.572
0.667ProHis: 0.667 ± 0.401
2.335ProIle: 2.335 ± 0.716
4.67ProLys: 4.67 ± 1.467
0.667ProLeu: 0.667 ± 0.426
0.0ProMet: 0.0 ± 0.0
1.001ProAsn: 1.001 ± 0.636
0.334ProPro: 0.334 ± 0.373
0.667ProGln: 0.667 ± 0.379
1.668ProArg: 1.668 ± 0.659
0.667ProSer: 0.667 ± 0.395
1.001ProThr: 1.001 ± 0.572
1.668ProVal: 1.668 ± 0.743
0.0ProTrp: 0.0 ± 0.0
1.334ProTyr: 1.334 ± 0.605
0.0ProXaa: 0.0 ± 0.0
Gln
4.003GlnAla: 4.003 ± 1.457
0.334GlnCys: 0.334 ± 0.394
2.668GlnAsp: 2.668 ± 0.879
4.336GlnGlu: 4.336 ± 0.906
1.001GlnPhe: 1.001 ± 0.727
1.001GlnGly: 1.001 ± 0.657
1.001GlnHis: 1.001 ± 0.769
2.001GlnIle: 2.001 ± 0.606
3.002GlnLys: 3.002 ± 0.758
4.003GlnLeu: 4.003 ± 1.234
1.001GlnMet: 1.001 ± 0.517
1.001GlnAsn: 1.001 ± 0.576
0.667GlnPro: 0.667 ± 0.405
1.668GlnGln: 1.668 ± 0.741
2.335GlnArg: 2.335 ± 0.951
1.334GlnSer: 1.334 ± 0.474
2.335GlnThr: 2.335 ± 0.888
4.003GlnVal: 4.003 ± 1.177
0.334GlnTrp: 0.334 ± 0.296
1.334GlnTyr: 1.334 ± 0.673
0.0GlnXaa: 0.0 ± 0.0
Arg
1.668ArgAla: 1.668 ± 0.731
0.0ArgCys: 0.0 ± 0.0
2.668ArgAsp: 2.668 ± 0.768
3.669ArgGlu: 3.669 ± 1.146
2.668ArgPhe: 2.668 ± 0.949
1.334ArgGly: 1.334 ± 0.685
0.667ArgHis: 0.667 ± 0.399
3.669ArgIle: 3.669 ± 0.941
7.338ArgLys: 7.338 ± 1.292
4.336ArgLeu: 4.336 ± 1.046
0.667ArgMet: 0.667 ± 0.426
4.003ArgAsn: 4.003 ± 0.793
0.667ArgPro: 0.667 ± 0.473
2.335ArgGln: 2.335 ± 0.962
2.001ArgArg: 2.001 ± 0.826
2.668ArgSer: 2.668 ± 0.966
2.668ArgThr: 2.668 ± 1.03
2.335ArgVal: 2.335 ± 0.956
0.334ArgTrp: 0.334 ± 0.265
2.335ArgTyr: 2.335 ± 0.839
0.0ArgXaa: 0.0 ± 0.0
Ser
1.668SerAla: 1.668 ± 0.755
0.334SerCys: 0.334 ± 0.348
4.003SerAsp: 4.003 ± 1.458
6.004SerGlu: 6.004 ± 0.934
4.003SerPhe: 4.003 ± 1.091
4.336SerGly: 4.336 ± 0.977
2.001SerHis: 2.001 ± 0.642
2.668SerIle: 2.668 ± 0.962
7.672SerLys: 7.672 ± 1.559
7.005SerLeu: 7.005 ± 1.391
1.334SerMet: 1.334 ± 0.884
1.334SerAsn: 1.334 ± 0.465
2.335SerPro: 2.335 ± 0.895
3.002SerGln: 3.002 ± 0.842
1.334SerArg: 1.334 ± 0.795
3.336SerSer: 3.336 ± 1.16
5.003SerThr: 5.003 ± 1.395
3.669SerVal: 3.669 ± 0.97
0.667SerTrp: 0.667 ± 0.405
2.001SerTyr: 2.001 ± 0.583
0.0SerXaa: 0.0 ± 0.0
Thr
2.001ThrAla: 2.001 ± 0.954
0.0ThrCys: 0.0 ± 0.0
2.001ThrAsp: 2.001 ± 0.723
4.336ThrGlu: 4.336 ± 0.829
2.001ThrPhe: 2.001 ± 0.756
4.336ThrGly: 4.336 ± 1.053
1.001ThrHis: 1.001 ± 0.519
6.004ThrIle: 6.004 ± 1.117
5.337ThrLys: 5.337 ± 1.362
7.338ThrLeu: 7.338 ± 1.266
1.668ThrMet: 1.668 ± 0.557
3.336ThrAsn: 3.336 ± 1.085
1.668ThrPro: 1.668 ± 0.669
2.001ThrGln: 2.001 ± 1.451
4.67ThrArg: 4.67 ± 1.56
2.668ThrSer: 2.668 ± 1.005
3.336ThrThr: 3.336 ± 1.221
4.67ThrVal: 4.67 ± 1.384
0.667ThrTrp: 0.667 ± 0.447
3.002ThrTyr: 3.002 ± 1.178
0.0ThrXaa: 0.0 ± 0.0
Val
2.668ValAla: 2.668 ± 0.902
0.334ValCys: 0.334 ± 0.293
4.336ValAsp: 4.336 ± 1.44
4.336ValGlu: 4.336 ± 1.229
1.001ValPhe: 1.001 ± 0.603
1.334ValGly: 1.334 ± 0.537
0.334ValHis: 0.334 ± 0.265
3.669ValIle: 3.669 ± 1.06
5.67ValLys: 5.67 ± 1.337
3.336ValLeu: 3.336 ± 1.236
1.334ValMet: 1.334 ± 0.515
4.003ValAsn: 4.003 ± 0.882
1.334ValPro: 1.334 ± 0.681
1.334ValGln: 1.334 ± 0.738
1.668ValArg: 1.668 ± 0.703
4.336ValSer: 4.336 ± 1.116
4.003ValThr: 4.003 ± 0.96
2.001ValVal: 2.001 ± 0.816
0.334ValTrp: 0.334 ± 0.305
2.001ValTyr: 2.001 ± 0.721
0.0ValXaa: 0.0 ± 0.0
Trp
1.001TrpAla: 1.001 ± 0.514
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
2.335TrpGlu: 2.335 ± 0.692
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.667TrpIle: 0.667 ± 0.37
1.001TrpLys: 1.001 ± 0.46
1.001TrpLeu: 1.001 ± 0.764
0.334TrpMet: 0.334 ± 0.329
0.334TrpAsn: 0.334 ± 0.296
0.0TrpPro: 0.0 ± 0.0
0.334TrpGln: 0.334 ± 0.34
0.334TrpArg: 0.334 ± 0.305
0.334TrpSer: 0.334 ± 0.327
0.0TrpThr: 0.0 ± 0.0
0.667TrpVal: 0.667 ± 0.375
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.667TyrAla: 0.667 ± 0.612
0.334TyrCys: 0.334 ± 0.305
3.002TyrAsp: 3.002 ± 1.006
3.336TyrGlu: 3.336 ± 0.869
2.668TyrPhe: 2.668 ± 1.057
1.001TyrGly: 1.001 ± 0.479
0.0TyrHis: 0.0 ± 0.0
4.003TyrIle: 4.003 ± 0.761
5.337TyrLys: 5.337 ± 1.136
5.67TyrLeu: 5.67 ± 1.419
1.001TyrMet: 1.001 ± 0.618
2.001TyrAsn: 2.001 ± 0.911
0.667TyrPro: 0.667 ± 0.405
1.668TyrGln: 1.668 ± 0.728
3.669TyrArg: 3.669 ± 1.462
3.669TyrSer: 3.669 ± 1.187
1.668TyrThr: 1.668 ± 0.779
1.001TyrVal: 1.001 ± 0.616
0.334TyrTrp: 0.334 ± 0.293
1.668TyrTyr: 1.668 ± 0.745
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (2999 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski