Amino acid dipeptide frequency for Streptococcus satellite phage Javan359

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as a 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
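As a rough illustration of how such per mille values can be obtained, here is a minimal Python sketch. It is not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) and to normalize by the total number of counted pairs are all assumptions made for this example. It uses one-letter residue codes, whereas the table uses three-letter codes.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and the counts are normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy = ["MKKLE", "MKEL"]  # made-up sequences, not taken from the proteome
freqs = dipeptide_per_mille(toy)

# Dipeptides above the uniform expectation (the ones the page marks in red).
over = sorted(pair for pair, f in freqs.items() if f > EXPECTED_PER_MILLE)
```

With only seven pairs in the toy input, every observed dipeptide is far above 2.5‰; the comparison with the uniform expectation only becomes informative for a whole proteome.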

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 1.302 ± 0.689 | 0.326 ± 0.337 | 1.628 ± 0.475 | 4.557 ± 1.618 | 1.953 ± 0.726 | 1.302 ± 0.51 | 0.0 ± 0.0 | 3.255 ± 1.545 | 8.138 ± 1.694 | 3.581 ± 1.038 | 0.977 ± 0.478 | 2.93 ± 1.01 | 0.977 ± 0.496 | 1.628 ± 0.75 | 2.279 ± 0.94 | 1.302 ± 0.742 | 2.93 ± 0.728 | 2.93 ± 1.019 | 0.0 ± 0.0 | 3.581 ± 0.802 | 0.0 ± 0.0
Cys | 0.326 ± 0.333 | 0.326 ± 0.319 | 0.0 ± 0.0 | 0.326 ± 0.409 | 0.0 ± 0.0 | 0.977 ± 0.498 | 0.0 ± 0.0 | 0.326 ± 0.248 | 0.326 ± 0.453 | 0.651 ± 0.558 | 0.326 ± 0.4 | 0.326 ± 0.277 | 0.651 ± 0.323 | 0.326 ± 0.319 | 0.0 ± 0.0 | 0.326 ± 0.337 | 0.326 ± 0.279 | 0.326 ± 0.273 | 0.0 ± 0.0 | 0.326 ± 0.273 | 0.0 ± 0.0
Asp | 0.977 ± 0.544 | 0.651 ± 0.362 | 4.883 ± 1.328 | 2.279 ± 0.661 | 5.208 ± 1.231 | 3.255 ± 0.88 | 0.326 ± 0.279 | 4.557 ± 1.608 | 6.185 ± 1.2 | 6.836 ± 1.363 | 0.977 ± 0.62 | 5.859 ± 1.208 | 0.977 ± 0.67 | 0.977 ± 0.459 | 1.953 ± 0.745 | 5.534 ± 1.27 | 3.906 ± 1.102 | 3.906 ± 0.738 | 0.326 ± 0.273 | 3.255 ± 1.145 | 0.0 ± 0.0
Glu | 3.255 ± 0.991 | 0.977 ± 0.437 | 5.859 ± 1.369 | 3.255 ± 1.021 | 2.604 ± 0.728 | 2.279 ± 0.743 | 1.302 ± 0.551 | 7.487 ± 1.11 | 11.068 ± 1.872 | 9.115 ± 1.514 | 1.302 ± 0.82 | 4.557 ± 1.456 | 1.628 ± 1.043 | 5.859 ± 1.365 | 2.93 ± 1.041 | 3.906 ± 0.852 | 6.185 ± 1.382 | 2.604 ± 1.008 | 1.302 ± 0.517 | 4.232 ± 1.149 | 0.0 ± 0.0
Phe | 0.651 ± 0.436 | 0.326 ± 0.273 | 3.255 ± 0.956 | 4.232 ± 1.156 | 3.255 ± 0.983 | 2.93 ± 0.816 | 0.326 ± 0.279 | 2.279 ± 0.702 | 4.557 ± 1.392 | 4.232 ± 0.811 | 0.977 ± 0.559 | 0.651 ± 0.546 | 0.651 ± 0.495 | 0.651 ± 0.366 | 1.953 ± 0.579 | 4.557 ± 1.222 | 1.628 ± 0.682 | 2.604 ± 1.066 | 0.651 ± 0.377 | 0.326 ± 0.279 | 0.0 ± 0.0
Gly | 1.953 ± 0.72 | 0.326 ± 0.279 | 1.628 ± 0.717 | 1.953 ± 0.724 | 1.953 ± 0.695 | 2.604 ± 1.106 | 1.302 ± 0.672 | 2.604 ± 0.837 | 6.836 ± 1.44 | 4.232 ± 1.199 | 0.977 ± 0.619 | 1.953 ± 0.814 | 0.0 ± 0.0 | 1.302 ± 0.588 | 1.302 ± 0.544 | 3.906 ± 1.296 | 3.255 ± 0.585 | 3.255 ± 1.003 | 0.977 ± 0.515 | 4.557 ± 1.162 | 0.0 ± 0.0
His | 2.279 ± 1.096 | 0.0 ± 0.0 | 0.326 ± 0.337 | 1.302 ± 0.582 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.326 ± 0.333 | 0.651 ± 0.457 | 1.302 ± 0.575 | 2.604 ± 1.136 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.326 ± 0.248 | 0.977 ± 0.514 | 0.0 ± 0.0 | 0.326 ± 0.333 | 3.255 ± 0.809 | 0.326 ± 0.268 | 0.326 ± 0.277 | 0.326 ± 0.325 | 0.0 ± 0.0
Ile | 2.279 ± 0.869 | 0.0 ± 0.0 | 5.208 ± 1.171 | 5.534 ± 1.52 | 2.279 ± 0.961 | 3.255 ± 0.834 | 0.977 ± 0.504 | 4.232 ± 1.232 | 7.487 ± 1.5 | 7.487 ± 0.812 | 0.977 ± 0.496 | 5.208 ± 1.088 | 2.93 ± 0.678 | 4.883 ± 0.916 | 1.302 ± 0.603 | 6.185 ± 1.065 | 4.557 ± 0.966 | 2.93 ± 1.039 | 0.0 ± 0.0 | 2.604 ± 0.714 | 0.0 ± 0.0
Lys | 4.883 ± 1.52 | 0.977 ± 0.47 | 5.859 ± 1.668 | 12.37 ± 1.149 | 2.93 ± 1.093 | 6.836 ± 1.582 | 2.279 ± 0.795 | 7.812 ± 1.034 | 10.091 ± 1.257 | 8.138 ± 1.489 | 3.255 ± 0.925 | 6.185 ± 1.466 | 2.279 ± 0.896 | 4.883 ± 1.147 | 7.487 ± 1.546 | 6.51 ± 1.766 | 6.51 ± 1.722 | 5.859 ± 1.336 | 0.651 ± 0.426 | 5.534 ± 1.504 | 0.0 ± 0.0
Leu | 5.859 ± 1.445 | 0.651 ± 0.54 | 8.138 ± 1.458 | 9.44 ± 1.709 | 4.557 ± 1.218 | 6.836 ± 1.317 | 1.953 ± 0.767 | 6.185 ± 1.324 | 9.44 ± 1.623 | 7.161 ± 1.473 | 1.628 ± 0.517 | 4.883 ± 1.308 | 2.279 ± 0.746 | 2.93 ± 0.671 | 2.93 ± 0.781 | 6.185 ± 1.297 | 7.812 ± 1.551 | 5.859 ± 1.408 | 0.0 ± 0.0 | 2.93 ± 0.87 | 0.0 ± 0.0
Met | 2.279 ± 0.83 | 0.0 ± 0.0 | 2.279 ± 0.936 | 1.628 ± 0.877 | 0.326 ± 0.318 | 0.651 ± 0.533 | 0.0 ± 0.0 | 1.302 ± 0.581 | 2.604 ± 0.615 | 1.302 ± 0.457 | 0.0 ± 0.0 | 1.953 ± 0.805 | 0.977 ± 0.519 | 0.326 ± 0.342 | 0.977 ± 0.529 | 0.651 ± 0.453 | 1.628 ± 0.602 | 0.977 ± 0.469 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 3.255 ± 0.843 | 0.326 ± 0.279 | 2.279 ± 0.907 | 2.93 ± 0.792 | 2.604 ± 0.718 | 2.93 ± 1.188 | 1.628 ± 0.543 | 3.255 ± 1.109 | 4.883 ± 1.181 | 5.534 ± 1.222 | 1.953 ± 0.682 | 3.906 ± 1.106 | 1.628 ± 0.485 | 3.255 ± 1.136 | 2.604 ± 0.852 | 4.232 ± 1.428 | 4.883 ± 1.475 | 1.953 ± 0.929 | 0.977 ± 0.579 | 2.604 ± 0.705 | 0.0 ± 0.0
Pro | 1.302 ± 0.581 | 0.0 ± 0.0 | 2.279 ± 0.829 | 2.279 ± 0.844 | 0.977 ± 0.477 | 0.326 ± 0.248 | 0.651 ± 0.513 | 1.953 ± 0.766 | 4.232 ± 1.182 | 0.977 ± 0.466 | 0.326 ± 0.333 | 0.977 ± 0.438 | 0.326 ± 0.318 | 0.977 ± 0.774 | 1.302 ± 0.553 | 0.651 ± 0.366 | 0.326 ± 0.248 | 1.302 ± 0.457 | 0.326 ± 0.374 | 1.628 ± 0.79 | 0.0 ± 0.0
Gln | 4.557 ± 1.113 | 0.326 ± 0.453 | 2.279 ± 0.805 | 4.232 ± 0.905 | 1.628 ± 0.816 | 0.651 ± 0.526 | 0.977 ± 0.554 | 1.953 ± 0.531 | 4.232 ± 0.832 | 3.581 ± 1.349 | 0.326 ± 0.273 | 1.953 ± 0.743 | 1.302 ± 0.542 | 2.279 ± 0.988 | 2.279 ± 0.794 | 2.279 ± 0.861 | 3.255 ± 0.936 | 3.255 ± 0.953 | 0.326 ± 0.268 | 1.953 ± 0.565 | 0.0 ± 0.0
Arg | 0.977 ± 0.628 | 0.0 ± 0.0 | 2.93 ± 0.907 | 2.93 ± 1.041 | 1.628 ± 0.627 | 1.953 ± 0.893 | 0.326 ± 0.279 | 3.581 ± 0.893 | 6.836 ± 1.04 | 3.906 ± 0.935 | 0.977 ± 0.472 | 3.255 ± 1.038 | 0.977 ± 0.609 | 2.93 ± 0.827 | 1.953 ± 0.806 | 1.628 ± 0.553 | 2.93 ± 0.967 | 0.977 ± 0.549 | 0.326 ± 0.4 | 1.953 ± 0.922 | 0.0 ± 0.0
Ser | 2.93 ± 1.473 | 0.326 ± 0.248 | 4.557 ± 0.807 | 8.138 ± 1.389 | 2.604 ± 0.946 | 2.93 ± 0.898 | 1.302 ± 0.632 | 3.581 ± 1.101 | 8.138 ± 2.032 | 5.859 ± 1.727 | 1.302 ± 0.703 | 2.604 ± 1.085 | 1.628 ± 0.668 | 1.953 ± 0.806 | 1.953 ± 0.951 | 3.906 ± 1.304 | 4.232 ± 1.165 | 3.906 ± 1.065 | 0.326 ± 0.268 | 1.953 ± 0.524 | 0.0 ± 0.0
Thr | 2.279 ± 0.792 | 0.0 ± 0.0 | 1.953 ± 0.636 | 4.557 ± 1.122 | 1.628 ± 0.622 | 4.232 ± 0.908 | 0.977 ± 0.596 | 5.859 ± 1.358 | 4.557 ± 1.431 | 7.812 ± 0.797 | 1.628 ± 0.561 | 3.906 ± 0.933 | 1.628 ± 0.636 | 3.581 ± 1.307 | 3.906 ± 1.46 | 4.557 ± 1.438 | 4.232 ± 1.155 | 6.185 ± 1.044 | 0.651 ± 0.46 | 2.604 ± 1.333 | 0.0 ± 0.0
Val | 1.953 ± 0.861 | 0.326 ± 0.273 | 5.208 ± 1.274 | 3.906 ± 0.915 | 2.279 ± 0.614 | 0.977 ± 0.492 | 0.0 ± 0.0 | 4.883 ± 0.919 | 4.883 ± 1.219 | 5.534 ± 1.073 | 0.326 ± 0.342 | 4.232 ± 0.907 | 0.977 ± 0.477 | 2.279 ± 0.81 | 1.953 ± 0.849 | 4.883 ± 1.139 | 2.93 ± 0.854 | 2.604 ± 0.938 | 0.326 ± 0.409 | 2.279 ± 0.806 | 0.0 ± 0.0
Trp | 1.628 ± 0.484 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.953 ± 0.833 | 0.0 ± 0.0 | 0.651 ± 0.484 | 0.0 ± 0.0 | 0.326 ± 0.4 | 0.651 ± 0.373 | 0.977 ± 0.538 | 0.326 ± 0.333 | 0.326 ± 0.268 | 0.0 ± 0.0 | 0.326 ± 0.279 | 0.326 ± 0.409 | 0.326 ± 0.279 | 0.0 ± 0.0 | 0.326 ± 0.409 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.651 ± 0.536 | 0.326 ± 0.409 | 2.279 ± 0.739 | 3.906 ± 1.145 | 2.279 ± 0.815 | 1.302 ± 0.544 | 0.0 ± 0.0 | 4.232 ± 1.058 | 4.883 ± 1.461 | 8.138 ± 1.529 | 0.977 ± 0.693 | 1.953 ± 0.729 | 0.977 ± 0.659 | 1.302 ± 0.492 | 3.581 ± 1.441 | 2.279 ± 1.003 | 1.953 ± 0.67 | 0.651 ± 0.411 | 0.651 ± 0.375 | 0.977 ± 0.465 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 20 proteins (3073 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
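Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the replicate values is reported as the error. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the helper names `pair_frequencies` and `bootstrap_error`, the use of the sample standard deviation as the reported error, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequencies(proteins):
    """Dipeptide frequencies in per mille (overlapping pairs within each protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequencies(sample).get(pair, 0.0))
    return statistics.stdev(values)

toy = ["MKKLE", "MKEL", "AAKE"]  # made-up sequences, not taken from the proteome
mk_error = bootstrap_error(toy, "MK")
ww_error = bootstrap_error(toy, "WW")  # pair absent from every protein
```

A dipeptide that occurs in no protein is zero in every replicate, so its estimated error is also zero, which matches the 0.0 ± 0.0 entries in the table.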

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski