Amino acid dipepetide frequency for Streptococcus satellite phage Javan276

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.334AlaAla: 0.334 ± 0.291
0.334AlaCys: 0.334 ± 0.325
2.338AlaAsp: 2.338 ± 0.956
6.012AlaGlu: 6.012 ± 1.517
0.668AlaPhe: 0.668 ± 0.45
0.334AlaGly: 0.334 ± 0.31
0.334AlaHis: 0.334 ± 0.397
2.672AlaIle: 2.672 ± 0.777
5.344AlaLys: 5.344 ± 1.45
2.338AlaLeu: 2.338 ± 0.758
0.334AlaMet: 0.334 ± 0.308
3.34AlaAsn: 3.34 ± 0.923
0.334AlaPro: 0.334 ± 0.337
1.336AlaGln: 1.336 ± 0.582
3.006AlaArg: 3.006 ± 1.139
2.004AlaSer: 2.004 ± 0.789
2.004AlaThr: 2.004 ± 1.093
2.004AlaVal: 2.004 ± 0.644
1.002AlaTrp: 1.002 ± 0.569
2.338AlaTyr: 2.338 ± 0.725
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.334CysPhe: 0.334 ± 0.291
0.668CysGly: 0.668 ± 0.379
0.0CysHis: 0.0 ± 0.0
0.668CysIle: 0.668 ± 0.457
1.002CysLys: 1.002 ± 0.859
1.336CysLeu: 1.336 ± 0.532
0.0CysMet: 0.0 ± 0.0
0.334CysAsn: 0.334 ± 0.286
0.0CysPro: 0.0 ± 0.0
0.668CysGln: 0.668 ± 0.524
1.336CysArg: 1.336 ± 0.651
0.668CysSer: 0.668 ± 0.48
0.0CysThr: 0.0 ± 0.0
0.668CysVal: 0.668 ± 0.65
0.334CysTrp: 0.334 ± 0.325
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.004AspAla: 2.004 ± 0.883
1.002AspCys: 1.002 ± 0.575
3.34AspAsp: 3.34 ± 1.258
5.678AspGlu: 5.678 ± 1.473
4.008AspPhe: 4.008 ± 1.277
0.668AspGly: 0.668 ± 0.472
0.668AspHis: 0.668 ± 0.443
6.68AspIle: 6.68 ± 1.278
6.346AspLys: 6.346 ± 1.574
8.016AspLeu: 8.016 ± 1.294
1.67AspMet: 1.67 ± 0.946
4.342AspAsn: 4.342 ± 1.004
2.004AspPro: 2.004 ± 0.759
1.002AspGln: 1.002 ± 0.556
2.004AspArg: 2.004 ± 0.617
2.672AspSer: 2.672 ± 0.749
1.67AspThr: 1.67 ± 0.697
2.672AspVal: 2.672 ± 0.83
0.334AspTrp: 0.334 ± 0.364
5.344AspTyr: 5.344 ± 1.157
0.0AspXaa: 0.0 ± 0.0
Glu
5.344GluAla: 5.344 ± 1.47
1.002GluCys: 1.002 ± 0.587
4.676GluAsp: 4.676 ± 1.343
5.678GluGlu: 5.678 ± 1.399
6.346GluPhe: 6.346 ± 1.635
1.336GluGly: 1.336 ± 0.605
1.336GluHis: 1.336 ± 0.784
7.014GluIle: 7.014 ± 1.3
10.688GluLys: 10.688 ± 1.974
11.022GluLeu: 11.022 ± 2.169
2.004GluMet: 2.004 ± 0.857
4.008GluAsn: 4.008 ± 1.126
0.668GluPro: 0.668 ± 0.624
2.338GluGln: 2.338 ± 1.006
4.342GluArg: 4.342 ± 1.567
5.01GluSer: 5.01 ± 1.419
3.674GluThr: 3.674 ± 1.017
3.674GluVal: 3.674 ± 1.43
1.336GluTrp: 1.336 ± 0.55
2.672GluTyr: 2.672 ± 0.737
0.0GluXaa: 0.0 ± 0.0
Phe
1.336PheAla: 1.336 ± 0.517
0.334PheCys: 0.334 ± 0.308
3.674PheAsp: 3.674 ± 0.726
2.004PheGlu: 2.004 ± 0.717
1.336PhePhe: 1.336 ± 0.721
3.34PheGly: 3.34 ± 1.047
1.336PheHis: 1.336 ± 0.721
5.344PheIle: 5.344 ± 1.481
4.342PheLys: 4.342 ± 1.242
3.674PheLeu: 3.674 ± 1.154
1.002PheMet: 1.002 ± 0.599
4.342PheAsn: 4.342 ± 0.903
1.67PhePro: 1.67 ± 0.797
1.336PheGln: 1.336 ± 0.45
1.67PheArg: 1.67 ± 0.882
3.006PheSer: 3.006 ± 1.243
2.338PheThr: 2.338 ± 0.943
4.008PheVal: 4.008 ± 0.971
0.334PheTrp: 0.334 ± 0.325
3.674PheTyr: 3.674 ± 1.101
0.0PheXaa: 0.0 ± 0.0
Gly
1.336GlyAla: 1.336 ± 0.566
0.334GlyCys: 0.334 ± 0.325
0.668GlyAsp: 0.668 ± 0.398
2.338GlyGlu: 2.338 ± 0.898
2.004GlyPhe: 2.004 ± 0.646
1.002GlyGly: 1.002 ± 0.739
1.336GlyHis: 1.336 ± 0.648
3.674GlyIle: 3.674 ± 0.942
4.342GlyLys: 4.342 ± 1.179
2.338GlyLeu: 2.338 ± 0.781
1.67GlyMet: 1.67 ± 0.673
3.674GlyAsn: 3.674 ± 0.991
0.334GlyPro: 0.334 ± 0.333
1.67GlyGln: 1.67 ± 0.69
0.668GlyArg: 0.668 ± 0.442
1.336GlySer: 1.336 ± 0.781
1.67GlyThr: 1.67 ± 0.695
2.004GlyVal: 2.004 ± 0.932
0.668GlyTrp: 0.668 ± 0.595
3.34GlyTyr: 3.34 ± 1.066
0.0GlyXaa: 0.0 ± 0.0
His
0.334HisAla: 0.334 ± 0.325
0.0HisCys: 0.0 ± 0.0
0.668HisAsp: 0.668 ± 0.379
1.002HisGlu: 1.002 ± 0.495
1.002HisPhe: 1.002 ± 0.513
1.336HisGly: 1.336 ± 0.712
0.334HisHis: 0.334 ± 0.291
2.672HisIle: 2.672 ± 0.854
1.002HisLys: 1.002 ± 0.48
1.67HisLeu: 1.67 ± 0.683
0.0HisMet: 0.0 ± 0.0
1.67HisAsn: 1.67 ± 0.72
0.334HisPro: 0.334 ± 0.286
1.002HisGln: 1.002 ± 0.509
0.668HisArg: 0.668 ± 0.386
1.67HisSer: 1.67 ± 0.968
1.67HisThr: 1.67 ± 0.805
0.668HisVal: 0.668 ± 0.567
0.0HisTrp: 0.0 ± 0.0
1.67HisTyr: 1.67 ± 0.863
0.0HisXaa: 0.0 ± 0.0
Ile
6.012IleAla: 6.012 ± 1.394
1.67IleCys: 1.67 ± 1.157
7.348IleAsp: 7.348 ± 1.507
8.684IleGlu: 8.684 ± 1.592
4.342IlePhe: 4.342 ± 1.462
3.006IleGly: 3.006 ± 1.046
1.336IleHis: 1.336 ± 0.928
10.02IleIle: 10.02 ± 2.254
6.68IleLys: 6.68 ± 1.44
8.016IleLeu: 8.016 ± 1.885
1.336IleMet: 1.336 ± 0.568
4.342IleAsn: 4.342 ± 1.228
2.338IlePro: 2.338 ± 0.753
2.672IleGln: 2.672 ± 0.944
3.674IleArg: 3.674 ± 1.198
7.682IleSer: 7.682 ± 1.716
3.006IleThr: 3.006 ± 1.029
3.006IleVal: 3.006 ± 0.877
0.0IleTrp: 0.0 ± 0.0
2.004IleTyr: 2.004 ± 0.792
0.0IleXaa: 0.0 ± 0.0
Lys
4.676LysAla: 4.676 ± 1.047
0.334LysCys: 0.334 ± 0.337
7.014LysAsp: 7.014 ± 1.59
10.354LysGlu: 10.354 ± 2.834
5.678LysPhe: 5.678 ± 1.791
3.674LysGly: 3.674 ± 1.184
2.338LysHis: 2.338 ± 0.796
8.684LysIle: 8.684 ± 1.421
11.69LysLys: 11.69 ± 1.913
9.686LysLeu: 9.686 ± 2.27
2.338LysMet: 2.338 ± 0.644
7.014LysAsn: 7.014 ± 1.383
2.004LysPro: 2.004 ± 0.531
6.012LysGln: 6.012 ± 1.354
4.008LysArg: 4.008 ± 0.98
5.344LysSer: 5.344 ± 1.135
6.012LysThr: 6.012 ± 1.754
7.014LysVal: 7.014 ± 1.248
0.334LysTrp: 0.334 ± 0.383
3.006LysTyr: 3.006 ± 0.734
0.0LysXaa: 0.0 ± 0.0
Leu
3.674LeuAla: 3.674 ± 1.06
0.668LeuCys: 0.668 ± 0.405
8.684LeuAsp: 8.684 ± 1.83
8.684LeuGlu: 8.684 ± 1.784
2.672LeuPhe: 2.672 ± 0.97
2.672LeuGly: 2.672 ± 1.352
1.336LeuHis: 1.336 ± 0.56
8.35LeuIle: 8.35 ± 1.184
9.686LeuLys: 9.686 ± 1.418
6.346LeuLeu: 6.346 ± 2.226
2.338LeuMet: 2.338 ± 1.06
8.684LeuAsn: 8.684 ± 1.656
1.002LeuPro: 1.002 ± 0.595
4.342LeuGln: 4.342 ± 1.365
3.34LeuArg: 3.34 ± 1.044
7.348LeuSer: 7.348 ± 1.422
4.342LeuThr: 4.342 ± 0.985
3.34LeuVal: 3.34 ± 0.665
1.002LeuTrp: 1.002 ± 0.937
4.676LeuTyr: 4.676 ± 1.092
0.0LeuXaa: 0.0 ± 0.0
Met
2.338MetAla: 2.338 ± 0.782
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.002MetGlu: 1.002 ± 0.582
1.002MetPhe: 1.002 ± 0.485
1.336MetGly: 1.336 ± 0.661
0.0MetHis: 0.0 ± 0.0
1.336MetIle: 1.336 ± 0.63
3.006MetLys: 3.006 ± 0.934
2.338MetLeu: 2.338 ± 0.831
1.002MetMet: 1.002 ± 1.092
1.67MetAsn: 1.67 ± 0.623
0.334MetPro: 0.334 ± 0.286
0.334MetGln: 0.334 ± 0.364
1.336MetArg: 1.336 ± 0.471
1.002MetSer: 1.002 ± 0.53
2.004MetThr: 2.004 ± 1.017
2.004MetVal: 2.004 ± 1.008
0.0MetTrp: 0.0 ± 0.0
0.668MetTyr: 0.668 ± 0.487
0.0MetXaa: 0.0 ± 0.0
Asn
0.668AsnAla: 0.668 ± 0.435
0.334AsnCys: 0.334 ± 0.337
6.346AsnAsp: 6.346 ± 1.22
4.342AsnGlu: 4.342 ± 0.978
4.676AsnPhe: 4.676 ± 1.407
3.006AsnGly: 3.006 ± 1.138
2.672AsnHis: 2.672 ± 1.158
4.676AsnIle: 4.676 ± 1.398
10.02AsnLys: 10.02 ± 1.328
7.014AsnLeu: 7.014 ± 0.916
1.67AsnMet: 1.67 ± 0.787
5.01AsnAsn: 5.01 ± 1.011
1.336AsnPro: 1.336 ± 0.584
3.674AsnGln: 3.674 ± 0.733
2.338AsnArg: 2.338 ± 0.883
6.68AsnSer: 6.68 ± 2.08
3.006AsnThr: 3.006 ± 0.647
1.67AsnVal: 1.67 ± 0.711
1.002AsnTrp: 1.002 ± 0.483
2.672AsnTyr: 2.672 ± 0.986
0.0AsnXaa: 0.0 ± 0.0
Pro
0.334ProAla: 0.334 ± 0.364
0.0ProCys: 0.0 ± 0.0
1.002ProAsp: 1.002 ± 0.641
2.338ProGlu: 2.338 ± 0.76
0.334ProPhe: 0.334 ± 0.451
0.0ProGly: 0.0 ± 0.0
0.334ProHis: 0.334 ± 0.286
2.004ProIle: 2.004 ± 1.045
3.674ProLys: 3.674 ± 0.954
1.67ProLeu: 1.67 ± 0.762
0.334ProMet: 0.334 ± 0.364
2.004ProAsn: 2.004 ± 0.709
0.668ProPro: 0.668 ± 0.519
0.334ProGln: 0.334 ± 0.334
0.334ProArg: 0.334 ± 0.382
2.338ProSer: 2.338 ± 0.979
1.67ProThr: 1.67 ± 1.136
0.334ProVal: 0.334 ± 0.312
0.0ProTrp: 0.0 ± 0.0
1.336ProTyr: 1.336 ± 0.537
0.0ProXaa: 0.0 ± 0.0
Gln
1.002GlnAla: 1.002 ± 0.696
0.0GlnCys: 0.0 ± 0.0
2.338GlnAsp: 2.338 ± 0.711
3.34GlnGlu: 3.34 ± 1.052
2.338GlnPhe: 2.338 ± 0.675
0.334GlnGly: 0.334 ± 0.325
1.336GlnHis: 1.336 ± 0.628
3.006GlnIle: 3.006 ± 0.833
4.342GlnLys: 4.342 ± 1.373
2.338GlnLeu: 2.338 ± 1.147
1.002GlnMet: 1.002 ± 0.659
2.672GlnAsn: 2.672 ± 1.227
0.0GlnPro: 0.0 ± 0.0
1.002GlnGln: 1.002 ± 0.673
3.006GlnArg: 3.006 ± 1.009
5.678GlnSer: 5.678 ± 1.426
3.006GlnThr: 3.006 ± 1.231
2.672GlnVal: 2.672 ± 0.902
0.668GlnTrp: 0.668 ± 0.41
1.002GlnTyr: 1.002 ± 0.659
0.0GlnXaa: 0.0 ± 0.0
Arg
1.67ArgAla: 1.67 ± 0.644
0.334ArgCys: 0.334 ± 0.325
1.67ArgAsp: 1.67 ± 0.89
2.672ArgGlu: 2.672 ± 1.099
2.338ArgPhe: 2.338 ± 0.553
3.006ArgGly: 3.006 ± 1.244
2.004ArgHis: 2.004 ± 0.724
5.344ArgIle: 5.344 ± 1.54
4.008ArgLys: 4.008 ± 1.335
4.342ArgLeu: 4.342 ± 1.168
0.334ArgMet: 0.334 ± 0.381
2.338ArgAsn: 2.338 ± 0.837
1.336ArgPro: 1.336 ± 0.689
1.67ArgGln: 1.67 ± 0.849
2.338ArgArg: 2.338 ± 1.136
2.004ArgSer: 2.004 ± 0.66
2.338ArgThr: 2.338 ± 1.009
1.67ArgVal: 1.67 ± 0.807
0.668ArgTrp: 0.668 ± 0.464
1.002ArgTyr: 1.002 ± 0.702
0.0ArgXaa: 0.0 ± 0.0
Ser
2.338SerAla: 2.338 ± 0.757
0.0SerCys: 0.0 ± 0.0
4.676SerAsp: 4.676 ± 1.095
8.016SerGlu: 8.016 ± 1.286
2.338SerPhe: 2.338 ± 0.717
3.34SerGly: 3.34 ± 0.972
0.668SerHis: 0.668 ± 0.438
6.012SerIle: 6.012 ± 1.07
7.348SerLys: 7.348 ± 1.344
7.682SerLeu: 7.682 ± 1.266
1.67SerMet: 1.67 ± 0.592
6.012SerAsn: 6.012 ± 1.401
2.672SerPro: 2.672 ± 1.031
3.006SerGln: 3.006 ± 1.048
1.336SerArg: 1.336 ± 0.767
3.34SerSer: 3.34 ± 1.397
1.336SerThr: 1.336 ± 0.791
3.674SerVal: 3.674 ± 0.914
0.334SerTrp: 0.334 ± 0.325
1.002SerTyr: 1.002 ± 0.645
0.0SerXaa: 0.0 ± 0.0
Thr
1.336ThrAla: 1.336 ± 0.689
0.334ThrCys: 0.334 ± 0.334
4.008ThrAsp: 4.008 ± 1.089
3.674ThrGlu: 3.674 ± 1.157
2.004ThrPhe: 2.004 ± 0.933
2.338ThrGly: 2.338 ± 1.031
0.0ThrHis: 0.0 ± 0.0
3.34ThrIle: 3.34 ± 1.133
3.674ThrLys: 3.674 ± 1.575
6.012ThrLeu: 6.012 ± 1.117
1.002ThrMet: 1.002 ± 0.589
4.342ThrAsn: 4.342 ± 1.253
0.668ThrPro: 0.668 ± 0.443
2.672ThrGln: 2.672 ± 0.73
2.004ThrArg: 2.004 ± 0.846
4.008ThrSer: 4.008 ± 1.043
1.336ThrThr: 1.336 ± 0.428
2.338ThrVal: 2.338 ± 0.931
0.334ThrTrp: 0.334 ± 0.312
2.338ThrTyr: 2.338 ± 0.889
0.0ThrXaa: 0.0 ± 0.0
Val
2.004ValAla: 2.004 ± 0.955
0.668ValCys: 0.668 ± 0.415
1.336ValAsp: 1.336 ± 0.535
3.674ValGlu: 3.674 ± 1.135
2.672ValPhe: 2.672 ± 0.852
1.002ValGly: 1.002 ± 0.556
1.002ValHis: 1.002 ± 0.55
3.34ValIle: 3.34 ± 1.05
6.68ValLys: 6.68 ± 1.548
2.672ValLeu: 2.672 ± 0.925
1.336ValMet: 1.336 ± 0.769
2.338ValAsn: 2.338 ± 0.789
1.67ValPro: 1.67 ± 1.025
2.004ValGln: 2.004 ± 0.841
3.674ValArg: 3.674 ± 1.488
3.006ValSer: 3.006 ± 0.924
4.008ValThr: 4.008 ± 0.977
1.336ValVal: 1.336 ± 0.734
0.0ValTrp: 0.0 ± 0.0
1.67ValTyr: 1.67 ± 0.919
0.0ValXaa: 0.0 ± 0.0
Trp
0.334TrpAla: 0.334 ± 0.312
0.334TrpCys: 0.334 ± 0.325
0.334TrpAsp: 0.334 ± 0.451
0.668TrpGlu: 0.668 ± 0.457
0.0TrpPhe: 0.0 ± 0.0
1.336TrpGly: 1.336 ± 0.776
0.334TrpHis: 0.334 ± 0.312
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.668TrpLeu: 0.668 ± 0.435
0.334TrpMet: 0.334 ± 0.364
0.668TrpAsn: 0.668 ± 0.445
0.334TrpPro: 0.334 ± 0.371
1.336TrpGln: 1.336 ± 0.674
0.0TrpArg: 0.0 ± 0.0
0.334TrpSer: 0.334 ± 0.383
0.668TrpThr: 0.668 ± 0.65
0.334TrpVal: 0.334 ± 0.325
1.002TrpTrp: 1.002 ± 0.718
1.002TrpTyr: 1.002 ± 0.763
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.336TyrAla: 1.336 ± 0.512
0.334TyrCys: 0.334 ± 0.337
2.004TyrAsp: 2.004 ± 0.827
4.342TyrGlu: 4.342 ± 1.416
4.008TyrPhe: 4.008 ± 1.348
2.672TyrGly: 2.672 ± 1.258
0.668TyrHis: 0.668 ± 0.443
2.338TyrIle: 2.338 ± 0.743
3.34TyrLys: 3.34 ± 0.951
4.008TyrLeu: 4.008 ± 1.175
1.002TyrMet: 1.002 ± 0.524
4.008TyrAsn: 4.008 ± 0.96
1.336TyrPro: 1.336 ± 0.628
2.672TyrGln: 2.672 ± 0.751
2.004TyrArg: 2.004 ± 0.604
1.67TyrSer: 1.67 ± 0.659
2.004TyrThr: 2.004 ± 0.805
1.002TyrVal: 1.002 ± 0.455
0.668TyrTrp: 0.668 ± 0.617
1.002TyrTyr: 1.002 ± 0.541
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (2995 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski