Amino acid dipeptide frequency for Streptococcus satellite phage Javan379

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
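As an illustration, dipeptide frequencies can be computed by counting overlapping residue pairs. This is a minimal sketch, not the database's own code: the function name dipeptide_permille and the toy sequences are hypothetical, and it assumes that pairs are counted within each protein (never across protein boundaries) and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the Javan379 proteome).
toy = ["MKLLE", "MKUE"]
freqs = dipeptide_permille(toy)
```

In the toy input, the U in the second sequence is treated as Xaa, so the pairs KX and XE are counted rather than dropped.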

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
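The arithmetic behind the expected value can be sketched as follows. The function names are hypothetical, and the simple above/below threshold is an assumption about how the colouring is decided; the page does not state the exact rule.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def mark(observed_permille, expected_permille):
    """Assumed rule: red above the expectation, blue below it."""
    if observed_permille > expected_permille:
        return "red"
    if observed_permille < expected_permille:
        return "blue"
    return "none"

expected_20 = uniform_expectation_permille(20)   # 400 possible dipeptides
expected_21 = uniform_expectation_permille(21)   # 441 when Xaa is included

# Two values from the table: GluLeu (14.569) and AlaAla (0.374).
glu_leu = mark(14.569, expected_20)
ala_ala = mark(0.374, expected_20)
```

With 20 amino acids the expectation is 2.5‰, so GluLeu at 14.569‰ falls well above it and AlaAla at 0.374‰ well below.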

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.374 ± 0.297
AlaCys: 1.121 ± 0.644
AlaAsp: 4.483 ± 1.251
AlaGlu: 4.856 ± 1.252
AlaPhe: 2.988 ± 1.368
AlaGly: 2.988 ± 0.884
AlaHis: 0.747 ± 0.629
AlaIle: 4.483 ± 1.421
AlaLys: 4.856 ± 1.075
AlaLeu: 4.109 ± 0.833
AlaMet: 2.615 ± 1.189
AlaAsn: 3.736 ± 0.792
AlaPro: 2.241 ± 0.719
AlaGln: 1.494 ± 0.64
AlaArg: 3.362 ± 0.821
AlaSer: 1.868 ± 0.907
AlaThr: 2.615 ± 0.769
AlaVal: 2.988 ± 0.917
AlaTrp: 0.747 ± 0.424
AlaTyr: 1.868 ± 0.73
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.374 ± 0.39
CysCys: 0.0 ± 0.0
CysAsp: 0.747 ± 0.456
CysGlu: 0.747 ± 0.504
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.747 ± 0.441
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.747 ± 0.632
CysPro: 0.0 ± 0.0
CysGln: 0.374 ± 0.433
CysArg: 0.374 ± 0.327
CysSer: 0.0 ± 0.0
CysThr: 0.747 ± 0.766
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.374 ± 0.297
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.494 ± 0.887
AspCys: 0.374 ± 0.383
AspAsp: 2.988 ± 1.004
AspGlu: 4.483 ± 1.296
AspPhe: 4.109 ± 1.184
AspGly: 2.241 ± 0.814
AspHis: 0.0 ± 0.0
AspIle: 4.483 ± 1.208
AspLys: 4.856 ± 0.972
AspLeu: 5.603 ± 1.099
AspMet: 1.868 ± 0.551
AspAsn: 2.988 ± 1.18
AspPro: 1.121 ± 0.435
AspGln: 0.374 ± 0.327
AspArg: 2.241 ± 0.776
AspSer: 3.362 ± 1.159
AspThr: 2.615 ± 1.162
AspVal: 2.615 ± 1.047
AspTrp: 0.374 ± 0.316
AspTyr: 7.845 ± 2.357
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.362 ± 0.773
GluCys: 0.374 ± 0.39
GluAsp: 4.856 ± 1.142
GluGlu: 7.471 ± 1.991
GluPhe: 2.988 ± 1.343
GluGly: 1.868 ± 0.884
GluHis: 1.494 ± 0.566
GluIle: 5.23 ± 1.197
GluLys: 4.856 ± 2.01
GluLeu: 14.569 ± 2.042
GluMet: 1.868 ± 0.756
GluAsn: 6.35 ± 2.01
GluPro: 1.494 ± 0.804
GluGln: 4.109 ± 1.948
GluArg: 3.362 ± 1.255
GluSer: 5.977 ± 1.333
GluThr: 3.736 ± 1.084
GluVal: 3.362 ± 1.261
GluTrp: 1.121 ± 0.433
GluTyr: 5.23 ± 1.292
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.494 ± 0.73
PheCys: 0.0 ± 0.0
PheAsp: 2.988 ± 0.916
PheGlu: 3.736 ± 1.107
PhePhe: 1.868 ± 1.013
PheGly: 2.615 ± 1.038
PheHis: 1.121 ± 0.608
PheIle: 2.615 ± 0.973
PheLys: 4.856 ± 1.4
PheLeu: 4.109 ± 1.282
PheMet: 0.747 ± 0.461
PheAsn: 4.109 ± 1.33
PhePro: 0.374 ± 0.322
PheGln: 1.868 ± 0.748
PheArg: 2.241 ± 1.058
PheSer: 2.241 ± 0.775
PheThr: 2.241 ± 0.857
PheVal: 1.121 ± 0.552
PheTrp: 1.121 ± 0.435
PheTyr: 1.494 ± 0.707
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.736 ± 1.75
GlyCys: 0.374 ± 0.327
GlyAsp: 2.988 ± 1.182
GlyGlu: 3.736 ± 1.054
GlyPhe: 1.494 ± 0.902
GlyGly: 1.494 ± 0.566
GlyHis: 0.374 ± 0.327
GlyIle: 4.856 ± 1.259
GlyLys: 3.736 ± 1.055
GlyLeu: 4.109 ± 1.796
GlyMet: 1.121 ± 0.749
GlyAsn: 2.988 ± 0.892
GlyPro: 0.374 ± 0.393
GlyGln: 2.615 ± 0.983
GlyArg: 1.868 ± 0.636
GlySer: 1.868 ± 0.896
GlyThr: 1.868 ± 0.762
GlyVal: 3.736 ± 1.46
GlyTrp: 0.747 ± 0.632
GlyTyr: 2.615 ± 0.977
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.988 ± 0.898
HisCys: 0.0 ± 0.0
HisAsp: 0.747 ± 0.491
HisGlu: 0.747 ± 0.632
HisPhe: 1.868 ± 0.626
HisGly: 1.121 ± 0.59
HisHis: 0.374 ± 0.39
HisIle: 1.868 ± 0.79
HisLys: 0.374 ± 0.327
HisLeu: 0.747 ± 0.441
HisMet: 0.0 ± 0.0
HisAsn: 1.494 ± 0.915
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.121 ± 0.602
HisSer: 0.747 ± 0.424
HisThr: 1.868 ± 0.725
HisVal: 0.747 ± 0.654
HisTrp: 0.0 ± 0.0
HisTyr: 0.747 ± 0.382
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.23 ± 1.828
IleCys: 0.0 ± 0.0
IleAsp: 5.23 ± 1.789
IleGlu: 4.109 ± 1.366
IlePhe: 2.615 ± 0.95
IleGly: 3.362 ± 1.247
IleHis: 0.747 ± 0.41
IleIle: 2.615 ± 1.388
IleLys: 8.218 ± 1.495
IleLeu: 5.977 ± 0.853
IleMet: 1.494 ± 0.57
IleAsn: 4.109 ± 1.524
IlePro: 3.736 ± 1.234
IleGln: 1.868 ± 0.649
IleArg: 1.868 ± 0.69
IleSer: 6.724 ± 1.227
IleThr: 4.856 ± 1.634
IleVal: 3.736 ± 0.955
IleTrp: 0.0 ± 0.0
IleTyr: 2.988 ± 1.044
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.483 ± 1.092
LysCys: 0.747 ± 0.511
LysAsp: 2.615 ± 1.384
LysGlu: 8.218 ± 1.59
LysPhe: 0.747 ± 0.382
LysGly: 4.483 ± 1.357
LysHis: 3.736 ± 0.79
LysIle: 6.35 ± 1.478
LysLys: 4.109 ± 1.402
LysLeu: 5.977 ± 1.445
LysMet: 1.494 ± 1.26
LysAsn: 4.483 ± 1.738
LysPro: 4.856 ± 1.511
LysGln: 5.603 ± 1.483
LysArg: 8.218 ± 1.885
LysSer: 3.736 ± 0.897
LysThr: 8.218 ± 1.948
LysVal: 5.603 ± 1.09
LysTrp: 1.121 ± 0.596
LysTyr: 2.241 ± 0.758
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.35 ± 1.331
LeuCys: 0.747 ± 0.481
LeuAsp: 8.218 ± 2.03
LeuGlu: 11.58 ± 2.567
LeuPhe: 4.483 ± 1.269
LeuGly: 2.988 ± 1.389
LeuHis: 1.494 ± 0.661
LeuIle: 5.603 ± 1.252
LeuLys: 7.097 ± 1.359
LeuLeu: 9.339 ± 2.227
LeuMet: 1.121 ± 0.696
LeuAsn: 6.35 ± 1.531
LeuPro: 3.736 ± 1.125
LeuGln: 3.736 ± 0.707
LeuArg: 2.615 ± 1.241
LeuSer: 7.097 ± 1.483
LeuThr: 7.097 ± 1.383
LeuVal: 3.736 ± 1.364
LeuTrp: 1.121 ± 0.685
LeuTyr: 3.736 ± 0.935
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.615 ± 1.032
MetCys: 0.0 ± 0.0
MetAsp: 1.121 ± 0.54
MetGlu: 2.615 ± 0.946
MetPhe: 0.374 ± 0.482
MetGly: 1.121 ± 0.542
MetHis: 0.0 ± 0.0
MetIle: 1.494 ± 0.628
MetLys: 2.615 ± 0.654
MetLeu: 1.121 ± 0.589
MetMet: 0.0 ± 0.0
MetAsn: 0.747 ± 0.533
MetPro: 0.747 ± 0.411
MetGln: 1.121 ± 0.492
MetArg: 1.494 ± 0.91
MetSer: 1.494 ± 0.687
MetThr: 1.494 ± 0.96
MetVal: 1.494 ± 0.577
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.362 ± 1.04
AsnCys: 0.0 ± 0.0
AsnAsp: 1.868 ± 0.971
AsnGlu: 3.736 ± 0.825
AsnPhe: 4.109 ± 1.55
AsnGly: 5.603 ± 0.76
AsnHis: 1.494 ± 0.764
AsnIle: 5.603 ± 1.378
AsnLys: 4.483 ± 1.099
AsnLeu: 6.35 ± 1.406
AsnMet: 1.121 ± 0.534
AsnAsn: 5.23 ± 1.483
AsnPro: 2.988 ± 0.989
AsnGln: 3.362 ± 0.97
AsnArg: 1.494 ± 0.837
AsnSer: 3.736 ± 1.222
AsnThr: 3.362 ± 1.048
AsnVal: 2.615 ± 1.002
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.241 ± 0.787
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.868 ± 0.626
ProCys: 0.0 ± 0.0
ProAsp: 0.747 ± 0.423
ProGlu: 4.109 ± 1.754
ProPhe: 2.615 ± 1.093
ProGly: 0.0 ± 0.0
ProHis: 0.0 ± 0.0
ProIle: 1.121 ± 0.689
ProLys: 2.988 ± 1.139
ProLeu: 4.109 ± 0.801
ProMet: 0.374 ± 0.409
ProAsn: 3.362 ± 1.196
ProPro: 0.374 ± 0.482
ProGln: 2.241 ± 1.04
ProArg: 2.241 ± 0.682
ProSer: 0.374 ± 0.316
ProThr: 1.494 ± 0.467
ProVal: 1.494 ± 0.882
ProTrp: 0.374 ± 0.316
ProTyr: 1.868 ± 0.467
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.362 ± 1.294
GlnCys: 0.0 ± 0.0
GlnAsp: 3.736 ± 1.49
GlnGlu: 5.977 ± 1.158
GlnPhe: 1.868 ± 0.673
GlnGly: 1.121 ± 0.876
GlnHis: 1.494 ± 0.784
GlnIle: 1.868 ± 0.666
GlnLys: 5.603 ± 0.915
GlnLeu: 2.615 ± 0.81
GlnMet: 0.374 ± 0.389
GlnAsn: 2.241 ± 0.743
GlnPro: 0.747 ± 0.541
GlnGln: 2.241 ± 0.99
GlnArg: 1.868 ± 0.908
GlnSer: 1.121 ± 0.712
GlnThr: 2.988 ± 0.894
GlnVal: 3.362 ± 0.618
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.868 ± 0.683
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.362 ± 0.725
ArgCys: 0.0 ± 0.0
ArgAsp: 1.868 ± 0.781
ArgGlu: 3.736 ± 1.178
ArgPhe: 1.868 ± 0.626
ArgGly: 1.868 ± 1.025
ArgHis: 0.747 ± 0.41
ArgIle: 2.241 ± 0.706
ArgLys: 2.988 ± 1.252
ArgLeu: 5.977 ± 1.585
ArgMet: 0.747 ± 0.609
ArgAsn: 4.109 ± 1.122
ArgPro: 0.747 ± 0.439
ArgGln: 3.362 ± 1.489
ArgArg: 1.868 ± 0.7
ArgSer: 1.868 ± 0.83
ArgThr: 2.241 ± 0.809
ArgVal: 2.615 ± 0.882
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.856 ± 0.927
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.988 ± 0.702
SerCys: 0.374 ± 0.383
SerAsp: 4.483 ± 1.151
SerGlu: 2.988 ± 0.748
SerPhe: 1.494 ± 0.637
SerGly: 3.736 ± 0.701
SerHis: 0.747 ± 0.504
SerIle: 8.218 ± 1.848
SerLys: 4.483 ± 1.396
SerLeu: 6.35 ± 1.264
SerMet: 0.747 ± 0.513
SerAsn: 3.736 ± 1.002
SerPro: 1.121 ± 0.457
SerGln: 1.868 ± 1.112
SerArg: 1.121 ± 0.576
SerSer: 2.615 ± 1.356
SerThr: 4.483 ± 1.444
SerVal: 0.747 ± 0.527
SerTrp: 0.747 ± 0.423
SerTyr: 3.362 ± 0.941
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.241 ± 0.983
ThrCys: 0.0 ± 0.0
ThrAsp: 3.362 ± 1.233
ThrGlu: 4.109 ± 1.152
ThrPhe: 4.483 ± 2.019
ThrGly: 4.109 ± 1.239
ThrHis: 1.494 ± 0.648
ThrIle: 3.736 ± 0.996
ThrLys: 6.724 ± 1.634
ThrLeu: 7.471 ± 1.43
ThrMet: 2.988 ± 1.051
ThrAsn: 2.241 ± 0.808
ThrPro: 2.241 ± 0.809
ThrGln: 1.868 ± 0.807
ThrArg: 4.109 ± 0.702
ThrSer: 4.109 ± 1.231
ThrThr: 4.483 ± 1.364
ThrVal: 3.736 ± 0.918
ThrTrp: 0.374 ± 0.383
ThrTyr: 1.494 ± 0.676
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.615 ± 1.155
ValCys: 0.374 ± 0.316
ValAsp: 0.747 ± 0.461
ValGlu: 1.868 ± 1.021
ValPhe: 1.121 ± 0.56
ValGly: 2.988 ± 1.14
ValHis: 0.747 ± 0.451
ValIle: 3.362 ± 0.904
ValLys: 4.483 ± 1.356
ValLeu: 4.483 ± 1.285
ValMet: 1.121 ± 0.546
ValAsn: 2.615 ± 0.839
ValPro: 2.988 ± 1.199
ValGln: 2.615 ± 1.122
ValArg: 1.494 ± 0.736
ValSer: 3.736 ± 0.932
ValThr: 5.977 ± 1.204
ValVal: 4.483 ± 1.523
ValTrp: 0.0 ± 0.0
ValTyr: 2.988 ± 0.743
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.374 ± 0.327
TrpGlu: 1.121 ± 0.651
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.374 ± 0.316
TrpIle: 0.374 ± 0.316
TrpLys: 0.374 ± 0.327
TrpLeu: 0.747 ± 0.481
TrpMet: 0.0 ± 0.0
TrpAsn: 0.374 ± 0.322
TrpPro: 0.374 ± 0.316
TrpGln: 0.374 ± 0.316
TrpArg: 0.747 ± 0.411
TrpSer: 0.747 ± 0.451
TrpThr: 0.374 ± 0.39
TrpVal: 0.747 ± 0.441
TrpTrp: 0.374 ± 0.327
TrpTyr: 1.121 ± 0.435
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.241 ± 0.818
TyrCys: 0.747 ± 0.456
TyrAsp: 1.868 ± 0.613
TyrGlu: 3.736 ± 1.308
TyrPhe: 1.868 ± 0.683
TyrGly: 2.988 ± 0.871
TyrHis: 0.374 ± 0.316
TyrIle: 3.362 ± 1.233
TyrLys: 8.592 ± 2.11
TyrLeu: 4.483 ± 1.109
TyrMet: 1.494 ± 0.631
TyrAsn: 0.747 ± 0.578
TyrPro: 1.494 ± 1.138
TyrGln: 3.362 ± 1.23
TyrArg: 3.736 ± 1.259
TyrSer: 2.988 ± 0.962
TyrThr: 2.615 ± 0.88
TyrVal: 1.868 ± 0.663
TyrTrp: 0.374 ± 0.322
TyrTyr: 0.747 ± 0.772
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2678 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
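A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's own procedure: whole proteins are resampled with replacement, the statistic is recomputed on each resample, and the standard deviation across 100 rounds is taken as the error. The function names, the toy sequences, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the statistic across protein-level resamples (assumed error measure)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences).
toy = ["MKLLE", "MKKE", "AALLK"]
err_ll = bootstrap_error(toy, "LL")
err_ww = bootstrap_error(toy, "WW")  # never observed, so every resample gives 0
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins. A dipeptide that never occurs gives 0.0 in every resample, which matches the 0.0 ± 0.0 entries in the table.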

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski