Amino acid dipeptide frequency for Streptococcus satellite phage Javan428

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and under-represented ones are marked in blue in the table.
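As an illustration of the counting described above, here is a minimal Python sketch. It assumes sequences are given in one-letter code (the table uses three-letter names), counts overlapping residue pairs within each protein, and normalizes by the total number of pairs; the function names and the toy sequences are hypothetical, and the database's own normalization may differ.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are residues i and i+1; a pair never spans two proteins (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq_permille, expected=UNIFORM_PERMILLE):
    """Classify a frequency against the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Toy input, not taken from the Javan428 proteome.
toy = ["MKKLAE", "MAEK"]
freqs = dipeptide_permille(toy)
```

With this threshold, a table value such as 4.735 would be labelled "over" and 0.316 "under".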

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
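The unit conversion can be sketched in a few lines of Python; the helper names are hypothetical and only illustrate the arithmetic.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


# The uniform expectation of 2.5 per mille is the 0.25 % quoted in the text.
uniform_percent = permille_to_percent(2.5)
# A table entry of 4.735 per mille corresponds to a fraction of about 0.004735.
example_fraction = permille_to_fraction(4.735)
```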

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.316 ± 0.372
AlaCys: 0.0 ± 0.0
AlaAsp: 4.735 ± 1.577
AlaGlu: 3.472 ± 0.921
AlaPhe: 3.157 ± 0.959
AlaGly: 1.894 ± 0.52
AlaHis: 1.263 ± 0.773
AlaIle: 6.944 ± 1.457
AlaLys: 4.419 ± 0.847
AlaLeu: 4.735 ± 0.983
AlaMet: 0.947 ± 0.529
AlaAsn: 2.525 ± 1.024
AlaPro: 1.578 ± 0.644
AlaGln: 1.894 ± 0.653
AlaArg: 1.894 ± 0.675
AlaSer: 3.788 ± 0.909
AlaThr: 3.472 ± 1.162
AlaVal: 3.788 ± 0.989
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.525 ± 0.845
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.316 ± 0.301
CysCys: 0.0 ± 0.0
CysAsp: 0.316 ± 0.32
CysGlu: 0.316 ± 0.278
CysPhe: 0.631 ± 0.374
CysGly: 0.316 ± 0.329
CysHis: 0.316 ± 0.278
CysIle: 0.0 ± 0.0
CysLys: 0.316 ± 0.25
CysLeu: 0.631 ± 0.406
CysMet: 0.316 ± 0.336
CysAsn: 0.316 ± 0.31
CysPro: 0.316 ± 0.278
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.894 ± 0.517
AspCys: 0.316 ± 0.278
AspAsp: 5.051 ± 1.063
AspGlu: 5.682 ± 1.707
AspPhe: 3.157 ± 0.929
AspGly: 2.525 ± 0.804
AspHis: 0.0 ± 0.0
AspIle: 6.629 ± 1.07
AspLys: 4.104 ± 1.337
AspLeu: 7.891 ± 1.863
AspMet: 0.947 ± 0.568
AspAsn: 4.104 ± 0.905
AspPro: 0.631 ± 0.398
AspGln: 0.947 ± 0.485
AspArg: 2.841 ± 0.908
AspSer: 2.525 ± 1.009
AspThr: 3.788 ± 1.105
AspVal: 1.578 ± 0.469
AspTrp: 0.0 ± 0.0
AspTyr: 4.104 ± 1.153
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.104 ± 1.121
GluCys: 0.631 ± 0.374
GluAsp: 3.788 ± 1.207
GluGlu: 7.26 ± 2.19
GluPhe: 2.525 ± 0.796
GluGly: 2.841 ± 0.713
GluHis: 1.263 ± 0.597
GluIle: 8.523 ± 2.141
GluLys: 7.891 ± 1.44
GluLeu: 10.417 ± 1.502
GluMet: 3.472 ± 1.216
GluAsn: 6.313 ± 1.334
GluPro: 2.841 ± 0.977
GluGln: 4.419 ± 1.182
GluArg: 4.735 ± 1.211
GluSer: 3.788 ± 1.331
GluThr: 4.735 ± 0.924
GluVal: 7.26 ± 1.635
GluTrp: 0.316 ± 0.325
GluTyr: 4.104 ± 1.395
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.316 ± 0.32
PheAsp: 2.841 ± 1.122
PheGlu: 4.104 ± 1.233
PhePhe: 0.316 ± 0.25
PheGly: 0.947 ± 0.514
PheHis: 1.263 ± 0.46
PheIle: 2.525 ± 0.719
PheLys: 4.419 ± 1.323
PheLeu: 3.472 ± 0.99
PheMet: 0.947 ± 0.562
PheAsn: 1.894 ± 0.743
PhePro: 1.263 ± 0.767
PheGln: 2.21 ± 0.789
PheArg: 0.631 ± 0.385
PheSer: 3.788 ± 0.865
PheThr: 1.894 ± 0.861
PheVal: 2.21 ± 0.97
PheTrp: 0.947 ± 0.489
PheTyr: 2.525 ± 0.735
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.578 ± 0.865
GlyCys: 0.0 ± 0.0
GlyAsp: 2.21 ± 0.759
GlyGlu: 3.788 ± 0.875
GlyPhe: 2.21 ± 0.808
GlyGly: 1.894 ± 0.855
GlyHis: 0.631 ± 0.354
GlyIle: 3.788 ± 0.992
GlyLys: 5.366 ± 1.06
GlyLeu: 6.313 ± 1.677
GlyMet: 1.263 ± 0.565
GlyAsn: 2.21 ± 0.82
GlyPro: 0.0 ± 0.0
GlyGln: 2.21 ± 0.606
GlyArg: 4.104 ± 0.978
GlySer: 2.21 ± 0.782
GlyThr: 2.21 ± 0.685
GlyVal: 4.419 ± 1.263
GlyTrp: 1.263 ± 0.668
GlyTyr: 3.472 ± 0.885
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.263 ± 0.731
HisCys: 0.0 ± 0.0
HisAsp: 0.316 ± 0.291
HisGlu: 2.21 ± 0.795
HisPhe: 0.631 ± 0.532
HisGly: 0.631 ± 0.423
HisHis: 0.0 ± 0.0
HisIle: 0.631 ± 0.322
HisLys: 0.947 ± 0.447
HisLeu: 0.631 ± 0.556
HisMet: 0.0 ± 0.325
HisAsn: 0.631 ± 0.377
HisPro: 1.263 ± 0.531
HisGln: 0.316 ± 0.291
HisArg: 1.263 ± 0.502
HisSer: 0.631 ± 0.366
HisThr: 0.631 ± 0.556
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.947 ± 0.653
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.841 ± 0.999
IleCys: 0.316 ± 0.31
IleAsp: 2.841 ± 1.019
IleGlu: 5.997 ± 1.345
IlePhe: 3.157 ± 0.828
IleGly: 4.735 ± 1.317
IleHis: 0.631 ± 0.322
IleIle: 5.051 ± 1.211
IleLys: 6.629 ± 1.407
IleLeu: 7.26 ± 1.242
IleMet: 0.947 ± 0.415
IleAsn: 3.157 ± 1.016
IlePro: 2.525 ± 0.846
IleGln: 3.788 ± 0.811
IleArg: 3.472 ± 0.816
IleSer: 5.997 ± 1.625
IleThr: 3.788 ± 0.804
IleVal: 2.841 ± 0.671
IleTrp: 0.631 ± 0.423
IleTyr: 4.735 ± 1.153
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.207 ± 1.273
LysCys: 0.631 ± 0.378
LysAsp: 5.997 ± 1.311
LysGlu: 9.785 ± 1.708
LysPhe: 1.578 ± 0.832
LysGly: 5.682 ± 1.444
LysHis: 0.947 ± 0.511
LysIle: 5.682 ± 1.461
LysLys: 8.207 ± 1.948
LysLeu: 7.891 ± 1.545
LysMet: 1.578 ± 0.666
LysAsn: 7.26 ± 1.37
LysPro: 2.21 ± 0.905
LysGln: 3.788 ± 1.114
LysArg: 5.366 ± 1.343
LysSer: 6.944 ± 1.202
LysThr: 5.682 ± 1.495
LysVal: 3.157 ± 0.852
LysTrp: 0.947 ± 0.42
LysTyr: 1.894 ± 0.585
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.048 ± 1.514
LeuCys: 0.0 ± 0.0
LeuAsp: 7.26 ± 1.694
LeuGlu: 10.101 ± 1.405
LeuPhe: 4.735 ± 1.32
LeuGly: 5.051 ± 0.803
LeuHis: 1.263 ± 0.762
LeuIle: 3.472 ± 0.974
LeuLys: 8.207 ± 2.127
LeuLeu: 8.838 ± 1.277
LeuMet: 2.525 ± 0.856
LeuAsn: 6.313 ± 1.436
LeuPro: 4.104 ± 0.998
LeuGln: 4.419 ± 0.99
LeuArg: 4.419 ± 1.12
LeuSer: 4.104 ± 1.244
LeuThr: 7.26 ± 1.897
LeuVal: 4.735 ± 1.033
LeuTrp: 0.316 ± 0.325
LeuTyr: 3.788 ± 1.103
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.263 ± 0.716
MetCys: 0.0 ± 0.0
MetAsp: 2.525 ± 1.036
MetGlu: 1.578 ± 0.705
MetPhe: 1.263 ± 0.608
MetGly: 1.578 ± 0.667
MetHis: 0.0 ± 0.0
MetIle: 1.263 ± 0.765
MetLys: 2.841 ± 0.989
MetLeu: 0.947 ± 0.545
MetMet: 0.316 ± 0.325
MetAsn: 1.894 ± 0.716
MetPro: 0.316 ± 0.335
MetGln: 0.316 ± 0.372
MetArg: 0.947 ± 0.528
MetSer: 0.631 ± 0.45
MetThr: 3.157 ± 0.977
MetVal: 0.631 ± 0.474
MetTrp: 0.0 ± 0.0
MetTyr: 0.316 ± 0.291
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.157 ± 0.706
AsnCys: 0.631 ± 0.556
AsnAsp: 3.788 ± 1.116
AsnGlu: 2.841 ± 0.647
AsnPhe: 1.263 ± 0.734
AsnGly: 5.997 ± 1.246
AsnHis: 1.263 ± 0.797
AsnIle: 1.894 ± 0.772
AsnLys: 5.051 ± 1.315
AsnLeu: 2.525 ± 0.881
AsnMet: 1.578 ± 0.704
AsnAsn: 0.316 ± 0.25
AsnPro: 1.894 ± 0.611
AsnGln: 2.841 ± 0.612
AsnArg: 1.578 ± 0.575
AsnSer: 3.472 ± 1.266
AsnThr: 4.104 ± 1.189
AsnVal: 2.525 ± 0.762
AsnTrp: 0.947 ± 0.418
AsnTyr: 3.157 ± 1.147
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.894 ± 0.666
ProCys: 0.0 ± 0.0
ProAsp: 3.157 ± 0.866
ProGlu: 3.157 ± 0.947
ProPhe: 1.578 ± 0.606
ProGly: 0.316 ± 0.292
ProHis: 0.316 ± 0.25
ProIle: 1.263 ± 0.58
ProLys: 2.21 ± 0.863
ProLeu: 1.263 ± 0.805
ProMet: 0.316 ± 0.325
ProAsn: 0.947 ± 0.534
ProPro: 1.263 ± 0.813
ProGln: 0.631 ± 0.374
ProArg: 2.21 ± 0.783
ProSer: 2.525 ± 1.112
ProThr: 2.525 ± 0.797
ProVal: 1.263 ± 0.615
ProTrp: 0.0 ± 0.0
ProTyr: 2.525 ± 0.759
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.788 ± 0.948
GlnCys: 0.0 ± 0.0
GlnAsp: 1.578 ± 0.632
GlnGlu: 4.104 ± 1.332
GlnPhe: 0.631 ± 0.583
GlnGly: 2.841 ± 0.862
GlnHis: 0.631 ± 0.322
GlnIle: 2.841 ± 0.833
GlnLys: 3.157 ± 0.743
GlnLeu: 5.997 ± 1.267
GlnMet: 1.263 ± 0.883
GlnAsn: 2.525 ± 1.133
GlnPro: 0.631 ± 0.423
GlnGln: 2.525 ± 0.668
GlnArg: 2.525 ± 1.095
GlnSer: 1.263 ± 0.601
GlnThr: 1.578 ± 0.615
GlnVal: 4.104 ± 1.216
GlnTrp: 0.631 ± 0.366
GlnTyr: 0.631 ± 0.528
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.263 ± 0.495
ArgCys: 0.316 ± 0.329
ArgAsp: 1.894 ± 0.828
ArgGlu: 7.26 ± 1.415
ArgPhe: 1.578 ± 0.873
ArgGly: 1.263 ± 0.501
ArgHis: 0.947 ± 0.581
ArgIle: 3.788 ± 1.267
ArgLys: 5.682 ± 1.248
ArgLeu: 6.629 ± 1.525
ArgMet: 2.21 ± 0.809
ArgAsn: 1.263 ± 0.517
ArgPro: 1.578 ± 0.797
ArgGln: 3.472 ± 0.879
ArgArg: 2.525 ± 0.917
ArgSer: 1.894 ± 0.585
ArgThr: 3.472 ± 0.802
ArgVal: 2.525 ± 0.91
ArgTrp: 0.316 ± 0.383
ArgTyr: 2.525 ± 1.061
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.21 ± 0.872
SerCys: 0.0 ± 0.0
SerAsp: 3.157 ± 1.106
SerGlu: 7.891 ± 1.853
SerPhe: 1.263 ± 0.502
SerGly: 2.525 ± 1.12
SerHis: 0.631 ± 0.417
SerIle: 5.366 ± 0.986
SerLys: 5.997 ± 1.167
SerLeu: 5.682 ± 1.318
SerMet: 0.631 ± 0.368
SerAsn: 3.157 ± 1.141
SerPro: 2.21 ± 0.649
SerGln: 2.525 ± 0.676
SerArg: 2.841 ± 0.879
SerSer: 4.104 ± 0.912
SerThr: 2.841 ± 0.952
SerVal: 1.578 ± 0.579
SerTrp: 0.631 ± 0.347
SerTyr: 2.525 ± 0.725
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.472 ± 1.077
ThrCys: 0.316 ± 0.336
ThrAsp: 1.578 ± 0.754
ThrGlu: 3.472 ± 0.96
ThrPhe: 2.841 ± 0.873
ThrGly: 4.104 ± 0.787
ThrHis: 0.631 ± 0.366
ThrIle: 6.313 ± 1.674
ThrLys: 5.997 ± 1.411
ThrLeu: 5.997 ± 1.271
ThrMet: 0.631 ± 0.386
ThrAsn: 0.631 ± 0.472
ThrPro: 3.157 ± 0.933
ThrGln: 3.157 ± 1.233
ThrArg: 2.841 ± 1.044
ThrSer: 3.157 ± 0.797
ThrThr: 2.21 ± 0.763
ThrVal: 6.629 ± 1.396
ThrTrp: 0.316 ± 0.319
ThrTyr: 3.157 ± 1.611
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.841 ± 1.135
ValCys: 0.316 ± 0.25
ValAsp: 3.472 ± 1.405
ValGlu: 3.472 ± 1.297
ValPhe: 2.525 ± 0.658
ValGly: 4.104 ± 1.177
ValHis: 0.0 ± 0.0
ValIle: 3.157 ± 1.097
ValLys: 5.366 ± 1.348
ValLeu: 5.997 ± 1.69
ValMet: 0.631 ± 0.438
ValAsn: 2.841 ± 0.832
ValPro: 0.316 ± 0.278
ValGln: 2.841 ± 0.707
ValArg: 3.472 ± 0.993
ValSer: 3.472 ± 0.883
ValThr: 3.788 ± 0.857
ValVal: 3.157 ± 1.268
ValTrp: 0.947 ± 0.507
ValTyr: 1.263 ± 0.69
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.631 ± 0.366
TrpCys: 0.0 ± 0.0
TrpAsp: 0.316 ± 0.32
TrpGlu: 1.263 ± 0.718
TrpPhe: 0.0 ± 0.0
TrpGly: 0.316 ± 0.325
TrpHis: 0.0 ± 0.0
TrpIle: 0.631 ± 0.394
TrpLys: 1.578 ± 0.643
TrpLeu: 1.578 ± 0.743
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.631 ± 0.476
TrpSer: 0.947 ± 0.446
TrpThr: 0.316 ± 0.325
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.263 ± 0.666
TyrCys: 0.316 ± 0.278
TyrAsp: 1.894 ± 0.726
TyrGlu: 3.472 ± 0.879
TyrPhe: 3.472 ± 1.184
TyrGly: 1.578 ± 0.889
TyrHis: 1.263 ± 0.538
TyrIle: 1.578 ± 0.795
TyrLys: 5.051 ± 1.666
TyrLeu: 7.26 ± 1.664
TyrMet: 0.947 ± 0.597
TyrAsn: 2.525 ± 0.599
TyrPro: 1.263 ± 0.472
TyrGln: 0.947 ± 0.519
TyrArg: 4.104 ± 1.166
TyrSer: 2.525 ± 0.766
TyrThr: 3.157 ± 0.928
TyrVal: 1.578 ± 0.746
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.947 ± 0.477
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (3169 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
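One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of those estimates is reported as the error. This is an assumption about the procedure rather than the database's actual code; the function names, the use of the sample standard deviation, the fixed seed and the toy sequences (one-letter code) are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not taken from the Javan428 proteome.
toy = ["MKKLAE", "MAEK", "MLLKAE"]
err_present = bootstrap_error(toy, "AE")
err_absent = bootstrap_error(toy, "WW")
```

Under this scheme, a dipeptide that occurs in no protein has a frequency of 0 in every replicate, so its estimated error is exactly 0.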

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski