Amino acid dipepetide frequency for Streptococcus satellite phage Javan435

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.719AlaAla: 0.719 ± 0.451
0.0AlaCys: 0.0 ± 0.0
2.516AlaAsp: 2.516 ± 0.855
7.908AlaGlu: 7.908 ± 1.767
3.595AlaPhe: 3.595 ± 1.066
2.157AlaGly: 2.157 ± 1.195
0.719AlaHis: 0.719 ± 0.478
4.673AlaIle: 4.673 ± 1.548
5.392AlaLys: 5.392 ± 1.067
6.47AlaLeu: 6.47 ± 1.974
1.797AlaMet: 1.797 ± 0.957
2.876AlaAsn: 2.876 ± 1.182
0.719AlaPro: 0.719 ± 0.503
1.797AlaGln: 1.797 ± 1.019
2.157AlaArg: 2.157 ± 0.711
3.595AlaSer: 3.595 ± 1.224
1.797AlaThr: 1.797 ± 0.791
3.595AlaVal: 3.595 ± 1.108
1.078AlaTrp: 1.078 ± 0.574
1.797AlaTyr: 1.797 ± 1.104
0.0AlaXaa: 0.0 ± 0.0
Cys
0.359CysAla: 0.359 ± 0.295
0.0CysCys: 0.0 ± 0.0
0.719CysAsp: 0.719 ± 0.5
0.359CysGlu: 0.359 ± 0.306
0.0CysPhe: 0.0 ± 0.0
0.719CysGly: 0.719 ± 0.477
0.0CysHis: 0.0 ± 0.0
0.359CysIle: 0.359 ± 0.35
0.359CysLys: 0.359 ± 0.35
0.359CysLeu: 0.359 ± 0.423
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.359CysPro: 0.359 ± 0.306
0.0CysGln: 0.0 ± 0.0
0.359CysArg: 0.359 ± 0.426
0.0CysSer: 0.0 ± 0.0
0.719CysThr: 0.719 ± 0.445
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.359CysTyr: 0.359 ± 0.328
0.0CysXaa: 0.0 ± 0.0
Asp
0.719AspAla: 0.719 ± 0.558
1.078AspCys: 1.078 ± 0.667
2.157AspAsp: 2.157 ± 1.229
5.392AspGlu: 5.392 ± 1.479
4.673AspPhe: 4.673 ± 1.215
3.235AspGly: 3.235 ± 1.005
0.0AspHis: 0.0 ± 0.0
6.47AspIle: 6.47 ± 1.958
7.189AspLys: 7.189 ± 1.679
7.549AspLeu: 7.549 ± 2.354
1.078AspMet: 1.078 ± 0.58
3.595AspAsn: 3.595 ± 1.194
0.0AspPro: 0.0 ± 0.0
0.719AspGln: 0.719 ± 0.469
2.516AspArg: 2.516 ± 0.841
2.876AspSer: 2.876 ± 0.868
2.876AspThr: 2.876 ± 1.16
2.157AspVal: 2.157 ± 0.584
0.0AspTrp: 0.0 ± 0.0
2.876AspTyr: 2.876 ± 0.803
0.0AspXaa: 0.0 ± 0.0
Glu
5.751GluAla: 5.751 ± 1.545
0.719GluCys: 0.719 ± 0.367
6.47GluAsp: 6.47 ± 1.77
6.83GluGlu: 6.83 ± 1.416
2.157GluPhe: 2.157 ± 0.994
3.595GluGly: 3.595 ± 1.266
1.078GluHis: 1.078 ± 0.583
8.986GluIle: 8.986 ± 1.604
6.47GluLys: 6.47 ± 1.613
12.94GluLeu: 12.94 ± 1.558
1.797GluMet: 1.797 ± 0.996
7.908GluAsn: 7.908 ± 1.689
2.876GluPro: 2.876 ± 1.039
4.313GluGln: 4.313 ± 0.952
7.549GluArg: 7.549 ± 1.439
6.47GluSer: 6.47 ± 1.243
2.876GluThr: 2.876 ± 0.696
7.549GluVal: 7.549 ± 2.027
1.438GluTrp: 1.438 ± 0.811
2.157GluTyr: 2.157 ± 0.912
0.0GluXaa: 0.0 ± 0.0
Phe
1.797PheAla: 1.797 ± 0.816
0.359PheCys: 0.359 ± 0.306
1.438PheAsp: 1.438 ± 0.649
5.032PheGlu: 5.032 ± 1.197
1.797PhePhe: 1.797 ± 0.684
3.235PheGly: 3.235 ± 1.3
0.719PheHis: 0.719 ± 0.598
2.157PheIle: 2.157 ± 0.839
3.954PheLys: 3.954 ± 1.029
2.516PheLeu: 2.516 ± 1.123
1.438PheMet: 1.438 ± 0.618
1.438PheAsn: 1.438 ± 1.002
0.719PhePro: 0.719 ± 0.671
1.078PheGln: 1.078 ± 0.567
1.438PheArg: 1.438 ± 0.719
3.235PheSer: 3.235 ± 1.162
1.797PheThr: 1.797 ± 0.847
1.078PheVal: 1.078 ± 0.638
0.359PheTrp: 0.359 ± 0.306
1.797PheTyr: 1.797 ± 0.744
0.0PheXaa: 0.0 ± 0.0
Gly
2.157GlyAla: 2.157 ± 1.128
0.719GlyCys: 0.719 ± 0.469
1.078GlyAsp: 1.078 ± 0.603
2.876GlyGlu: 2.876 ± 0.815
1.438GlyPhe: 1.438 ± 0.971
1.438GlyGly: 1.438 ± 0.697
0.719GlyHis: 0.719 ± 0.506
2.876GlyIle: 2.876 ± 0.809
5.032GlyLys: 5.032 ± 1.287
5.392GlyLeu: 5.392 ± 1.289
1.438GlyMet: 1.438 ± 0.714
1.797GlyAsn: 1.797 ± 0.861
0.0GlyPro: 0.0 ± 0.0
1.078GlyGln: 1.078 ± 0.64
1.438GlyArg: 1.438 ± 0.648
2.516GlySer: 2.516 ± 0.789
2.157GlyThr: 2.157 ± 1.178
4.313GlyVal: 4.313 ± 1.148
0.0GlyTrp: 0.0 ± 0.0
5.032GlyTyr: 5.032 ± 1.006
0.0GlyXaa: 0.0 ± 0.0
His
1.797HisAla: 1.797 ± 0.645
0.0HisCys: 0.0 ± 0.0
1.438HisAsp: 1.438 ± 0.585
1.078HisGlu: 1.078 ± 0.558
1.438HisPhe: 1.438 ± 0.679
0.719HisGly: 0.719 ± 0.529
0.0HisHis: 0.0 ± 0.0
1.078HisIle: 1.078 ± 0.581
1.078HisLys: 1.078 ± 0.515
1.797HisLeu: 1.797 ± 0.622
0.0HisMet: 0.0 ± 0.0
1.078HisAsn: 1.078 ± 0.461
0.719HisPro: 0.719 ± 0.524
0.0HisGln: 0.0 ± 0.0
1.078HisArg: 1.078 ± 0.704
0.719HisSer: 0.719 ± 0.493
0.359HisThr: 0.359 ± 0.426
0.359HisVal: 0.359 ± 0.426
0.0HisTrp: 0.0 ± 0.0
0.719HisTyr: 0.719 ± 0.472
0.0HisXaa: 0.0 ± 0.0
Ile
1.438IleAla: 1.438 ± 0.74
0.359IleCys: 0.359 ± 0.334
6.111IleAsp: 6.111 ± 2.239
3.954IleGlu: 3.954 ± 1.314
2.157IlePhe: 2.157 ± 0.826
3.235IleGly: 3.235 ± 1.022
1.797IleHis: 1.797 ± 0.62
1.797IleIle: 1.797 ± 0.573
7.549IleLys: 7.549 ± 1.409
6.83IleLeu: 6.83 ± 1.56
0.359IleMet: 0.359 ± 0.335
3.235IleAsn: 3.235 ± 0.888
2.876IlePro: 2.876 ± 0.802
2.876IleGln: 2.876 ± 1.045
2.157IleArg: 2.157 ± 0.768
6.83IleSer: 6.83 ± 1.258
4.313IleThr: 4.313 ± 1.212
1.078IleVal: 1.078 ± 0.555
0.359IleTrp: 0.359 ± 0.419
2.876IleTyr: 2.876 ± 0.97
0.0IleXaa: 0.0 ± 0.0
Lys
9.705LysAla: 9.705 ± 1.217
0.359LysCys: 0.359 ± 0.35
2.516LysAsp: 2.516 ± 1.199
13.659LysGlu: 13.659 ± 1.738
1.797LysPhe: 1.797 ± 0.856
4.673LysGly: 4.673 ± 1.24
3.235LysHis: 3.235 ± 1.104
5.751LysIle: 5.751 ± 1.34
6.47LysLys: 6.47 ± 1.619
6.111LysLeu: 6.111 ± 1.483
2.157LysMet: 2.157 ± 0.748
2.516LysAsn: 2.516 ± 0.995
2.876LysPro: 2.876 ± 1.004
4.673LysGln: 4.673 ± 1.265
5.392LysArg: 5.392 ± 1.303
4.313LysSer: 4.313 ± 1.813
6.83LysThr: 6.83 ± 1.385
5.392LysVal: 5.392 ± 1.425
0.719LysTrp: 0.719 ± 0.581
2.157LysTyr: 2.157 ± 0.73
0.0LysXaa: 0.0 ± 0.0
Leu
8.627LeuAla: 8.627 ± 1.353
0.359LeuCys: 0.359 ± 0.404
8.267LeuAsp: 8.267 ± 1.943
12.94LeuGlu: 12.94 ± 2.388
2.516LeuPhe: 2.516 ± 0.847
5.751LeuGly: 5.751 ± 1.504
0.719LeuHis: 0.719 ± 0.462
5.751LeuIle: 5.751 ± 1.368
10.784LeuLys: 10.784 ± 1.629
9.346LeuLeu: 9.346 ± 1.745
1.438LeuMet: 1.438 ± 0.624
8.986LeuAsn: 8.986 ± 1.701
2.157LeuPro: 2.157 ± 1.026
5.032LeuGln: 5.032 ± 1.482
4.673LeuArg: 4.673 ± 1.585
5.392LeuSer: 5.392 ± 1.337
8.267LeuThr: 8.267 ± 2.034
4.673LeuVal: 4.673 ± 1.008
0.359LeuTrp: 0.359 ± 0.322
3.954LeuTyr: 3.954 ± 1.836
0.0LeuXaa: 0.0 ± 0.0
Met
2.516MetAla: 2.516 ± 0.854
0.0MetCys: 0.0 ± 0.0
2.516MetAsp: 2.516 ± 0.841
2.157MetGlu: 2.157 ± 0.741
0.719MetPhe: 0.719 ± 0.539
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.719MetIle: 0.719 ± 0.503
1.797MetLys: 1.797 ± 0.819
2.516MetLeu: 2.516 ± 0.989
0.0MetMet: 0.0 ± 0.0
2.157MetAsn: 2.157 ± 0.903
0.0MetPro: 0.0 ± 0.0
1.438MetGln: 1.438 ± 0.66
1.438MetArg: 1.438 ± 0.575
1.078MetSer: 1.078 ± 0.56
3.954MetThr: 3.954 ± 1.397
1.438MetVal: 1.438 ± 0.612
0.0MetTrp: 0.0 ± 0.0
0.719MetTyr: 0.719 ± 0.497
0.0MetXaa: 0.0 ± 0.0
Asn
3.595AsnAla: 3.595 ± 1.031
0.0AsnCys: 0.0 ± 0.0
2.157AsnAsp: 2.157 ± 1.077
3.595AsnGlu: 3.595 ± 0.899
2.516AsnPhe: 2.516 ± 1.001
1.797AsnGly: 1.797 ± 0.678
1.078AsnHis: 1.078 ± 0.6
3.954AsnIle: 3.954 ± 1.425
4.673AsnLys: 4.673 ± 1.16
5.392AsnLeu: 5.392 ± 1.474
2.516AsnMet: 2.516 ± 1.388
2.516AsnAsn: 2.516 ± 1.009
1.797AsnPro: 1.797 ± 0.634
2.876AsnGln: 2.876 ± 0.96
2.876AsnArg: 2.876 ± 0.883
3.235AsnSer: 3.235 ± 1.27
3.954AsnThr: 3.954 ± 1.243
2.876AsnVal: 2.876 ± 0.948
0.0AsnTrp: 0.0 ± 0.0
1.078AsnTyr: 1.078 ± 0.508
0.0AsnXaa: 0.0 ± 0.0
Pro
0.719ProAla: 0.719 ± 0.446
0.0ProCys: 0.0 ± 0.0
1.438ProAsp: 1.438 ± 0.759
1.797ProGlu: 1.797 ± 0.584
1.438ProPhe: 1.438 ± 0.74
0.359ProGly: 0.359 ± 0.331
0.0ProHis: 0.0 ± 0.0
1.438ProIle: 1.438 ± 0.569
2.876ProLys: 2.876 ± 1.002
2.157ProLeu: 2.157 ± 0.87
0.359ProMet: 0.359 ± 0.334
0.719ProAsn: 0.719 ± 0.469
0.359ProPro: 0.359 ± 0.419
1.438ProGln: 1.438 ± 0.915
1.078ProArg: 1.078 ± 0.586
2.876ProSer: 2.876 ± 0.933
0.719ProThr: 0.719 ± 0.426
1.078ProVal: 1.078 ± 0.539
0.0ProTrp: 0.0 ± 0.0
1.797ProTyr: 1.797 ± 0.832
0.0ProXaa: 0.0 ± 0.0
Gln
4.313GlnAla: 4.313 ± 1.318
0.0GlnCys: 0.0 ± 0.0
2.157GlnAsp: 2.157 ± 1.092
4.673GlnGlu: 4.673 ± 1.04
1.078GlnPhe: 1.078 ± 0.704
1.438GlnGly: 1.438 ± 0.548
0.719GlnHis: 0.719 ± 0.513
0.719GlnIle: 0.719 ± 0.473
5.392GlnLys: 5.392 ± 1.699
3.595GlnLeu: 3.595 ± 1.162
0.719GlnMet: 0.719 ± 0.593
1.438GlnAsn: 1.438 ± 1.252
0.719GlnPro: 0.719 ± 0.565
3.235GlnGln: 3.235 ± 1.328
3.235GlnArg: 3.235 ± 1.063
2.516GlnSer: 2.516 ± 0.857
2.157GlnThr: 2.157 ± 0.809
2.876GlnVal: 2.876 ± 0.742
0.359GlnTrp: 0.359 ± 0.404
2.157GlnTyr: 2.157 ± 0.871
0.0GlnXaa: 0.0 ± 0.0
Arg
1.797ArgAla: 1.797 ± 0.64
0.359ArgCys: 0.359 ± 0.282
3.595ArgAsp: 3.595 ± 0.949
9.346ArgGlu: 9.346 ± 1.752
1.438ArgPhe: 1.438 ± 0.814
1.078ArgGly: 1.078 ± 0.512
1.078ArgHis: 1.078 ± 0.557
1.797ArgIle: 1.797 ± 0.777
3.954ArgLys: 3.954 ± 0.828
7.189ArgLeu: 7.189 ± 1.767
1.797ArgMet: 1.797 ± 0.705
2.516ArgAsn: 2.516 ± 1.303
0.359ArgPro: 0.359 ± 0.331
3.235ArgGln: 3.235 ± 1.086
1.797ArgArg: 1.797 ± 0.896
1.797ArgSer: 1.797 ± 0.636
1.797ArgThr: 1.797 ± 0.625
4.313ArgVal: 4.313 ± 1.481
0.719ArgTrp: 0.719 ± 0.473
3.595ArgTyr: 3.595 ± 1.325
0.0ArgXaa: 0.0 ± 0.0
Ser
1.797SerAla: 1.797 ± 0.828
0.0SerCys: 0.0 ± 0.0
5.392SerAsp: 5.392 ± 1.099
5.032SerGlu: 5.032 ± 1.362
3.595SerPhe: 3.595 ± 1.223
1.797SerGly: 1.797 ± 0.72
1.438SerHis: 1.438 ± 0.669
4.673SerIle: 4.673 ± 0.995
4.673SerLys: 4.673 ± 1.46
4.673SerLeu: 4.673 ± 1.014
2.157SerMet: 2.157 ± 1.198
1.797SerAsn: 1.797 ± 0.763
2.516SerPro: 2.516 ± 1.388
1.797SerGln: 1.797 ± 0.821
3.235SerArg: 3.235 ± 1.551
2.516SerSer: 2.516 ± 1.292
3.954SerThr: 3.954 ± 1.075
3.595SerVal: 3.595 ± 1.256
0.719SerTrp: 0.719 ± 0.52
2.516SerTyr: 2.516 ± 1.007
0.0SerXaa: 0.0 ± 0.0
Thr
2.876ThrAla: 2.876 ± 1.201
0.359ThrCys: 0.359 ± 0.35
1.078ThrAsp: 1.078 ± 0.607
5.392ThrGlu: 5.392 ± 0.831
2.876ThrPhe: 2.876 ± 1.356
3.595ThrGly: 3.595 ± 1.421
1.078ThrHis: 1.078 ± 0.873
4.673ThrIle: 4.673 ± 1.286
2.516ThrLys: 2.516 ± 1.022
9.346ThrLeu: 9.346 ± 1.254
2.516ThrMet: 2.516 ± 0.872
1.797ThrAsn: 1.797 ± 0.638
2.157ThrPro: 2.157 ± 0.641
3.595ThrGln: 3.595 ± 1.323
2.876ThrArg: 2.876 ± 0.729
2.516ThrSer: 2.516 ± 0.865
3.954ThrThr: 3.954 ± 1.629
3.954ThrVal: 3.954 ± 1.001
0.359ThrTrp: 0.359 ± 0.295
2.516ThrTyr: 2.516 ± 0.933
0.0ThrXaa: 0.0 ± 0.0
Val
3.595ValAla: 3.595 ± 1.075
0.359ValCys: 0.359 ± 0.306
4.673ValAsp: 4.673 ± 1.165
4.313ValGlu: 4.313 ± 1.582
0.719ValPhe: 0.719 ± 0.548
2.876ValGly: 2.876 ± 1.04
0.719ValHis: 0.719 ± 0.541
1.797ValIle: 1.797 ± 0.626
3.595ValLys: 3.595 ± 1.38
7.908ValLeu: 7.908 ± 1.785
1.078ValMet: 1.078 ± 0.57
2.876ValAsn: 2.876 ± 0.876
0.719ValPro: 0.719 ± 0.506
1.438ValGln: 1.438 ± 0.559
3.954ValArg: 3.954 ± 1.05
3.235ValSer: 3.235 ± 1.296
5.032ValThr: 5.032 ± 0.972
4.313ValVal: 4.313 ± 1.338
0.719ValTrp: 0.719 ± 0.528
2.157ValTyr: 2.157 ± 1.004
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.359TrpAsp: 0.359 ± 0.387
1.078TrpGlu: 1.078 ± 0.617
0.0TrpPhe: 0.0 ± 0.0
0.359TrpGly: 0.359 ± 0.413
0.0TrpHis: 0.0 ± 0.0
0.359TrpIle: 0.359 ± 0.357
2.157TrpLys: 2.157 ± 0.71
1.797TrpLeu: 1.797 ± 0.715
0.0TrpMet: 0.0 ± 0.0
0.719TrpAsn: 0.719 ± 0.566
0.0TrpPro: 0.0 ± 0.0
0.359TrpGln: 0.359 ± 0.357
0.719TrpArg: 0.719 ± 0.419
0.359TrpSer: 0.359 ± 0.426
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.078TyrAla: 1.078 ± 0.499
0.0TyrCys: 0.0 ± 0.0
1.078TyrAsp: 1.078 ± 0.893
2.516TyrGlu: 2.516 ± 1.082
1.438TyrPhe: 1.438 ± 0.793
1.438TyrGly: 1.438 ± 0.629
0.359TyrHis: 0.359 ± 0.342
2.157TyrIle: 2.157 ± 0.891
5.392TyrLys: 5.392 ± 1.446
6.111TyrLeu: 6.111 ± 1.815
2.157TyrMet: 2.157 ± 0.821
2.516TyrAsn: 2.516 ± 0.74
0.719TyrPro: 0.719 ± 0.587
2.876TyrGln: 2.876 ± 1.237
3.595TyrArg: 3.595 ± 1.217
1.797TyrSer: 1.797 ± 1.128
2.516TyrThr: 2.516 ± 1.14
1.438TyrVal: 1.438 ± 0.537
1.078TyrTrp: 1.078 ± 0.626
2.876TyrTyr: 2.876 ± 0.977
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (2783 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski