Amino acid dipepetide frequency for Streptococcus satellite phage Javan740

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.239AlaAla: 0.239 ± 0.26
0.239AlaCys: 0.239 ± 0.252
4.304AlaAsp: 4.304 ± 1.346
6.217AlaGlu: 6.217 ± 1.532
2.391AlaPhe: 2.391 ± 0.683
1.674AlaGly: 1.674 ± 0.632
0.478AlaHis: 0.478 ± 0.337
4.543AlaIle: 4.543 ± 0.889
5.261AlaLys: 5.261 ± 1.235
5.978AlaLeu: 5.978 ± 1.047
1.913AlaMet: 1.913 ± 0.758
2.63AlaAsn: 2.63 ± 0.736
0.956AlaPro: 0.956 ± 0.462
2.63AlaGln: 2.63 ± 0.747
3.587AlaArg: 3.587 ± 0.969
2.152AlaSer: 2.152 ± 0.703
2.869AlaThr: 2.869 ± 0.916
4.304AlaVal: 4.304 ± 0.802
0.239AlaTrp: 0.239 ± 0.259
2.391AlaTyr: 2.391 ± 0.753
0.0AlaXaa: 0.0 ± 0.0
Cys
0.717CysAla: 0.717 ± 0.341
0.0CysCys: 0.0 ± 0.0
0.717CysAsp: 0.717 ± 0.43
0.0CysGlu: 0.0 ± 0.0
0.478CysPhe: 0.478 ± 0.291
0.239CysGly: 0.239 ± 0.245
0.478CysHis: 0.478 ± 0.267
0.956CysIle: 0.956 ± 0.459
0.239CysLys: 0.239 ± 0.261
0.717CysLeu: 0.717 ± 0.354
0.717CysMet: 0.717 ± 0.508
0.239CysAsn: 0.239 ± 0.186
0.239CysPro: 0.239 ± 0.209
0.239CysGln: 0.239 ± 0.269
0.239CysArg: 0.239 ± 0.261
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.239CysTyr: 0.239 ± 0.343
0.0CysXaa: 0.0 ± 0.0
Asp
0.956AspAla: 0.956 ± 0.409
0.478AspCys: 0.478 ± 0.29
4.304AspAsp: 4.304 ± 0.75
6.217AspGlu: 6.217 ± 1.28
5.022AspPhe: 5.022 ± 0.994
3.348AspGly: 3.348 ± 0.799
0.0AspHis: 0.0 ± 0.0
5.978AspIle: 5.978 ± 0.932
4.543AspLys: 4.543 ± 0.935
6.934AspLeu: 6.934 ± 1.417
2.63AspMet: 2.63 ± 0.625
2.869AspAsn: 2.869 ± 0.96
0.956AspPro: 0.956 ± 0.423
0.717AspGln: 0.717 ± 0.424
4.065AspArg: 4.065 ± 0.991
2.869AspSer: 2.869 ± 0.838
2.63AspThr: 2.63 ± 0.808
1.196AspVal: 1.196 ± 0.447
0.239AspTrp: 0.239 ± 0.2
3.109AspTyr: 3.109 ± 1.017
0.0AspXaa: 0.0 ± 0.0
Glu
5.022GluAla: 5.022 ± 0.989
0.956GluCys: 0.956 ± 0.401
3.587GluAsp: 3.587 ± 0.834
6.934GluGlu: 6.934 ± 2.127
3.587GluPhe: 3.587 ± 0.968
3.587GluGly: 3.587 ± 1.051
1.196GluHis: 1.196 ± 0.58
9.326GluIle: 9.326 ± 1.156
6.934GluLys: 6.934 ± 1.155
10.521GluLeu: 10.521 ± 1.641
2.63GluMet: 2.63 ± 0.71
5.261GluAsn: 5.261 ± 1.121
3.109GluPro: 3.109 ± 1.146
4.304GluGln: 4.304 ± 1.45
5.978GluArg: 5.978 ± 1.188
5.022GluSer: 5.022 ± 1.098
4.782GluThr: 4.782 ± 0.967
6.217GluVal: 6.217 ± 1.594
0.239GluTrp: 0.239 ± 0.186
3.109GluTyr: 3.109 ± 0.917
0.0GluXaa: 0.0 ± 0.0
Phe
0.717PheAla: 0.717 ± 0.339
0.0PheCys: 0.0 ± 0.0
3.587PheAsp: 3.587 ± 0.985
1.913PheGlu: 1.913 ± 0.801
0.717PhePhe: 0.717 ± 0.341
2.869PheGly: 2.869 ± 0.892
0.478PheHis: 0.478 ± 0.275
2.869PheIle: 2.869 ± 0.762
5.261PheLys: 5.261 ± 1.23
3.587PheLeu: 3.587 ± 1.037
0.717PheMet: 0.717 ± 0.381
2.152PheAsn: 2.152 ± 0.716
0.239PhePro: 0.239 ± 0.245
2.391PheGln: 2.391 ± 0.6
1.674PheArg: 1.674 ± 0.698
2.869PheSer: 2.869 ± 0.742
0.956PheThr: 0.956 ± 0.449
1.913PheVal: 1.913 ± 0.641
0.478PheTrp: 0.478 ± 0.375
2.63PheTyr: 2.63 ± 0.786
0.0PheXaa: 0.0 ± 0.0
Gly
2.63GlyAla: 2.63 ± 0.669
0.0GlyCys: 0.0 ± 0.0
2.391GlyAsp: 2.391 ± 0.719
1.913GlyGlu: 1.913 ± 0.529
2.152GlyPhe: 2.152 ± 0.717
1.674GlyGly: 1.674 ± 0.637
0.717GlyHis: 0.717 ± 0.366
3.587GlyIle: 3.587 ± 0.686
5.739GlyLys: 5.739 ± 1.054
4.543GlyLeu: 4.543 ± 1.112
1.435GlyMet: 1.435 ± 0.444
3.348GlyAsn: 3.348 ± 0.854
0.239GlyPro: 0.239 ± 0.209
2.391GlyGln: 2.391 ± 0.677
2.63GlyArg: 2.63 ± 0.708
1.196GlySer: 1.196 ± 0.496
2.152GlyThr: 2.152 ± 0.772
4.304GlyVal: 4.304 ± 0.911
0.717GlyTrp: 0.717 ± 0.557
2.869GlyTyr: 2.869 ± 1.024
0.0GlyXaa: 0.0 ± 0.0
His
1.913HisAla: 1.913 ± 0.788
0.0HisCys: 0.0 ± 0.0
0.478HisAsp: 0.478 ± 0.346
0.717HisGlu: 0.717 ± 0.404
0.717HisPhe: 0.717 ± 0.377
0.956HisGly: 0.956 ± 0.377
0.0HisHis: 0.0 ± 0.0
0.478HisIle: 0.478 ± 0.344
1.196HisLys: 1.196 ± 0.496
1.435HisLeu: 1.435 ± 0.578
0.239HisMet: 0.239 ± 0.265
0.717HisAsn: 0.717 ± 0.391
0.239HisPro: 0.239 ± 0.249
0.956HisGln: 0.956 ± 0.51
1.196HisArg: 1.196 ± 0.625
1.435HisSer: 1.435 ± 0.714
0.478HisThr: 0.478 ± 0.418
0.239HisVal: 0.239 ± 0.186
0.0HisTrp: 0.0 ± 0.0
0.717HisTyr: 0.717 ± 0.377
0.0HisXaa: 0.0 ± 0.0
Ile
3.587IleAla: 3.587 ± 0.877
0.717IleCys: 0.717 ± 0.461
5.5IleAsp: 5.5 ± 1.587
7.174IleGlu: 7.174 ± 1.491
2.869IlePhe: 2.869 ± 0.711
3.587IleGly: 3.587 ± 0.903
0.478IleHis: 0.478 ± 0.312
4.304IleIle: 4.304 ± 1.153
7.174IleLys: 7.174 ± 1.332
4.543IleLeu: 4.543 ± 1.207
0.717IleMet: 0.717 ± 0.408
2.152IleAsn: 2.152 ± 0.829
2.63IlePro: 2.63 ± 0.862
3.348IleGln: 3.348 ± 0.809
2.391IleArg: 2.391 ± 0.638
6.695IleSer: 6.695 ± 1.374
3.348IleThr: 3.348 ± 0.845
3.348IleVal: 3.348 ± 0.852
0.478IleTrp: 0.478 ± 0.276
2.152IleTyr: 2.152 ± 0.651
0.0IleXaa: 0.0 ± 0.0
Lys
9.087LysAla: 9.087 ± 1.868
0.239LysCys: 0.239 ± 0.2
3.348LysAsp: 3.348 ± 0.816
9.087LysGlu: 9.087 ± 1.225
1.913LysPhe: 1.913 ± 0.742
2.391LysGly: 2.391 ± 0.677
2.63LysHis: 2.63 ± 0.981
4.543LysIle: 4.543 ± 1.057
9.326LysLys: 9.326 ± 1.61
9.087LysLeu: 9.087 ± 1.582
1.196LysMet: 1.196 ± 0.565
6.934LysAsn: 6.934 ± 1.334
2.869LysPro: 2.869 ± 0.915
5.261LysGln: 5.261 ± 1.129
6.934LysArg: 6.934 ± 1.287
5.5LysSer: 5.5 ± 1.036
5.5LysThr: 5.5 ± 1.369
4.304LysVal: 4.304 ± 0.777
1.435LysTrp: 1.435 ± 0.494
2.63LysTyr: 2.63 ± 0.857
0.0LysXaa: 0.0 ± 0.0
Leu
7.413LeuAla: 7.413 ± 1.418
1.196LeuCys: 1.196 ± 0.483
9.565LeuAsp: 9.565 ± 1.751
14.347LeuGlu: 14.347 ± 2.079
3.826LeuPhe: 3.826 ± 0.871
3.826LeuGly: 3.826 ± 0.993
0.717LeuHis: 0.717 ± 0.48
4.543LeuIle: 4.543 ± 0.936
7.652LeuLys: 7.652 ± 1.676
9.565LeuLeu: 9.565 ± 1.252
2.63LeuMet: 2.63 ± 0.684
6.695LeuAsn: 6.695 ± 1.311
2.63LeuPro: 2.63 ± 0.926
3.587LeuGln: 3.587 ± 0.693
3.587LeuArg: 3.587 ± 0.97
4.782LeuSer: 4.782 ± 0.94
6.695LeuThr: 6.695 ± 1.312
4.782LeuVal: 4.782 ± 1.257
0.239LeuTrp: 0.239 ± 0.186
4.065LeuTyr: 4.065 ± 0.838
0.0LeuXaa: 0.0 ± 0.0
Met
2.391MetAla: 2.391 ± 0.776
0.0MetCys: 0.0 ± 0.0
1.913MetAsp: 1.913 ± 0.664
3.348MetGlu: 3.348 ± 1.037
0.717MetPhe: 0.717 ± 0.435
1.913MetGly: 1.913 ± 0.699
0.0MetHis: 0.0 ± 0.0
1.435MetIle: 1.435 ± 0.679
2.391MetLys: 2.391 ± 0.629
1.435MetLeu: 1.435 ± 0.398
0.478MetMet: 0.478 ± 0.394
1.674MetAsn: 1.674 ± 0.689
0.956MetPro: 0.956 ± 0.5
0.717MetGln: 0.717 ± 0.525
1.435MetArg: 1.435 ± 0.654
1.435MetSer: 1.435 ± 0.73
3.587MetThr: 3.587 ± 1.139
0.956MetVal: 0.956 ± 0.475
0.239MetTrp: 0.239 ± 0.186
0.239MetTyr: 0.239 ± 0.249
0.0MetXaa: 0.0 ± 0.0
Asn
3.587AsnAla: 3.587 ± 0.867
0.717AsnCys: 0.717 ± 0.514
2.869AsnAsp: 2.869 ± 1.031
2.63AsnGlu: 2.63 ± 0.722
1.435AsnPhe: 1.435 ± 0.497
4.065AsnGly: 4.065 ± 1.173
1.674AsnHis: 1.674 ± 0.676
3.109AsnIle: 3.109 ± 1.099
6.217AsnLys: 6.217 ± 1.193
4.065AsnLeu: 4.065 ± 0.782
3.348AsnMet: 3.348 ± 0.974
1.913AsnAsn: 1.913 ± 0.77
3.348AsnPro: 3.348 ± 0.661
1.674AsnGln: 1.674 ± 0.692
1.674AsnArg: 1.674 ± 0.537
1.674AsnSer: 1.674 ± 0.711
4.065AsnThr: 4.065 ± 1.122
2.63AsnVal: 2.63 ± 0.691
0.239AsnTrp: 0.239 ± 0.209
3.348AsnTyr: 3.348 ± 1.04
0.0AsnXaa: 0.0 ± 0.0
Pro
1.196ProAla: 1.196 ± 0.531
0.239ProCys: 0.239 ± 0.186
1.913ProAsp: 1.913 ± 0.638
2.152ProGlu: 2.152 ± 0.734
1.674ProPhe: 1.674 ± 0.718
0.478ProGly: 0.478 ± 0.332
0.478ProHis: 0.478 ± 0.356
1.674ProIle: 1.674 ± 0.499
2.63ProLys: 2.63 ± 0.762
2.152ProLeu: 2.152 ± 0.662
0.717ProMet: 0.717 ± 0.435
1.435ProAsn: 1.435 ± 0.592
1.435ProPro: 1.435 ± 0.603
1.674ProGln: 1.674 ± 0.638
3.587ProArg: 3.587 ± 1.01
0.956ProSer: 0.956 ± 0.446
0.956ProThr: 0.956 ± 0.454
1.674ProVal: 1.674 ± 0.682
0.0ProTrp: 0.0 ± 0.0
1.674ProTyr: 1.674 ± 0.498
0.0ProXaa: 0.0 ± 0.0
Gln
3.109GlnAla: 3.109 ± 0.876
0.0GlnCys: 0.0 ± 0.0
2.391GlnAsp: 2.391 ± 0.732
3.826GlnGlu: 3.826 ± 0.886
1.674GlnPhe: 1.674 ± 0.724
3.109GlnGly: 3.109 ± 0.903
0.717GlnHis: 0.717 ± 0.413
1.674GlnIle: 1.674 ± 0.592
2.869GlnLys: 2.869 ± 0.683
4.543GlnLeu: 4.543 ± 1.053
0.717GlnMet: 0.717 ± 0.54
2.63GlnAsn: 2.63 ± 1.171
0.478GlnPro: 0.478 ± 0.276
1.674GlnGln: 1.674 ± 0.695
2.869GlnArg: 2.869 ± 0.86
2.152GlnSer: 2.152 ± 0.588
2.869GlnThr: 2.869 ± 0.99
3.109GlnVal: 3.109 ± 0.821
1.196GlnTrp: 1.196 ± 0.375
0.478GlnTyr: 0.478 ± 0.325
0.0GlnXaa: 0.0 ± 0.0
Arg
2.391ArgAla: 2.391 ± 0.727
0.478ArgCys: 0.478 ± 0.398
1.674ArgAsp: 1.674 ± 0.699
6.695ArgGlu: 6.695 ± 1.452
3.109ArgPhe: 3.109 ± 0.962
2.391ArgGly: 2.391 ± 0.9
1.196ArgHis: 1.196 ± 0.458
4.782ArgIle: 4.782 ± 0.919
4.543ArgLys: 4.543 ± 0.97
8.369ArgLeu: 8.369 ± 1.207
1.435ArgMet: 1.435 ± 0.547
2.391ArgAsn: 2.391 ± 0.643
0.478ArgPro: 0.478 ± 0.314
2.63ArgGln: 2.63 ± 0.845
1.196ArgArg: 1.196 ± 0.406
1.913ArgSer: 1.913 ± 0.636
3.109ArgThr: 3.109 ± 0.645
2.63ArgVal: 2.63 ± 0.776
0.239ArgTrp: 0.239 ± 0.186
2.869ArgTyr: 2.869 ± 0.878
0.0ArgXaa: 0.0 ± 0.0
Ser
1.674SerAla: 1.674 ± 0.559
0.239SerCys: 0.239 ± 0.261
2.869SerAsp: 2.869 ± 0.713
5.261SerGlu: 5.261 ± 1.331
1.196SerPhe: 1.196 ± 0.449
3.109SerGly: 3.109 ± 0.714
0.717SerHis: 0.717 ± 0.569
4.304SerIle: 4.304 ± 0.862
5.5SerLys: 5.5 ± 0.813
5.5SerLeu: 5.5 ± 1.066
1.196SerMet: 1.196 ± 0.521
4.065SerAsn: 4.065 ± 0.999
2.391SerPro: 2.391 ± 0.62
1.674SerGln: 1.674 ± 0.5
3.348SerArg: 3.348 ± 1.123
2.869SerSer: 2.869 ± 0.815
1.913SerThr: 1.913 ± 0.68
2.63SerVal: 2.63 ± 0.823
0.717SerTrp: 0.717 ± 0.489
2.63SerTyr: 2.63 ± 0.756
0.0SerXaa: 0.0 ± 0.0
Thr
2.391ThrAla: 2.391 ± 0.747
0.239ThrCys: 0.239 ± 0.257
1.913ThrAsp: 1.913 ± 0.586
3.587ThrGlu: 3.587 ± 0.796
1.913ThrPhe: 1.913 ± 0.543
3.109ThrGly: 3.109 ± 0.753
1.196ThrHis: 1.196 ± 0.417
2.869ThrIle: 2.869 ± 0.764
5.261ThrLys: 5.261 ± 1.577
6.695ThrLeu: 6.695 ± 1.33
1.196ThrMet: 1.196 ± 0.493
1.435ThrAsn: 1.435 ± 0.747
2.152ThrPro: 2.152 ± 0.794
2.391ThrGln: 2.391 ± 0.759
1.913ThrArg: 1.913 ± 0.507
2.63ThrSer: 2.63 ± 0.721
5.261ThrThr: 5.261 ± 1.408
5.261ThrVal: 5.261 ± 1.042
0.717ThrTrp: 0.717 ± 0.393
3.826ThrTyr: 3.826 ± 1.287
0.0ThrXaa: 0.0 ± 0.0
Val
3.826ValAla: 3.826 ± 1.011
0.239ValCys: 0.239 ± 0.186
3.109ValAsp: 3.109 ± 0.928
5.739ValGlu: 5.739 ± 1.206
1.196ValPhe: 1.196 ± 0.604
2.391ValGly: 2.391 ± 0.797
0.0ValHis: 0.0 ± 0.0
2.391ValIle: 2.391 ± 0.818
6.456ValLys: 6.456 ± 1.085
5.022ValLeu: 5.022 ± 1.185
1.913ValMet: 1.913 ± 0.669
4.065ValAsn: 4.065 ± 1.111
1.674ValPro: 1.674 ± 0.61
1.913ValGln: 1.913 ± 0.69
2.152ValArg: 2.152 ± 0.811
3.826ValSer: 3.826 ± 0.774
2.869ValThr: 2.869 ± 0.77
4.304ValVal: 4.304 ± 0.994
0.956ValTrp: 0.956 ± 0.477
2.63ValTyr: 2.63 ± 0.736
0.0ValXaa: 0.0 ± 0.0
Trp
0.717TrpAla: 0.717 ± 0.332
0.0TrpCys: 0.0 ± 0.0
0.239TrpAsp: 0.239 ± 0.313
1.674TrpGlu: 1.674 ± 0.529
0.239TrpPhe: 0.239 ± 0.186
0.717TrpGly: 0.717 ± 0.39
0.239TrpHis: 0.239 ± 0.186
0.478TrpIle: 0.478 ± 0.364
0.478TrpLys: 0.478 ± 0.288
0.478TrpLeu: 0.478 ± 0.267
0.0TrpMet: 0.0 ± 0.0
0.239TrpAsn: 0.239 ± 0.222
0.0TrpPro: 0.0 ± 0.0
0.717TrpGln: 0.717 ± 0.388
0.239TrpArg: 0.239 ± 0.186
0.717TrpSer: 0.717 ± 0.353
0.239TrpThr: 0.239 ± 0.289
0.956TrpVal: 0.956 ± 0.446
0.0TrpTrp: 0.0 ± 0.0
0.239TrpTyr: 0.239 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.435TyrAla: 1.435 ± 0.617
0.478TyrCys: 0.478 ± 0.296
2.391TyrAsp: 2.391 ± 0.67
2.869TyrGlu: 2.869 ± 0.757
1.196TyrPhe: 1.196 ± 0.544
1.196TyrGly: 1.196 ± 0.654
0.717TyrHis: 0.717 ± 0.382
3.348TyrIle: 3.348 ± 1.009
4.543TyrLys: 4.543 ± 1.383
7.174TyrLeu: 7.174 ± 1.366
1.196TyrMet: 1.196 ± 0.553
1.674TyrAsn: 1.674 ± 0.582
1.674TyrPro: 1.674 ± 0.682
1.196TyrGln: 1.196 ± 0.573
3.826TyrArg: 3.826 ± 0.735
3.109TyrSer: 3.109 ± 0.757
1.674TyrThr: 1.674 ± 0.612
1.913TyrVal: 1.913 ± 0.619
0.239TyrTrp: 0.239 ± 0.289
0.956TyrTyr: 0.956 ± 0.523
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 31 proteins (4183 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski