Amino acid dipeptide frequency for Streptococcus phage SW4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
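As an illustration of the idea, here is a minimal Python sketch that counts overlapping adjacent pairs in a set of protein sequences. The function name `dipeptide_frequencies`, the one-letter amino acid codes (the table below uses three-letter codes), the toy sequences, and the normalization by the total number of dipeptides are all assumptions made for this example; the page does not state how the database normalizes its values.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_frequencies(["MKKA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

Note that the pairs are ordered: "AK" and "KA" are counted separately, just as AlaLys and LysAla are separate cells in the table.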

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
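The arithmetic behind the expected value can be sketched as follows. The helper names and the simple greater-than/less-than rule are assumptions for illustration; the page does not say which exact threshold is used for the red and blue marking.

```python
def uniform_expectation(alphabet_size=20):
    """Expected per mille frequency if all dipeptides were equally likely."""
    n_pairs = alphabet_size ** 2  # 20 * 20 = 400, or 21 * 21 = 441 with Xaa
    return 1000.0 / n_pairs


def classify(freq_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected_permille:
        return "over"   # shown in red on the page
    if freq_permille < expected_permille:
        return "under"  # shown in blue on the page
    return "expected"


expected = uniform_expectation(20)          # 2.5 per mille, i.e. 0.25%
ala_ala = classify(5.114, expected)         # AlaAla value from the table
ala_cys = classify(0.284, expected)         # AlaCys value from the table
print(expected, ala_ala, ala_cys)
```

With 21 symbols (including Xaa) the same calculation gives 1000 / 441, roughly 2.27‰.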

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
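For example, the unit conversion for a single table value could look like this; the function names are hypothetical and only serve to make the scaling explicit.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 5.114  # AlaAla value from the table, in per mille
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(f"{ala_ala} per mille = {fraction:.6f} = {percent:.4f}%")
```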

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.114AlaAla: 5.114 ± 2.87
0.284AlaCys: 0.284 ± 0.148
4.072AlaAsp: 4.072 ± 0.657
4.641AlaGlu: 4.641 ± 0.67
2.652AlaPhe: 2.652 ± 0.827
4.356AlaGly: 4.356 ± 1.192
0.568AlaHis: 0.568 ± 0.218
6.724AlaIle: 6.724 ± 1.857
6.345AlaLys: 6.345 ± 0.733
6.345AlaLeu: 6.345 ± 1.006
2.746AlaMet: 2.746 ± 0.998
5.209AlaAsn: 5.209 ± 0.777
2.746AlaPro: 2.746 ± 0.604
2.936AlaGln: 2.936 ± 0.78
2.746AlaArg: 2.746 ± 0.494
4.262AlaSer: 4.262 ± 0.905
4.735AlaThr: 4.735 ± 1.24
4.83AlaVal: 4.83 ± 1.415
0.379AlaTrp: 0.379 ± 0.196
1.894AlaTyr: 1.894 ± 0.363
0.0AlaXaa: 0.0 ± 0.0
Cys
0.379CysAla: 0.379 ± 0.212
0.284CysCys: 0.284 ± 0.205
0.568CysAsp: 0.568 ± 0.253
0.758CysGlu: 0.758 ± 0.309
0.095CysPhe: 0.095 ± 0.096
0.474CysGly: 0.474 ± 0.236
0.284CysHis: 0.284 ± 0.149
0.379CysIle: 0.379 ± 0.149
0.474CysLys: 0.474 ± 0.186
0.663CysLeu: 0.663 ± 0.266
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.095CysPro: 0.095 ± 0.103
0.095CysGln: 0.095 ± 0.107
0.284CysArg: 0.284 ± 0.17
0.758CysSer: 0.758 ± 0.292
0.095CysThr: 0.095 ± 0.099
0.284CysVal: 0.284 ± 0.139
0.095CysTrp: 0.095 ± 0.097
0.284CysTyr: 0.284 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
2.936AspAla: 2.936 ± 0.622
0.379AspCys: 0.379 ± 0.202
4.83AspAsp: 4.83 ± 1.057
3.978AspGlu: 3.978 ± 0.764
4.072AspPhe: 4.072 ± 0.739
3.694AspGly: 3.694 ± 0.51
0.758AspHis: 0.758 ± 0.287
4.451AspIle: 4.451 ± 0.711
4.735AspLys: 4.735 ± 0.686
3.883AspLeu: 3.883 ± 0.831
1.799AspMet: 1.799 ± 0.323
3.22AspAsn: 3.22 ± 0.746
1.136AspPro: 1.136 ± 0.316
1.515AspGln: 1.515 ± 0.394
2.557AspArg: 2.557 ± 0.545
3.22AspSer: 3.22 ± 0.84
3.125AspThr: 3.125 ± 0.486
2.841AspVal: 2.841 ± 0.65
0.758AspTrp: 0.758 ± 0.373
2.462AspTyr: 2.462 ± 0.507
0.0AspXaa: 0.0 ± 0.0
Glu
4.735GluAla: 4.735 ± 0.785
0.189GluCys: 0.189 ± 0.127
3.883GluAsp: 3.883 ± 0.736
6.914GluGlu: 6.914 ± 1.508
2.841GluPhe: 2.841 ± 0.594
3.031GluGly: 3.031 ± 0.382
1.326GluHis: 1.326 ± 0.346
4.262GluIle: 4.262 ± 0.83
5.304GluLys: 5.304 ± 1.092
7.103GluLeu: 7.103 ± 1.101
2.178GluMet: 2.178 ± 0.514
4.356GluAsn: 4.356 ± 0.83
1.326GluPro: 1.326 ± 0.4
3.22GluGln: 3.22 ± 0.639
3.125GluArg: 3.125 ± 0.492
3.409GluSer: 3.409 ± 0.503
4.072GluThr: 4.072 ± 0.742
5.777GluVal: 5.777 ± 0.899
1.136GluTrp: 1.136 ± 0.383
3.031GluTyr: 3.031 ± 0.719
0.0GluXaa: 0.0 ± 0.0
Phe
2.273PheAla: 2.273 ± 0.433
0.379PheCys: 0.379 ± 0.182
3.599PheAsp: 3.599 ± 0.672
3.788PheGlu: 3.788 ± 0.617
1.515PhePhe: 1.515 ± 0.355
3.599PheGly: 3.599 ± 0.647
0.568PheHis: 0.568 ± 0.217
3.125PheIle: 3.125 ± 0.428
3.978PheLys: 3.978 ± 0.591
1.894PheLeu: 1.894 ± 0.477
0.852PheMet: 0.852 ± 0.243
3.599PheAsn: 3.599 ± 0.536
0.852PhePro: 0.852 ± 0.276
1.136PheGln: 1.136 ± 0.28
1.515PheArg: 1.515 ± 0.402
3.788PheSer: 3.788 ± 0.787
2.273PheThr: 2.273 ± 0.417
2.273PheVal: 2.273 ± 0.449
0.284PheTrp: 0.284 ± 0.163
1.705PheTyr: 1.705 ± 0.545
0.0PheXaa: 0.0 ± 0.0
Gly
3.788GlyAla: 3.788 ± 1.167
0.474GlyCys: 0.474 ± 0.186
3.599GlyAsp: 3.599 ± 0.589
3.883GlyGlu: 3.883 ± 0.562
2.936GlyPhe: 2.936 ± 0.581
2.462GlyGly: 2.462 ± 0.394
1.042GlyHis: 1.042 ± 0.315
7.482GlyIle: 7.482 ± 1.892
5.304GlyLys: 5.304 ± 0.686
5.588GlyLeu: 5.588 ± 0.98
1.894GlyMet: 1.894 ± 0.742
3.031GlyAsn: 3.031 ± 0.542
0.379GlyPro: 0.379 ± 0.156
2.746GlyGln: 2.746 ± 0.505
1.799GlyArg: 1.799 ± 0.434
3.788GlySer: 3.788 ± 0.969
4.735GlyThr: 4.735 ± 1.363
2.746GlyVal: 2.746 ± 0.577
0.474GlyTrp: 0.474 ± 0.164
2.084GlyTyr: 2.084 ± 0.561
0.0GlyXaa: 0.0 ± 0.0
His
0.474HisAla: 0.474 ± 0.22
0.095HisCys: 0.095 ± 0.092
0.947HisAsp: 0.947 ± 0.376
0.852HisGlu: 0.852 ± 0.273
0.663HisPhe: 0.663 ± 0.259
0.947HisGly: 0.947 ± 0.359
0.474HisHis: 0.474 ± 0.178
0.568HisIle: 0.568 ± 0.237
1.136HisLys: 1.136 ± 0.338
0.947HisLeu: 0.947 ± 0.245
0.284HisMet: 0.284 ± 0.168
0.568HisAsn: 0.568 ± 0.269
0.568HisPro: 0.568 ± 0.207
0.663HisGln: 0.663 ± 0.299
0.947HisArg: 0.947 ± 0.302
0.663HisSer: 0.663 ± 0.196
0.947HisThr: 0.947 ± 0.299
0.947HisVal: 0.947 ± 0.28
0.284HisTrp: 0.284 ± 0.148
0.474HisTyr: 0.474 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
6.251IleAla: 6.251 ± 1.424
0.474IleCys: 0.474 ± 0.236
4.262IleAsp: 4.262 ± 0.612
5.114IleGlu: 5.114 ± 0.865
1.61IlePhe: 1.61 ± 0.406
5.114IleGly: 5.114 ± 1.483
1.042IleHis: 1.042 ± 0.457
3.409IleIle: 3.409 ± 1.13
8.05IleLys: 8.05 ± 0.614
4.641IleLeu: 4.641 ± 0.79
1.515IleMet: 1.515 ± 0.425
6.061IleAsn: 6.061 ± 0.949
1.894IlePro: 1.894 ± 0.337
2.557IleGln: 2.557 ± 0.399
3.125IleArg: 3.125 ± 0.568
6.535IleSer: 6.535 ± 1.38
4.641IleThr: 4.641 ± 0.846
3.409IleVal: 3.409 ± 0.55
0.663IleTrp: 0.663 ± 0.263
2.368IleTyr: 2.368 ± 0.542
0.0IleXaa: 0.0 ± 0.0
Lys
6.345LysAla: 6.345 ± 0.648
0.758LysCys: 0.758 ± 0.274
4.262LysAsp: 4.262 ± 0.851
6.535LysGlu: 6.535 ± 1.128
2.652LysPhe: 2.652 ± 0.466
4.735LysGly: 4.735 ± 0.529
0.947LysHis: 0.947 ± 0.333
5.777LysIle: 5.777 ± 0.71
5.966LysLys: 5.966 ± 1.136
7.292LysLeu: 7.292 ± 1.03
2.462LysMet: 2.462 ± 0.401
3.694LysAsn: 3.694 ± 0.635
3.031LysPro: 3.031 ± 0.538
4.356LysGln: 4.356 ± 0.736
4.356LysArg: 4.356 ± 0.789
6.535LysSer: 6.535 ± 0.621
6.061LysThr: 6.061 ± 0.848
5.114LysVal: 5.114 ± 0.608
0.852LysTrp: 0.852 ± 0.228
4.072LysTyr: 4.072 ± 1.036
0.0LysXaa: 0.0 ± 0.0
Leu
6.629LeuAla: 6.629 ± 0.893
0.284LeuCys: 0.284 ± 0.173
4.735LeuAsp: 4.735 ± 0.703
6.156LeuGlu: 6.156 ± 1.096
3.031LeuPhe: 3.031 ± 0.571
5.588LeuGly: 5.588 ± 0.897
0.852LeuHis: 0.852 ± 0.277
3.599LeuIle: 3.599 ± 0.509
7.292LeuLys: 7.292 ± 0.934
3.883LeuLeu: 3.883 ± 0.68
1.894LeuMet: 1.894 ± 0.461
5.019LeuAsn: 5.019 ± 0.66
2.273LeuPro: 2.273 ± 0.529
3.125LeuGln: 3.125 ± 0.537
3.504LeuArg: 3.504 ± 0.606
6.251LeuSer: 6.251 ± 0.874
5.304LeuThr: 5.304 ± 0.711
4.641LeuVal: 4.641 ± 0.581
0.474LeuTrp: 0.474 ± 0.2
2.746LeuTyr: 2.746 ± 0.674
0.0LeuXaa: 0.0 ± 0.0
Met
2.273MetAla: 2.273 ± 1.009
0.095MetCys: 0.095 ± 0.085
0.852MetAsp: 0.852 ± 0.419
1.136MetGlu: 1.136 ± 0.306
1.326MetPhe: 1.326 ± 0.24
1.705MetGly: 1.705 ± 0.379
0.568MetHis: 0.568 ± 0.235
2.557MetIle: 2.557 ± 0.458
2.936MetLys: 2.936 ± 0.411
2.368MetLeu: 2.368 ± 0.582
1.421MetMet: 1.421 ± 0.434
1.421MetAsn: 1.421 ± 0.293
0.284MetPro: 0.284 ± 0.138
1.799MetGln: 1.799 ± 0.589
1.136MetArg: 1.136 ± 0.377
2.084MetSer: 2.084 ± 0.443
2.368MetThr: 2.368 ± 0.48
1.989MetVal: 1.989 ± 0.68
0.189MetTrp: 0.189 ± 0.122
0.852MetTyr: 0.852 ± 0.348
0.0MetXaa: 0.0 ± 0.0
Asn
3.978AsnAla: 3.978 ± 0.515
0.663AsnCys: 0.663 ± 0.201
3.599AsnAsp: 3.599 ± 0.735
3.315AsnGlu: 3.315 ± 0.735
3.22AsnPhe: 3.22 ± 0.64
3.978AsnGly: 3.978 ± 0.628
1.042AsnHis: 1.042 ± 0.332
4.356AsnIle: 4.356 ± 0.818
5.398AsnLys: 5.398 ± 1.051
3.504AsnLeu: 3.504 ± 0.494
1.421AsnMet: 1.421 ± 0.272
2.652AsnAsn: 2.652 ± 0.419
1.894AsnPro: 1.894 ± 0.4
3.694AsnGln: 3.694 ± 0.634
1.231AsnArg: 1.231 ± 0.32
3.599AsnSer: 3.599 ± 0.773
2.936AsnThr: 2.936 ± 0.499
2.746AsnVal: 2.746 ± 0.543
0.852AsnTrp: 0.852 ± 0.23
2.368AsnTyr: 2.368 ± 0.514
0.0AsnXaa: 0.0 ± 0.0
Pro
1.136ProAla: 1.136 ± 0.355
0.0ProCys: 0.0 ± 0.0
0.947ProAsp: 0.947 ± 0.234
1.61ProGlu: 1.61 ± 0.42
1.515ProPhe: 1.515 ± 0.395
1.231ProGly: 1.231 ± 0.283
0.379ProHis: 0.379 ± 0.174
2.084ProIle: 2.084 ± 0.422
2.841ProLys: 2.841 ± 0.572
1.989ProLeu: 1.989 ± 0.535
0.189ProMet: 0.189 ± 0.13
1.231ProAsn: 1.231 ± 0.463
0.568ProPro: 0.568 ± 0.174
1.421ProGln: 1.421 ± 0.361
1.042ProArg: 1.042 ± 0.402
2.178ProSer: 2.178 ± 0.546
1.515ProThr: 1.515 ± 0.333
1.989ProVal: 1.989 ± 0.477
0.095ProTrp: 0.095 ± 0.099
1.799ProTyr: 1.799 ± 0.516
0.0ProXaa: 0.0 ± 0.0
Gln
5.588GlnAla: 5.588 ± 1.221
0.189GlnCys: 0.189 ± 0.144
1.799GlnAsp: 1.799 ± 0.303
2.178GlnGlu: 2.178 ± 0.653
1.989GlnPhe: 1.989 ± 0.374
2.746GlnGly: 2.746 ± 0.742
0.474GlnHis: 0.474 ± 0.219
3.694GlnIle: 3.694 ± 0.8
3.409GlnLys: 3.409 ± 0.563
4.167GlnLeu: 4.167 ± 0.793
2.368GlnMet: 2.368 ± 0.453
2.084GlnAsn: 2.084 ± 0.469
0.568GlnPro: 0.568 ± 0.211
3.315GlnGln: 3.315 ± 0.75
1.61GlnArg: 1.61 ± 0.334
3.599GlnSer: 3.599 ± 0.535
1.799GlnThr: 1.799 ± 0.352
2.368GlnVal: 2.368 ± 0.429
0.947GlnTrp: 0.947 ± 0.275
1.326GlnTyr: 1.326 ± 0.453
0.0GlnXaa: 0.0 ± 0.0
Arg
2.557ArgAla: 2.557 ± 0.425
0.095ArgCys: 0.095 ± 0.074
2.936ArgAsp: 2.936 ± 0.462
3.315ArgGlu: 3.315 ± 0.579
1.231ArgPhe: 1.231 ± 0.446
1.799ArgGly: 1.799 ± 0.404
0.568ArgHis: 0.568 ± 0.209
2.178ArgIle: 2.178 ± 0.462
3.504ArgLys: 3.504 ± 0.749
3.504ArgLeu: 3.504 ± 0.681
1.515ArgMet: 1.515 ± 0.454
2.178ArgAsn: 2.178 ± 0.533
1.042ArgPro: 1.042 ± 0.403
2.368ArgGln: 2.368 ± 0.572
1.894ArgArg: 1.894 ± 0.435
1.989ArgSer: 1.989 ± 0.341
2.368ArgThr: 2.368 ± 0.536
1.894ArgVal: 1.894 ± 0.487
0.379ArgTrp: 0.379 ± 0.17
2.178ArgTyr: 2.178 ± 0.468
0.0ArgXaa: 0.0 ± 0.0
Ser
6.535SerAla: 6.535 ± 2.837
0.474SerCys: 0.474 ± 0.194
2.462SerAsp: 2.462 ± 0.623
4.262SerGlu: 4.262 ± 0.706
3.22SerPhe: 3.22 ± 0.503
4.641SerGly: 4.641 ± 0.934
0.568SerHis: 0.568 ± 0.177
5.493SerIle: 5.493 ± 0.876
5.682SerLys: 5.682 ± 0.756
6.724SerLeu: 6.724 ± 0.807
2.273SerMet: 2.273 ± 0.456
3.125SerAsn: 3.125 ± 0.636
1.894SerPro: 1.894 ± 0.376
3.788SerGln: 3.788 ± 0.841
2.652SerArg: 2.652 ± 0.586
4.546SerSer: 4.546 ± 0.816
3.409SerThr: 3.409 ± 0.746
5.019SerVal: 5.019 ± 0.725
0.758SerTrp: 0.758 ± 0.241
2.462SerTyr: 2.462 ± 0.611
0.0SerXaa: 0.0 ± 0.0
Thr
5.114ThrAla: 5.114 ± 1.306
0.379ThrCys: 0.379 ± 0.193
3.599ThrAsp: 3.599 ± 0.752
3.599ThrGlu: 3.599 ± 0.686
2.462ThrPhe: 2.462 ± 0.478
3.599ThrGly: 3.599 ± 0.764
0.474ThrHis: 0.474 ± 0.209
4.546ThrIle: 4.546 ± 0.632
4.735ThrLys: 4.735 ± 0.73
5.304ThrLeu: 5.304 ± 0.713
1.421ThrMet: 1.421 ± 0.606
3.599ThrAsn: 3.599 ± 0.609
2.462ThrPro: 2.462 ± 0.47
3.031ThrGln: 3.031 ± 0.821
1.989ThrArg: 1.989 ± 0.532
4.262ThrSer: 4.262 ± 1.188
4.546ThrThr: 4.546 ± 0.769
4.262ThrVal: 4.262 ± 0.551
0.284ThrTrp: 0.284 ± 0.144
2.841ThrTyr: 2.841 ± 0.631
0.0ThrXaa: 0.0 ± 0.0
Val
4.735ValAla: 4.735 ± 0.698
0.284ValCys: 0.284 ± 0.155
2.368ValAsp: 2.368 ± 0.507
5.114ValGlu: 5.114 ± 0.994
3.031ValPhe: 3.031 ± 0.51
3.599ValGly: 3.599 ± 0.91
0.758ValHis: 0.758 ± 0.397
4.546ValIle: 4.546 ± 0.612
4.735ValLys: 4.735 ± 0.65
2.652ValLeu: 2.652 ± 0.442
1.421ValMet: 1.421 ± 0.339
2.841ValAsn: 2.841 ± 0.504
1.421ValPro: 1.421 ± 0.261
2.178ValGln: 2.178 ± 0.567
1.989ValArg: 1.989 ± 0.338
5.019ValSer: 5.019 ± 0.655
5.398ValThr: 5.398 ± 0.626
4.072ValVal: 4.072 ± 0.565
0.758ValTrp: 0.758 ± 0.317
2.652ValTyr: 2.652 ± 0.567
0.0ValXaa: 0.0 ± 0.0
Trp
0.379TrpAla: 0.379 ± 0.182
0.095TrpCys: 0.095 ± 0.099
0.474TrpAsp: 0.474 ± 0.197
0.568TrpGlu: 0.568 ± 0.211
0.379TrpPhe: 0.379 ± 0.175
0.284TrpGly: 0.284 ± 0.181
0.189TrpHis: 0.189 ± 0.132
0.474TrpIle: 0.474 ± 0.187
0.663TrpLys: 0.663 ± 0.211
0.947TrpLeu: 0.947 ± 0.31
0.379TrpMet: 0.379 ± 0.194
0.852TrpAsn: 0.852 ± 0.246
0.189TrpPro: 0.189 ± 0.136
0.474TrpGln: 0.474 ± 0.164
0.568TrpArg: 0.568 ± 0.268
0.758TrpSer: 0.758 ± 0.29
0.758TrpThr: 0.758 ± 0.285
0.758TrpVal: 0.758 ± 0.314
0.095TrpTrp: 0.095 ± 0.082
0.852TrpTyr: 0.852 ± 0.274
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.368TyrAla: 2.368 ± 0.533
0.474TyrCys: 0.474 ± 0.264
2.652TyrAsp: 2.652 ± 0.649
3.504TyrGlu: 3.504 ± 0.747
2.557TyrPhe: 2.557 ± 0.643
2.652TyrGly: 2.652 ± 0.548
0.568TyrHis: 0.568 ± 0.199
3.031TyrIle: 3.031 ± 0.602
3.315TyrLys: 3.315 ± 0.792
3.883TyrLeu: 3.883 ± 0.926
1.231TyrMet: 1.231 ± 0.406
1.705TyrAsn: 1.705 ± 0.475
1.136TyrPro: 1.136 ± 0.359
1.799TyrGln: 1.799 ± 0.494
1.421TyrArg: 1.421 ± 0.476
2.746TyrSer: 2.746 ± 0.572
1.421TyrThr: 1.421 ± 0.426
1.61TyrVal: 1.61 ± 0.446
0.379TyrTrp: 0.379 ± 0.165
2.084TyrTyr: 2.084 ± 0.588
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (10560 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
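A protein-level bootstrap of this kind might be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is an illustrative sketch, not the database's actual procedure; the function names, the fixed random seed, the toy sequences, the one-letter codes, and the use of the sample standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
proteins = ["MKKAL", "AKKE", "MALLK", "KKKA"]
error = bootstrap_error(proteins, "KK")
print(f"KK: {pair_permille(proteins, 'KK'):.1f} ± {error:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins.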

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski