Amino acid dipeptide frequency for Marinomonas phage CPP1m

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
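As a rough illustration of what such a calculation can look like, the sketch below counts overlapping dipeptides in a list of protein sequences and reports them in per mille. It is only a minimal sketch: the function name `dipeptide_frequencies`, the toy sequences, the one-letter residue codes, and the choice to normalize by the total number of counted dipeptides are all assumptions made for illustration, and the database may count or normalize differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; this is an illustrative choice, not necessarily
    the normalization used by the database.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein separately,
        # so that no dipeptide spans two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences in one-letter code (not from this proteome).
toy_proteins = ["MKVA", "AKVA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Note that the order of residues matters here: "KV" and "VK" are counted as different dipeptides, which is why the table has a separate cell for each ordered pair.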

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
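The arithmetic behind the expected value, and the over/under comparison it implies, can be sketched as follows. The helper names are hypothetical, and the assumption that the uniform expectation (2.5‰ for 20 amino acids) serves as the threshold is an illustrative reading of the text, not a documented rule of the database.

```python
def uniform_expectation_permille(n_residue_types=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_permille(20)  # 400 possible dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 when Xaa is included

# Two values taken from the table: AlaAla (7.75) and TrpTrp (0.072).
print(expected_20, round(expected_21, 3))
print("AlaAla:", classify(7.75, expected_20))
print("TrpTrp:", classify(0.072, expected_20))
```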

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
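For readers who prefer fractions or percentages, the unit conversion is a one-liner. The function names below are hypothetical and the snippet is only a small sketch of the conversion, using the AlaAla value from the table as input.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 7.75  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```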

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.75 ± 0.889
AlaCys: 1.076 ± 0.326
AlaAsp: 5.597 ± 0.539
AlaGlu: 4.521 ± 0.556
AlaPhe: 2.727 ± 0.454
AlaGly: 6.171 ± 0.481
AlaHis: 0.861 ± 0.222
AlaIle: 5.454 ± 0.715
AlaLys: 5.31 ± 0.666
AlaLeu: 5.238 ± 0.84
AlaMet: 2.224 ± 0.353
AlaAsn: 3.086 ± 0.302
AlaPro: 2.224 ± 0.34
AlaGln: 2.799 ± 0.645
AlaArg: 4.234 ± 0.559
AlaSer: 6.171 ± 0.736
AlaThr: 3.947 ± 0.565
AlaVal: 5.023 ± 0.389
AlaTrp: 0.718 ± 0.209
AlaTyr: 2.727 ± 0.419
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.431 ± 0.223
CysCys: 0.0 ± 0.0
CysAsp: 0.789 ± 0.289
CysGlu: 0.574 ± 0.207
CysPhe: 0.646 ± 0.21
CysGly: 0.215 ± 0.113
CysHis: 0.287 ± 0.16
CysIle: 0.574 ± 0.226
CysLys: 1.148 ± 0.333
CysLeu: 1.22 ± 0.31
CysMet: 0.359 ± 0.147
CysAsn: 0.646 ± 0.279
CysPro: 0.431 ± 0.142
CysGln: 0.144 ± 0.087
CysArg: 0.431 ± 0.22
CysSer: 0.431 ± 0.174
CysThr: 0.431 ± 0.199
CysVal: 0.646 ± 0.207
CysTrp: 0.0 ± 0.0
CysTyr: 0.287 ± 0.117
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.741 ± 0.733
AspCys: 0.431 ± 0.198
AspAsp: 3.66 ± 0.488
AspGlu: 3.875 ± 0.709
AspPhe: 2.583 ± 0.429
AspGly: 4.879 ± 0.672
AspHis: 0.789 ± 0.266
AspIle: 4.808 ± 0.658
AspLys: 4.305 ± 0.846
AspLeu: 5.31 ± 0.679
AspMet: 1.866 ± 0.435
AspAsn: 2.727 ± 0.476
AspPro: 2.153 ± 0.418
AspGln: 1.866 ± 0.287
AspArg: 2.799 ± 0.554
AspSer: 4.879 ± 0.596
AspThr: 3.731 ± 0.511
AspVal: 3.516 ± 0.635
AspTrp: 1.076 ± 0.302
AspTyr: 3.301 ± 0.408
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.602 ± 0.74
GluCys: 0.502 ± 0.238
GluAsp: 5.382 ± 0.519
GluGlu: 8.97 ± 1.321
GluPhe: 3.373 ± 0.449
GluGly: 6.099 ± 0.608
GluHis: 1.22 ± 0.292
GluIle: 2.799 ± 0.474
GluLys: 4.449 ± 0.494
GluLeu: 7.319 ± 0.682
GluMet: 2.153 ± 0.391
GluAsn: 2.87 ± 0.388
GluPro: 2.081 ± 0.537
GluGln: 2.296 ± 0.599
GluArg: 3.66 ± 0.647
GluSer: 5.454 ± 0.752
GluThr: 3.229 ± 0.512
GluVal: 6.745 ± 0.615
GluTrp: 1.65 ± 0.285
GluTyr: 3.229 ± 0.621
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.224 ± 0.286
PheCys: 0.287 ± 0.2
PheAsp: 3.444 ± 0.399
PheGlu: 2.296 ± 0.419
PhePhe: 1.148 ± 0.331
PheGly: 1.435 ± 0.351
PheHis: 0.431 ± 0.141
PheIle: 1.722 ± 0.384
PheLys: 3.086 ± 0.372
PheLeu: 2.583 ± 0.459
PheMet: 1.292 ± 0.245
PheAsn: 2.511 ± 0.403
PhePro: 1.579 ± 0.397
PheGln: 0.718 ± 0.165
PheArg: 2.153 ± 0.321
PheSer: 3.157 ± 0.549
PheThr: 2.153 ± 0.471
PheVal: 2.009 ± 0.428
PheTrp: 0.718 ± 0.262
PheTyr: 0.933 ± 0.236
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.808 ± 0.505
GlyCys: 0.718 ± 0.265
GlyAsp: 4.377 ± 0.647
GlyGlu: 5.454 ± 0.646
GlyPhe: 2.44 ± 0.5
GlyGly: 4.951 ± 0.843
GlyHis: 1.363 ± 0.379
GlyIle: 3.803 ± 0.456
GlyLys: 5.956 ± 0.85
GlyLeu: 5.525 ± 0.739
GlyMet: 1.866 ± 0.402
GlyAsn: 2.942 ± 0.337
GlyPro: 0.144 ± 0.091
GlyGln: 2.583 ± 0.551
GlyArg: 3.444 ± 0.599
GlySer: 5.023 ± 0.678
GlyThr: 5.095 ± 0.775
GlyVal: 4.305 ± 0.756
GlyTrp: 0.861 ± 0.225
GlyTyr: 2.799 ± 0.405
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.718 ± 0.255
HisCys: 0.431 ± 0.148
HisAsp: 1.579 ± 0.273
HisGlu: 1.005 ± 0.291
HisPhe: 0.646 ± 0.202
HisGly: 1.076 ± 0.407
HisHis: 0.359 ± 0.161
HisIle: 0.718 ± 0.286
HisLys: 1.579 ± 0.501
HisLeu: 2.224 ± 0.485
HisMet: 0.502 ± 0.218
HisAsn: 0.431 ± 0.181
HisPro: 0.646 ± 0.206
HisGln: 0.646 ± 0.218
HisArg: 0.933 ± 0.255
HisSer: 1.292 ± 0.344
HisThr: 0.933 ± 0.327
HisVal: 0.789 ± 0.239
HisTrp: 0.359 ± 0.148
HisTyr: 0.789 ± 0.239
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.808 ± 0.53
IleCys: 0.431 ± 0.159
IleAsp: 3.947 ± 0.541
IleGlu: 4.951 ± 0.546
IlePhe: 1.292 ± 0.199
IleGly: 3.947 ± 0.472
IleHis: 1.22 ± 0.332
IleIle: 2.583 ± 0.546
IleLys: 5.023 ± 0.653
IleLeu: 3.731 ± 0.55
IleMet: 1.148 ± 0.252
IleAsn: 2.942 ± 0.503
IlePro: 1.937 ± 0.329
IleGln: 1.937 ± 0.319
IleArg: 2.727 ± 0.421
IleSer: 3.803 ± 0.555
IleThr: 3.731 ± 0.441
IleVal: 3.086 ± 0.419
IleTrp: 0.574 ± 0.212
IleTyr: 2.153 ± 0.39
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.96 ± 0.875
LysCys: 0.861 ± 0.31
LysAsp: 5.741 ± 0.535
LysGlu: 8.037 ± 1.082
LysPhe: 1.435 ± 0.317
LysGly: 3.875 ± 0.834
LysHis: 1.435 ± 0.324
LysIle: 2.081 ± 0.341
LysLys: 3.731 ± 0.686
LysLeu: 5.382 ± 0.683
LysMet: 2.224 ± 0.358
LysAsn: 2.296 ± 0.438
LysPro: 2.368 ± 0.519
LysGln: 3.157 ± 0.489
LysArg: 4.377 ± 0.606
LysSer: 3.947 ± 0.555
LysThr: 2.942 ± 0.359
LysVal: 4.879 ± 0.632
LysTrp: 0.718 ± 0.197
LysTyr: 2.655 ± 0.42
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.099 ± 0.696
LeuCys: 0.933 ± 0.295
LeuAsp: 5.095 ± 0.567
LeuGlu: 6.243 ± 0.758
LeuPhe: 2.87 ± 0.478
LeuGly: 5.597 ± 0.611
LeuHis: 1.65 ± 0.468
LeuIle: 4.521 ± 0.422
LeuLys: 5.095 ± 0.67
LeuLeu: 5.382 ± 0.722
LeuMet: 2.44 ± 0.451
LeuAsn: 4.305 ± 0.558
LeuPro: 3.516 ± 0.42
LeuGln: 3.157 ± 0.497
LeuArg: 4.018 ± 0.561
LeuSer: 7.319 ± 0.735
LeuThr: 5.454 ± 0.601
LeuVal: 4.664 ± 0.651
LeuTrp: 0.502 ± 0.165
LeuTyr: 3.086 ± 0.482
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.655 ± 0.424
MetCys: 0.215 ± 0.114
MetAsp: 1.292 ± 0.425
MetGlu: 1.866 ± 0.464
MetPhe: 1.363 ± 0.35
MetGly: 1.866 ± 0.424
MetHis: 0.574 ± 0.208
MetIle: 1.363 ± 0.269
MetLys: 2.009 ± 0.388
MetLeu: 3.014 ± 0.458
MetMet: 0.144 ± 0.083
MetAsn: 1.076 ± 0.267
MetPro: 0.718 ± 0.215
MetGln: 1.076 ± 0.25
MetArg: 1.005 ± 0.31
MetSer: 2.727 ± 0.425
MetThr: 1.866 ± 0.348
MetVal: 1.22 ± 0.25
MetTrp: 0.144 ± 0.097
MetTyr: 1.507 ± 0.284
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.87 ± 0.377
AsnCys: 0.287 ± 0.112
AsnAsp: 1.507 ± 0.371
AsnGlu: 3.301 ± 0.526
AsnPhe: 1.722 ± 0.308
AsnGly: 3.875 ± 0.445
AsnHis: 0.502 ± 0.199
AsnIle: 3.014 ± 0.399
AsnLys: 3.803 ± 0.502
AsnLeu: 4.449 ± 0.577
AsnMet: 1.292 ± 0.273
AsnAsn: 2.87 ± 0.453
AsnPro: 2.296 ± 0.373
AsnGln: 1.866 ± 0.448
AsnArg: 2.44 ± 0.376
AsnSer: 2.942 ± 0.385
AsnThr: 3.157 ± 0.476
AsnVal: 3.086 ± 0.659
AsnTrp: 0.359 ± 0.121
AsnTyr: 2.44 ± 0.503
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.081 ± 0.406
ProCys: 0.287 ± 0.111
ProAsp: 2.511 ± 0.471
ProGlu: 3.229 ± 0.466
ProPhe: 1.22 ± 0.326
ProGly: 0.144 ± 0.087
ProHis: 0.502 ± 0.143
ProIle: 1.65 ± 0.378
ProLys: 1.794 ± 0.398
ProLeu: 2.44 ± 0.33
ProMet: 1.076 ± 0.235
ProAsn: 2.153 ± 0.323
ProPro: 0.718 ± 0.23
ProGln: 0.861 ± 0.238
ProArg: 0.789 ± 0.204
ProSer: 4.018 ± 0.446
ProThr: 2.081 ± 0.34
ProVal: 2.368 ± 0.379
ProTrp: 0.502 ± 0.124
ProTyr: 1.722 ± 0.357
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.87 ± 0.539
GlnCys: 0.359 ± 0.17
GlnAsp: 1.722 ± 0.287
GlnGlu: 3.803 ± 0.554
GlnPhe: 1.435 ± 0.332
GlnGly: 3.014 ± 0.396
GlnHis: 0.933 ± 0.247
GlnIle: 2.368 ± 0.312
GlnLys: 1.937 ± 0.449
GlnLeu: 2.727 ± 0.671
GlnMet: 1.076 ± 0.258
GlnAsn: 1.076 ± 0.282
GlnPro: 0.861 ± 0.209
GlnGln: 1.148 ± 0.352
GlnArg: 1.65 ± 0.281
GlnSer: 2.009 ± 0.328
GlnThr: 1.507 ± 0.32
GlnVal: 2.655 ± 0.397
GlnTrp: 0.574 ± 0.128
GlnTyr: 0.789 ± 0.257
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.731 ± 0.593
ArgCys: 0.646 ± 0.274
ArgAsp: 3.014 ± 0.62
ArgGlu: 4.377 ± 0.548
ArgPhe: 2.368 ± 0.399
ArgGly: 2.87 ± 0.519
ArgHis: 1.076 ± 0.262
ArgIle: 3.373 ± 0.409
ArgLys: 3.875 ± 0.62
ArgLeu: 3.731 ± 0.442
ArgMet: 1.866 ± 0.461
ArgAsn: 2.224 ± 0.373
ArgPro: 1.076 ± 0.248
ArgGln: 1.076 ± 0.276
ArgArg: 2.153 ± 0.416
ArgSer: 3.157 ± 0.547
ArgThr: 2.009 ± 0.395
ArgVal: 3.444 ± 0.584
ArgTrp: 0.789 ± 0.215
ArgTyr: 1.507 ± 0.333
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.377 ± 0.607
SerCys: 0.502 ± 0.186
SerAsp: 4.736 ± 0.624
SerGlu: 5.166 ± 0.589
SerPhe: 2.44 ± 0.44
SerGly: 5.669 ± 0.817
SerHis: 1.22 ± 0.257
SerIle: 4.879 ± 0.709
SerLys: 4.879 ± 0.694
SerLeu: 6.602 ± 0.667
SerMet: 2.153 ± 0.494
SerAsn: 4.449 ± 0.703
SerPro: 2.511 ± 0.36
SerGln: 2.583 ± 0.454
SerArg: 3.731 ± 0.521
SerSer: 5.884 ± 0.941
SerThr: 5.023 ± 0.777
SerVal: 5.166 ± 0.69
SerTrp: 0.646 ± 0.166
SerTyr: 2.942 ± 0.533
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.234 ± 0.715
ThrCys: 0.789 ± 0.229
ThrAsp: 3.229 ± 0.503
ThrGlu: 4.162 ± 0.62
ThrPhe: 2.44 ± 0.342
ThrGly: 4.377 ± 0.602
ThrHis: 1.22 ± 0.284
ThrIle: 4.162 ± 0.53
ThrLys: 2.87 ± 0.493
ThrLeu: 6.171 ± 0.837
ThrMet: 1.148 ± 0.248
ThrAsn: 2.224 ± 0.454
ThrPro: 3.086 ± 0.395
ThrGln: 2.727 ± 0.441
ThrArg: 2.799 ± 0.462
ThrSer: 3.516 ± 0.571
ThrThr: 3.086 ± 0.624
ThrVal: 3.444 ± 0.498
ThrTrp: 0.215 ± 0.164
ThrTyr: 1.866 ± 0.365
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.31 ± 0.615
ValCys: 0.359 ± 0.187
ValAsp: 4.162 ± 0.522
ValGlu: 4.449 ± 0.597
ValPhe: 1.794 ± 0.382
ValGly: 4.521 ± 0.62
ValHis: 0.789 ± 0.224
ValIle: 3.014 ± 0.567
ValLys: 4.521 ± 0.651
ValLeu: 4.808 ± 0.678
ValMet: 1.507 ± 0.313
ValAsn: 4.592 ± 0.483
ValPro: 2.153 ± 0.321
ValGln: 2.655 ± 0.364
ValArg: 3.014 ± 0.54
ValSer: 5.31 ± 0.671
ValThr: 4.592 ± 0.729
ValVal: 4.234 ± 0.679
ValTrp: 0.789 ± 0.217
ValTyr: 2.081 ± 0.378
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.005 ± 0.267
TrpCys: 0.287 ± 0.13
TrpAsp: 0.359 ± 0.147
TrpGlu: 1.292 ± 0.233
TrpPhe: 0.646 ± 0.202
TrpGly: 0.861 ± 0.269
TrpHis: 0.215 ± 0.127
TrpIle: 0.502 ± 0.183
TrpLys: 0.933 ± 0.337
TrpLeu: 0.718 ± 0.287
TrpMet: 0.287 ± 0.132
TrpAsn: 0.574 ± 0.145
TrpPro: 0.359 ± 0.171
TrpGln: 0.215 ± 0.109
TrpArg: 0.646 ± 0.228
TrpSer: 0.861 ± 0.223
TrpThr: 0.431 ± 0.189
TrpVal: 1.076 ± 0.219
TrpTrp: 0.072 ± 0.067
TrpTyr: 0.431 ± 0.181
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.583 ± 0.36
TyrCys: 0.287 ± 0.142
TyrAsp: 2.153 ± 0.38
TyrGlu: 2.511 ± 0.427
TyrPhe: 1.292 ± 0.343
TyrGly: 3.014 ± 0.503
TyrHis: 1.005 ± 0.257
TyrIle: 2.583 ± 0.436
TyrLys: 2.799 ± 0.551
TyrLeu: 3.373 ± 0.286
TyrMet: 1.005 ± 0.211
TyrAsn: 2.081 ± 0.367
TyrPro: 1.292 ± 0.406
TyrGln: 1.005 ± 0.188
TyrArg: 1.435 ± 0.259
TyrSer: 3.66 ± 0.541
TyrThr: 2.368 ± 0.469
TyrVal: 2.296 ± 0.357
TyrTrp: 0.502 ± 0.206
TyrTyr: 1.292 ± 0.285
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (13937 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
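One plausible reading of this procedure is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only a sketch under stated assumptions: the function names, the toy sequences, the one-letter codes, the normalization by total dipeptide count, and the use of the standard deviation as the error measure are all illustrative choices and are not confirmed by the source.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide (assumed normalization:
    total number of overlapping dipeptides in the sample)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code (not from this phage).
toy_proteins = ["MKVA", "AKVA", "KVKV", "MAAA"]
kv_error = bootstrap_error(toy_proteins, "KV")
ww_error = bootstrap_error(toy_proteins, "WW")
print(f"KV: {pair_permille(toy_proteins, 'KV'):.1f} ± {kv_error:.1f}")
```

Resampling at the protein level rather than at the residue level keeps each protein's sequence intact, so the estimate reflects how much the statistic depends on which proteins happen to be in the proteome.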

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski