Amino acid dipepetide frequency for Salinibacter phage M8CR30-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.53AlaAla: 11.53 ± 2.11
0.401AlaCys: 0.401 ± 0.234
6.116AlaAsp: 6.116 ± 0.653
7.921AlaGlu: 7.921 ± 0.948
2.707AlaPhe: 2.707 ± 0.615
9.725AlaGly: 9.725 ± 1.219
1.303AlaHis: 1.303 ± 0.482
4.411AlaIle: 4.411 ± 0.83
3.108AlaLys: 3.108 ± 0.492
10.728AlaLeu: 10.728 ± 1.079
1.905AlaMet: 1.905 ± 0.384
3.108AlaAsn: 3.108 ± 0.59
3.309AlaPro: 3.309 ± 0.703
7.419AlaGln: 7.419 ± 1.146
5.715AlaArg: 5.715 ± 0.835
6.417AlaSer: 6.417 ± 0.785
6.016AlaThr: 6.016 ± 0.726
7.018AlaVal: 7.018 ± 1.123
1.805AlaTrp: 1.805 ± 0.367
3.008AlaTyr: 3.008 ± 0.445
0.0AlaXaa: 0.0 ± 0.0
Cys
0.201CysAla: 0.201 ± 0.14
0.0CysCys: 0.0 ± 0.0
0.401CysAsp: 0.401 ± 0.191
0.802CysGlu: 0.802 ± 0.293
0.201CysPhe: 0.201 ± 0.157
0.301CysGly: 0.301 ± 0.187
0.1CysHis: 0.1 ± 0.1
0.1CysIle: 0.1 ± 0.089
0.1CysLys: 0.1 ± 0.113
0.201CysLeu: 0.201 ± 0.151
0.0CysMet: 0.0 ± 0.0
0.201CysAsn: 0.201 ± 0.154
0.201CysPro: 0.201 ± 0.147
0.1CysGln: 0.1 ± 0.089
0.401CysArg: 0.401 ± 0.203
0.501CysSer: 0.501 ± 0.238
0.501CysThr: 0.501 ± 0.224
0.201CysVal: 0.201 ± 0.152
0.0CysTrp: 0.0 ± 0.0
0.501CysTyr: 0.501 ± 0.253
0.0CysXaa: 0.0 ± 0.0
Asp
7.921AspAla: 7.921 ± 0.868
0.201AspCys: 0.201 ± 0.152
5.214AspAsp: 5.214 ± 0.798
6.517AspGlu: 6.517 ± 0.705
2.306AspPhe: 2.306 ± 0.42
6.316AspGly: 6.316 ± 1.477
1.303AspHis: 1.303 ± 0.373
2.406AspIle: 2.406 ± 0.504
0.602AspLys: 0.602 ± 0.204
8.422AspLeu: 8.422 ± 0.722
0.301AspMet: 0.301 ± 0.13
1.704AspAsn: 1.704 ± 0.358
4.211AspPro: 4.211 ± 0.995
3.409AspGln: 3.409 ± 0.514
6.216AspArg: 6.216 ± 0.769
6.016AspSer: 6.016 ± 0.893
6.417AspThr: 6.417 ± 0.976
6.617AspVal: 6.617 ± 0.568
1.303AspTrp: 1.303 ± 0.36
0.802AspTyr: 0.802 ± 0.287
0.0AspXaa: 0.0 ± 0.0
Glu
9.725GluAla: 9.725 ± 0.81
0.401GluCys: 0.401 ± 0.239
6.316GluAsp: 6.316 ± 0.782
10.126GluGlu: 10.126 ± 1.56
1.303GluPhe: 1.303 ± 0.43
7.72GluGly: 7.72 ± 0.958
1.003GluHis: 1.003 ± 0.34
4.211GluIle: 4.211 ± 0.626
1.303GluLys: 1.303 ± 0.296
9.124GluLeu: 9.124 ± 0.705
1.504GluMet: 1.504 ± 0.474
2.206GluAsn: 2.206 ± 0.464
3.409GluPro: 3.409 ± 0.639
1.905GluGln: 1.905 ± 0.376
6.717GluArg: 6.717 ± 0.866
5.414GluSer: 5.414 ± 0.869
4.813GluThr: 4.813 ± 0.644
6.316GluVal: 6.316 ± 0.791
1.604GluTrp: 1.604 ± 0.409
2.005GluTyr: 2.005 ± 0.624
0.0GluXaa: 0.0 ± 0.0
Phe
2.406PheAla: 2.406 ± 0.623
0.301PheCys: 0.301 ± 0.187
2.306PheAsp: 2.306 ± 0.483
3.208PheGlu: 3.208 ± 0.433
0.702PhePhe: 0.702 ± 0.222
2.406PheGly: 2.406 ± 0.281
0.301PheHis: 0.301 ± 0.171
0.401PheIle: 0.401 ± 0.196
0.902PheLys: 0.902 ± 0.482
1.404PheLeu: 1.404 ± 0.505
0.401PheMet: 0.401 ± 0.204
0.602PheAsn: 0.602 ± 0.202
0.902PhePro: 0.902 ± 0.297
1.203PheGln: 1.203 ± 0.507
2.105PheArg: 2.105 ± 0.64
2.105PheSer: 2.105 ± 0.363
2.206PheThr: 2.206 ± 0.805
1.504PheVal: 1.504 ± 0.56
0.401PheTrp: 0.401 ± 0.175
0.602PheTyr: 0.602 ± 0.278
0.0PheXaa: 0.0 ± 0.0
Gly
9.525GlyAla: 9.525 ± 0.986
0.401GlyCys: 0.401 ± 0.219
8.823GlyAsp: 8.823 ± 1.379
8.121GlyGlu: 8.121 ± 0.882
1.704GlyPhe: 1.704 ± 0.362
7.82GlyGly: 7.82 ± 1.249
1.103GlyHis: 1.103 ± 0.254
2.908GlyIle: 2.908 ± 0.679
2.206GlyLys: 2.206 ± 0.691
7.219GlyLeu: 7.219 ± 0.828
1.203GlyMet: 1.203 ± 0.391
2.707GlyAsn: 2.707 ± 0.875
3.208GlyPro: 3.208 ± 0.568
2.206GlyGln: 2.206 ± 0.493
4.712GlyArg: 4.712 ± 0.696
7.62GlySer: 7.62 ± 1.668
5.915GlyThr: 5.915 ± 1.095
5.715GlyVal: 5.715 ± 0.883
0.702GlyTrp: 0.702 ± 0.273
1.103GlyTyr: 1.103 ± 0.247
0.0GlyXaa: 0.0 ± 0.0
His
1.404HisAla: 1.404 ± 0.448
0.201HisCys: 0.201 ± 0.13
0.902HisAsp: 0.902 ± 0.282
1.203HisGlu: 1.203 ± 0.367
0.301HisPhe: 0.301 ± 0.145
1.003HisGly: 1.003 ± 0.261
0.1HisHis: 0.1 ± 0.089
0.201HisIle: 0.201 ± 0.141
0.501HisLys: 0.501 ± 0.266
1.404HisLeu: 1.404 ± 0.376
0.201HisMet: 0.201 ± 0.122
0.401HisAsn: 0.401 ± 0.213
1.604HisPro: 1.604 ± 0.441
0.201HisGln: 0.201 ± 0.142
0.902HisArg: 0.902 ± 0.285
1.203HisSer: 1.203 ± 0.373
1.203HisThr: 1.203 ± 0.364
2.005HisVal: 2.005 ± 0.459
0.201HisTrp: 0.201 ± 0.126
0.602HisTyr: 0.602 ± 0.245
0.0HisXaa: 0.0 ± 0.0
Ile
3.008IleAla: 3.008 ± 0.577
0.1IleCys: 0.1 ± 0.122
4.512IleAsp: 4.512 ± 0.593
3.609IleGlu: 3.609 ± 0.463
0.401IlePhe: 0.401 ± 0.231
1.805IleGly: 1.805 ± 0.461
0.702IleHis: 0.702 ± 0.276
0.902IleIle: 0.902 ± 0.255
1.003IleLys: 1.003 ± 0.327
2.507IleLeu: 2.507 ± 0.526
0.401IleMet: 0.401 ± 0.188
0.802IleAsn: 0.802 ± 0.302
1.905IlePro: 1.905 ± 0.373
1.404IleGln: 1.404 ± 0.361
2.707IleArg: 2.707 ± 0.624
2.707IleSer: 2.707 ± 0.546
2.707IleThr: 2.707 ± 0.673
1.704IleVal: 1.704 ± 0.423
0.201IleTrp: 0.201 ± 0.15
1.003IleTyr: 1.003 ± 0.233
0.0IleXaa: 0.0 ± 0.0
Lys
3.409LysAla: 3.409 ± 0.72
0.0LysCys: 0.0 ± 0.0
1.905LysAsp: 1.905 ± 0.485
1.003LysGlu: 1.003 ± 0.417
0.802LysPhe: 0.802 ± 0.3
2.105LysGly: 2.105 ± 0.495
0.301LysHis: 0.301 ± 0.187
1.203LysIle: 1.203 ± 0.398
1.003LysLys: 1.003 ± 0.413
1.905LysLeu: 1.905 ± 0.412
1.103LysMet: 1.103 ± 0.388
0.902LysAsn: 0.902 ± 0.235
0.802LysPro: 0.802 ± 0.386
1.203LysGln: 1.203 ± 0.359
3.008LysArg: 3.008 ± 0.751
1.404LysSer: 1.404 ± 0.463
1.905LysThr: 1.905 ± 0.431
0.902LysVal: 0.902 ± 0.314
0.301LysTrp: 0.301 ± 0.178
1.404LysTyr: 1.404 ± 0.373
0.0LysXaa: 0.0 ± 0.0
Leu
9.224LeuAla: 9.224 ± 1.069
0.301LeuCys: 0.301 ± 0.135
7.419LeuAsp: 7.419 ± 1.353
7.82LeuGlu: 7.82 ± 0.738
2.306LeuPhe: 2.306 ± 0.422
7.52LeuGly: 7.52 ± 1.235
1.805LeuHis: 1.805 ± 0.392
1.805LeuIle: 1.805 ± 0.418
2.005LeuLys: 2.005 ± 0.484
6.316LeuLeu: 6.316 ± 0.936
1.103LeuMet: 1.103 ± 0.332
1.404LeuAsn: 1.404 ± 0.353
4.01LeuPro: 4.01 ± 0.491
4.512LeuGln: 4.512 ± 0.506
7.119LeuArg: 7.119 ± 0.837
7.82LeuSer: 7.82 ± 0.73
4.913LeuThr: 4.913 ± 0.677
5.214LeuVal: 5.214 ± 0.514
1.404LeuTrp: 1.404 ± 0.489
2.507LeuTyr: 2.507 ± 0.512
0.0LeuXaa: 0.0 ± 0.0
Met
1.303MetAla: 1.303 ± 0.335
0.0MetCys: 0.0 ± 0.0
1.404MetAsp: 1.404 ± 0.339
1.203MetGlu: 1.203 ± 0.454
0.401MetPhe: 0.401 ± 0.181
0.702MetGly: 0.702 ± 0.272
0.1MetHis: 0.1 ± 0.1
1.103MetIle: 1.103 ± 0.39
0.201MetLys: 0.201 ± 0.16
1.203MetLeu: 1.203 ± 0.405
0.501MetMet: 0.501 ± 0.385
0.501MetAsn: 0.501 ± 0.201
1.504MetPro: 1.504 ± 0.391
0.301MetGln: 0.301 ± 0.166
0.802MetArg: 0.802 ± 0.196
1.404MetSer: 1.404 ± 0.408
1.203MetThr: 1.203 ± 0.343
0.602MetVal: 0.602 ± 0.272
0.1MetTrp: 0.1 ± 0.081
0.1MetTyr: 0.1 ± 0.102
0.0MetXaa: 0.0 ± 0.0
Asn
3.309AsnAla: 3.309 ± 0.512
0.1AsnCys: 0.1 ± 0.098
1.303AsnAsp: 1.303 ± 0.397
1.805AsnGlu: 1.805 ± 0.437
1.504AsnPhe: 1.504 ± 0.548
2.908AsnGly: 2.908 ± 0.782
0.201AsnHis: 0.201 ± 0.112
0.602AsnIle: 0.602 ± 0.193
0.602AsnLys: 0.602 ± 0.299
2.005AsnLeu: 2.005 ± 0.31
0.301AsnMet: 0.301 ± 0.199
0.602AsnAsn: 0.602 ± 0.269
1.303AsnPro: 1.303 ± 0.328
1.003AsnGln: 1.003 ± 0.311
2.406AsnArg: 2.406 ± 0.696
2.406AsnSer: 2.406 ± 0.766
2.206AsnThr: 2.206 ± 1.008
1.303AsnVal: 1.303 ± 0.475
0.201AsnTrp: 0.201 ± 0.123
0.401AsnTyr: 0.401 ± 0.192
0.0AsnXaa: 0.0 ± 0.0
Pro
3.208ProAla: 3.208 ± 0.589
0.301ProCys: 0.301 ± 0.172
5.314ProAsp: 5.314 ± 1.05
1.905ProGlu: 1.905 ± 0.522
1.203ProPhe: 1.203 ± 0.347
4.01ProGly: 4.01 ± 0.593
1.003ProHis: 1.003 ± 0.22
0.702ProIle: 0.702 ± 0.243
1.504ProLys: 1.504 ± 0.457
3.509ProLeu: 3.509 ± 0.813
0.602ProMet: 0.602 ± 0.242
1.604ProAsn: 1.604 ± 0.312
1.604ProPro: 1.604 ± 0.545
2.707ProGln: 2.707 ± 0.382
2.406ProArg: 2.406 ± 0.645
4.311ProSer: 4.311 ± 0.805
3.309ProThr: 3.309 ± 0.402
2.807ProVal: 2.807 ± 0.545
0.602ProTrp: 0.602 ± 0.273
1.303ProTyr: 1.303 ± 0.336
0.0ProXaa: 0.0 ± 0.0
Gln
4.913GlnAla: 4.913 ± 0.882
0.201GlnCys: 0.201 ± 0.207
2.807GlnAsp: 2.807 ± 0.355
4.411GlnGlu: 4.411 ± 0.6
0.802GlnPhe: 0.802 ± 0.247
4.111GlnGly: 4.111 ± 0.574
1.103GlnHis: 1.103 ± 0.293
2.807GlnIle: 2.807 ± 0.427
1.103GlnLys: 1.103 ± 0.326
4.01GlnLeu: 4.01 ± 0.563
0.802GlnMet: 0.802 ± 0.302
1.704GlnAsn: 1.704 ± 0.428
1.704GlnPro: 1.704 ± 0.497
1.404GlnGln: 1.404 ± 0.322
2.807GlnArg: 2.807 ± 0.347
3.409GlnSer: 3.409 ± 0.655
2.607GlnThr: 2.607 ± 0.422
2.306GlnVal: 2.306 ± 0.367
0.401GlnTrp: 0.401 ± 0.203
0.602GlnTyr: 0.602 ± 0.196
0.0GlnXaa: 0.0 ± 0.0
Arg
6.216ArgAla: 6.216 ± 1.141
0.902ArgCys: 0.902 ± 0.36
4.01ArgAsp: 4.01 ± 0.863
6.818ArgGlu: 6.818 ± 1.397
2.607ArgPhe: 2.607 ± 0.682
5.514ArgGly: 5.514 ± 0.675
0.902ArgHis: 0.902 ± 0.318
2.206ArgIle: 2.206 ± 0.409
2.206ArgLys: 2.206 ± 0.702
6.517ArgLeu: 6.517 ± 1.012
1.003ArgMet: 1.003 ± 0.523
1.704ArgAsn: 1.704 ± 0.496
2.206ArgPro: 2.206 ± 0.612
2.707ArgGln: 2.707 ± 0.369
4.411ArgArg: 4.411 ± 0.988
5.013ArgSer: 5.013 ± 0.609
3.509ArgThr: 3.509 ± 0.676
6.918ArgVal: 6.918 ± 0.804
0.702ArgTrp: 0.702 ± 0.229
2.406ArgTyr: 2.406 ± 0.526
0.0ArgXaa: 0.0 ± 0.0
Ser
8.923SerAla: 8.923 ± 1.096
0.501SerCys: 0.501 ± 0.209
5.414SerAsp: 5.414 ± 0.733
4.411SerGlu: 4.411 ± 0.552
2.306SerPhe: 2.306 ± 0.399
8.823SerGly: 8.823 ± 1.34
1.203SerHis: 1.203 ± 0.365
2.105SerIle: 2.105 ± 0.389
2.607SerLys: 2.607 ± 0.541
6.116SerLeu: 6.116 ± 0.793
0.902SerMet: 0.902 ± 0.249
1.905SerAsn: 1.905 ± 0.67
4.411SerPro: 4.411 ± 0.613
3.609SerGln: 3.609 ± 0.73
4.712SerArg: 4.712 ± 0.611
6.016SerSer: 6.016 ± 1.045
5.514SerThr: 5.514 ± 1.033
3.609SerVal: 3.609 ± 0.435
0.702SerTrp: 0.702 ± 0.311
2.206SerTyr: 2.206 ± 0.381
0.0SerXaa: 0.0 ± 0.0
Thr
5.314ThrAla: 5.314 ± 0.675
0.0ThrCys: 0.0 ± 0.0
3.91ThrAsp: 3.91 ± 0.508
5.615ThrGlu: 5.615 ± 0.801
2.206ThrPhe: 2.206 ± 0.504
5.615ThrGly: 5.615 ± 0.778
0.902ThrHis: 0.902 ± 0.265
3.609ThrIle: 3.609 ± 0.589
1.604ThrLys: 1.604 ± 0.339
4.813ThrLeu: 4.813 ± 1.074
1.103ThrMet: 1.103 ± 0.324
1.504ThrAsn: 1.504 ± 0.422
3.609ThrPro: 3.609 ± 0.659
3.108ThrGln: 3.108 ± 0.612
4.111ThrArg: 4.111 ± 0.764
5.414ThrSer: 5.414 ± 0.989
5.715ThrThr: 5.715 ± 1.345
7.319ThrVal: 7.319 ± 2.646
0.902ThrTrp: 0.902 ± 0.308
2.306ThrTyr: 2.306 ± 0.405
0.0ThrXaa: 0.0 ± 0.0
Val
8.622ValAla: 8.622 ± 0.757
0.401ValCys: 0.401 ± 0.207
6.016ValAsp: 6.016 ± 0.85
6.717ValGlu: 6.717 ± 0.751
1.704ValPhe: 1.704 ± 0.454
4.311ValGly: 4.311 ± 0.603
1.303ValHis: 1.303 ± 0.376
1.303ValIle: 1.303 ± 0.398
1.905ValLys: 1.905 ± 0.651
6.116ValLeu: 6.116 ± 0.474
0.902ValMet: 0.902 ± 0.269
1.905ValAsn: 1.905 ± 0.449
3.108ValPro: 3.108 ± 0.486
4.211ValGln: 4.211 ± 0.679
5.013ValArg: 5.013 ± 0.744
3.91ValSer: 3.91 ± 0.611
5.113ValThr: 5.113 ± 0.978
6.216ValVal: 6.216 ± 0.591
1.303ValTrp: 1.303 ± 0.265
1.103ValTyr: 1.103 ± 0.392
0.0ValXaa: 0.0 ± 0.0
Trp
0.802TrpAla: 0.802 ± 0.279
0.201TrpCys: 0.201 ± 0.102
1.203TrpAsp: 1.203 ± 0.306
1.504TrpGlu: 1.504 ± 0.393
0.501TrpPhe: 0.501 ± 0.259
1.303TrpGly: 1.303 ± 0.254
0.201TrpHis: 0.201 ± 0.169
0.602TrpIle: 0.602 ± 0.217
1.103TrpLys: 1.103 ± 0.334
0.602TrpLeu: 0.602 ± 0.212
0.301TrpMet: 0.301 ± 0.175
0.602TrpAsn: 0.602 ± 0.19
0.501TrpPro: 0.501 ± 0.23
0.301TrpGln: 0.301 ± 0.328
0.802TrpArg: 0.802 ± 0.294
1.203TrpSer: 1.203 ± 0.319
0.902TrpThr: 0.902 ± 0.248
0.501TrpVal: 0.501 ± 0.232
0.201TrpTrp: 0.201 ± 0.102
0.1TrpTyr: 0.1 ± 0.112
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.707TyrAla: 2.707 ± 0.504
0.1TyrCys: 0.1 ± 0.109
2.908TyrAsp: 2.908 ± 0.678
2.406TyrGlu: 2.406 ± 0.614
0.401TyrPhe: 0.401 ± 0.256
0.902TyrGly: 0.902 ± 0.214
0.802TyrHis: 0.802 ± 0.271
0.501TyrIle: 0.501 ± 0.233
1.003TyrLys: 1.003 ± 0.355
2.406TyrLeu: 2.406 ± 0.463
0.0TyrMet: 0.0 ± 0.0
0.401TyrAsn: 0.401 ± 0.207
0.401TyrPro: 0.401 ± 0.177
1.103TyrGln: 1.103 ± 0.277
1.504TyrArg: 1.504 ± 0.461
1.704TyrSer: 1.704 ± 0.373
2.005TyrThr: 2.005 ± 0.389
2.406TyrVal: 2.406 ± 0.455
0.401TyrTrp: 0.401 ± 0.207
0.902TyrTyr: 0.902 ± 0.275
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (9975 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski