Amino acid dipeptide frequency for Esparto virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
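As an illustration of what such a count involves, here is a minimal sketch in Python. It assumes overlapping windows (residues i and i+1 form one dipeptide), one-letter sequence codes, and pooling over all proteins with normalization by the total number of dipeptides; the page does not state the exact procedure or denominator it uses, so the function name `dipeptide_per_mille` and these choices are hypothetical.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Assumption: overlapping windows, normalized by the total dipeptide count.
    """
    counts = Counter()
    for seq in sequences:
        # Residues i and i+1 form one dipeptide; windows never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real Esparto virus sequences).
toy = ["MKNNS", "NNT"]
freqs = dipeptide_per_mille(toy)
```

With the toy input, "NN" accounts for 2 of the 6 dipeptides, and the frequencies of all observed dipeptides sum to 1000‰.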

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
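The uniform expectation and the over/under comparison can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the page does not say whether its red/blue marking uses the 20-letter (400 pairs) or 21-letter (441 pairs) expectation, so the alphabet size is left as a parameter.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one ordered pair, in per mille."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

# Values taken from the table: AsnAsn = 14.635, TrpTrp = 0.073 (per mille).
asn_asn = classify(14.635)
trp_trp = classify(0.073)
```

With 20 amino acids the expectation is 2.5‰, so AsnAsn is overrepresented and TrpTrp is underrepresented under this simple model.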

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
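As a quick worked example of the unit conversion (the function names are hypothetical), a table value in per mille can be turned into a fraction or a percentage like this:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# AsnAsn from the table: 14.635 per mille.
fraction = per_mille_to_fraction(14.635)
percent = per_mille_to_percent(14.635)
```

For AsnAsn this gives a fraction of about 0.014635, i.e. about 1.4635%.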

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.741AlaAla: 2.741 ± 0.292
0.808AlaCys: 0.808 ± 0.136
2.08AlaAsp: 2.08 ± 0.238
3.426AlaGlu: 3.426 ± 1.43
1.542AlaPhe: 1.542 ± 0.228
0.93AlaGly: 0.93 ± 0.19
0.881AlaHis: 0.881 ± 0.205
3.842AlaIle: 3.842 ± 0.258
2.717AlaLys: 2.717 ± 0.313
3.157AlaLeu: 3.157 ± 0.321
1.419AlaMet: 1.419 ± 0.166
3.769AlaAsn: 3.769 ± 0.31
1.126AlaPro: 1.126 ± 0.186
1.493AlaGln: 1.493 ± 0.293
1.37AlaArg: 1.37 ± 0.245
2.619AlaSer: 2.619 ± 0.244
3.035AlaThr: 3.035 ± 0.257
1.493AlaVal: 1.493 ± 0.195
0.318AlaTrp: 0.318 ± 0.082
2.007AlaTyr: 2.007 ± 0.248
0.0AlaXaa: 0.0 ± 0.0
Cys
0.734CysAla: 0.734 ± 0.143
0.318CysCys: 0.318 ± 0.111
1.101CysAsp: 1.101 ± 0.191
0.783CysGlu: 0.783 ± 0.165
0.734CysPhe: 0.734 ± 0.151
0.587CysGly: 0.587 ± 0.113
0.269CysHis: 0.269 ± 0.073
1.395CysIle: 1.395 ± 0.187
0.881CysLys: 0.881 ± 0.192
1.273CysLeu: 1.273 ± 0.206
0.563CysMet: 0.563 ± 0.127
1.566CysAsn: 1.566 ± 0.218
0.538CysPro: 0.538 ± 0.134
0.587CysGln: 0.587 ± 0.114
1.028CysArg: 1.028 ± 0.153
1.101CysSer: 1.101 ± 0.157
1.199CysThr: 1.199 ± 0.208
0.954CysVal: 0.954 ± 0.147
0.049CysTrp: 0.049 ± 0.037
0.514CysTyr: 0.514 ± 0.123
0.0CysXaa: 0.0 ± 0.0
Asp
2.619AspAla: 2.619 ± 0.244
1.077AspCys: 1.077 ± 0.143
12.432AspAsp: 12.432 ± 1.582
4.332AspGlu: 4.332 ± 0.472
2.3AspPhe: 2.3 ± 0.233
2.545AspGly: 2.545 ± 0.257
1.175AspHis: 1.175 ± 0.175
6.534AspIle: 6.534 ± 0.451
2.57AspLys: 2.57 ± 0.265
4.772AspLeu: 4.772 ± 0.389
1.175AspMet: 1.175 ± 0.168
5.604AspAsn: 5.604 ± 0.586
1.762AspPro: 1.762 ± 0.272
1.738AspGln: 1.738 ± 0.222
2.398AspArg: 2.398 ± 0.338
4.258AspSer: 4.258 ± 0.351
4.699AspThr: 4.699 ± 0.405
2.937AspVal: 2.937 ± 0.268
0.196AspTrp: 0.196 ± 0.069
2.717AspTyr: 2.717 ± 0.248
0.0AspXaa: 0.0 ± 0.0
Glu
1.811GluAla: 1.811 ± 0.482
0.734GluCys: 0.734 ± 0.153
2.007GluAsp: 2.007 ± 0.291
2.447GluGlu: 2.447 ± 0.367
2.374GluPhe: 2.374 ± 0.26
0.881GluGly: 0.881 ± 0.129
1.395GluHis: 1.395 ± 0.174
3.867GluIle: 3.867 ± 0.561
3.989GluLys: 3.989 ± 0.658
4.356GluLeu: 4.356 ± 0.559
1.346GluMet: 1.346 ± 0.176
4.968GluAsn: 4.968 ± 0.452
1.835GluPro: 1.835 ± 0.188
3.402GluGln: 3.402 ± 0.397
2.619GluArg: 2.619 ± 1.029
3.598GluSer: 3.598 ± 0.288
3.084GluThr: 3.084 ± 0.28
1.322GluVal: 1.322 ± 0.194
0.245GluTrp: 0.245 ± 0.07
2.937GluTyr: 2.937 ± 0.283
0.0GluXaa: 0.0 ± 0.0
Phe
1.37PheAla: 1.37 ± 0.195
0.636PheCys: 0.636 ± 0.119
2.912PheAsp: 2.912 ± 0.252
2.129PheGlu: 2.129 ± 0.28
0.636PhePhe: 0.636 ± 0.122
1.86PheGly: 1.86 ± 0.251
0.685PheHis: 0.685 ± 0.116
3.304PheIle: 3.304 ± 0.332
2.692PheLys: 2.692 ± 0.348
2.325PheLeu: 2.325 ± 0.249
1.297PheMet: 1.297 ± 0.189
3.94PheAsn: 3.94 ± 0.361
1.175PhePro: 1.175 ± 0.171
1.101PheGln: 1.101 ± 0.192
1.052PheArg: 1.052 ± 0.188
2.545PheSer: 2.545 ± 0.263
2.105PheThr: 2.105 ± 0.22
2.3PheVal: 2.3 ± 0.203
0.245PheTrp: 0.245 ± 0.076
1.517PheTyr: 1.517 ± 0.166
0.0PheXaa: 0.0 ± 0.0
Gly
1.175GlyAla: 1.175 ± 0.195
0.587GlyCys: 0.587 ± 0.129
1.738GlyAsp: 1.738 ± 0.224
1.273GlyGlu: 1.273 ± 0.197
1.322GlyPhe: 1.322 ± 0.149
1.419GlyGly: 1.419 ± 0.197
0.563GlyHis: 0.563 ± 0.111
2.986GlyIle: 2.986 ± 0.305
2.3GlyLys: 2.3 ± 0.252
2.178GlyLeu: 2.178 ± 0.219
0.636GlyMet: 0.636 ± 0.111
2.398GlyAsn: 2.398 ± 0.231
0.734GlyPro: 0.734 ± 0.149
0.734GlyGln: 0.734 ± 0.146
1.126GlyArg: 1.126 ± 0.164
1.835GlySer: 1.835 ± 0.257
1.419GlyThr: 1.419 ± 0.19
1.689GlyVal: 1.689 ± 0.248
0.245GlyTrp: 0.245 ± 0.086
1.493GlyTyr: 1.493 ± 0.213
0.0GlyXaa: 0.0 ± 0.0
His
0.759HisAla: 0.759 ± 0.165
0.318HisCys: 0.318 ± 0.091
1.517HisAsp: 1.517 ± 0.21
1.224HisGlu: 1.224 ± 0.179
0.906HisPhe: 0.906 ± 0.145
1.077HisGly: 1.077 ± 0.167
1.419HisHis: 1.419 ± 0.53
1.811HisIle: 1.811 ± 0.236
1.86HisLys: 1.86 ± 0.38
2.154HisLeu: 2.154 ± 0.249
0.734HisMet: 0.734 ± 0.143
1.884HisAsn: 1.884 ± 0.285
0.587HisPro: 0.587 ± 0.11
0.954HisGln: 0.954 ± 0.213
1.419HisArg: 1.419 ± 0.309
1.003HisSer: 1.003 ± 0.149
1.297HisThr: 1.297 ± 0.176
1.003HisVal: 1.003 ± 0.148
0.098HisTrp: 0.098 ± 0.052
0.93HisTyr: 0.93 ± 0.154
0.0HisXaa: 0.0 ± 0.0
Ile
4.307IleAla: 4.307 ± 0.249
1.419IleCys: 1.419 ± 0.233
7.122IleAsp: 7.122 ± 0.431
5.262IleGlu: 5.262 ± 0.537
2.717IlePhe: 2.717 ± 0.277
2.472IleGly: 2.472 ± 0.284
2.496IleHis: 2.496 ± 0.231
7.734IleIle: 7.734 ± 0.473
5.139IleLys: 5.139 ± 0.355
7.929IleLeu: 7.929 ± 0.552
2.3IleMet: 2.3 ± 0.217
8.761IleAsn: 8.761 ± 0.525
4.772IlePro: 4.772 ± 0.431
3.133IleGln: 3.133 ± 0.348
3.182IleArg: 3.182 ± 0.322
6.804IleSer: 6.804 ± 0.454
7.122IleThr: 7.122 ± 0.501
4.405IleVal: 4.405 ± 0.344
0.489IleTrp: 0.489 ± 0.107
3.94IleTyr: 3.94 ± 0.313
0.0IleXaa: 0.0 ± 0.0
Lys
2.79LysAla: 2.79 ± 0.982
1.517LysCys: 1.517 ± 0.184
1.958LysAsp: 1.958 ± 0.236
2.839LysGlu: 2.839 ± 1.122
2.765LysPhe: 2.765 ± 0.256
0.906LysGly: 0.906 ± 0.139
1.958LysHis: 1.958 ± 0.36
6.167LysIle: 6.167 ± 0.348
4.821LysLys: 4.821 ± 0.606
6.094LysLeu: 6.094 ± 0.44
1.444LysMet: 1.444 ± 0.206
6.436LysAsn: 6.436 ± 0.527
2.888LysPro: 2.888 ± 0.371
2.398LysGln: 2.398 ± 0.3
2.814LysArg: 2.814 ± 0.256
6.559LysSer: 6.559 ± 0.525
5.115LysThr: 5.115 ± 0.352
2.741LysVal: 2.741 ± 0.292
0.294LysTrp: 0.294 ± 0.095
4.454LysTyr: 4.454 ± 0.395
0.0LysXaa: 0.0 ± 0.0
Leu
3.475LeuAla: 3.475 ± 0.289
1.224LeuCys: 1.224 ± 0.179
4.43LeuAsp: 4.43 ± 0.367
3.695LeuGlu: 3.695 ± 0.478
3.206LeuPhe: 3.206 ± 0.313
2.031LeuGly: 2.031 ± 0.221
1.517LeuHis: 1.517 ± 0.244
5.678LeuIle: 5.678 ± 0.468
5.776LeuLys: 5.776 ± 0.386
7.317LeuLeu: 7.317 ± 0.4
2.031LeuMet: 2.031 ± 0.236
6.852LeuAsn: 6.852 ± 0.408
4.821LeuPro: 4.821 ± 0.521
3.206LeuGln: 3.206 ± 0.304
3.402LeuArg: 3.402 ± 0.363
6.485LeuSer: 6.485 ± 0.507
5.262LeuThr: 5.262 ± 0.374
4.111LeuVal: 4.111 ± 0.256
0.294LeuTrp: 0.294 ± 0.068
3.72LeuTyr: 3.72 ± 0.319
0.0LeuXaa: 0.0 ± 0.0
Met
1.738MetAla: 1.738 ± 0.224
0.343MetCys: 0.343 ± 0.087
1.517MetAsp: 1.517 ± 0.199
1.101MetGlu: 1.101 ± 0.155
0.857MetPhe: 0.857 ± 0.164
0.587MetGly: 0.587 ± 0.154
0.808MetHis: 0.808 ± 0.148
1.909MetIle: 1.909 ± 0.247
1.615MetLys: 1.615 ± 0.187
2.423MetLeu: 2.423 ± 0.244
0.734MetMet: 0.734 ± 0.137
2.203MetAsn: 2.203 ± 0.2
1.077MetPro: 1.077 ± 0.187
0.734MetGln: 0.734 ± 0.132
1.003MetArg: 1.003 ± 0.174
2.643MetSer: 2.643 ± 0.256
1.517MetThr: 1.517 ± 0.193
1.15MetVal: 1.15 ± 0.17
0.171MetTrp: 0.171 ± 0.059
1.395MetTyr: 1.395 ± 0.203
0.0MetXaa: 0.0 ± 0.0
Asn
3.989AsnAla: 3.989 ± 0.296
1.346AsnCys: 1.346 ± 0.204
8.199AsnAsp: 8.199 ± 0.536
4.797AsnGlu: 4.797 ± 0.504
2.961AsnPhe: 2.961 ± 0.273
3.304AsnGly: 3.304 ± 0.254
2.056AsnHis: 2.056 ± 0.356
10.866AsnIle: 10.866 ± 0.547
5.213AsnLys: 5.213 ± 0.511
5.482AsnLeu: 5.482 ± 0.378
2.276AsnMet: 2.276 ± 0.243
14.635AsnAsn: 14.635 ± 1.443
2.496AsnPro: 2.496 ± 0.257
3.671AsnGln: 3.671 ± 0.329
2.912AsnArg: 2.912 ± 0.272
7.073AsnSer: 7.073 ± 0.607
8.419AsnThr: 8.419 ± 0.735
5.384AsnVal: 5.384 ± 0.456
0.343AsnTrp: 0.343 ± 0.092
3.157AsnTyr: 3.157 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
1.077ProAla: 1.077 ± 0.179
0.489ProCys: 0.489 ± 0.122
1.738ProAsp: 1.738 ± 0.21
1.762ProGlu: 1.762 ± 0.209
1.273ProPhe: 1.273 ± 0.167
0.881ProGly: 0.881 ± 0.148
0.612ProHis: 0.612 ± 0.107
4.821ProIle: 4.821 ± 0.455
3.206ProLys: 3.206 ± 0.267
3.598ProLeu: 3.598 ± 0.383
1.199ProMet: 1.199 ± 0.205
3.157ProAsn: 3.157 ± 0.284
3.133ProPro: 3.133 ± 0.697
2.325ProGln: 2.325 ± 0.292
1.126ProArg: 1.126 ± 0.169
3.769ProSer: 3.769 ± 0.571
3.695ProThr: 3.695 ± 0.491
1.664ProVal: 1.664 ± 0.177
0.147ProTrp: 0.147 ± 0.066
1.468ProTyr: 1.468 ± 0.202
0.0ProXaa: 0.0 ± 0.0
Gln
1.346GlnAla: 1.346 ± 0.221
0.587GlnCys: 0.587 ± 0.137
1.322GlnAsp: 1.322 ± 0.232
1.37GlnGlu: 1.37 ± 0.285
1.762GlnPhe: 1.762 ± 0.178
0.392GlnGly: 0.392 ± 0.102
0.979GlnHis: 0.979 ± 0.169
3.108GlnIle: 3.108 ± 0.481
2.496GlnLys: 2.496 ± 0.26
3.622GlnLeu: 3.622 ± 0.336
1.37GlnMet: 1.37 ± 0.202
3.377GlnAsn: 3.377 ± 0.338
2.08GlnPro: 2.08 ± 0.329
4.846GlnGln: 4.846 ± 0.736
1.493GlnArg: 1.493 ± 0.199
4.087GlnSer: 4.087 ± 0.39
2.545GlnThr: 2.545 ± 0.312
1.248GlnVal: 1.248 ± 0.235
0.245GlnTrp: 0.245 ± 0.075
2.423GlnTyr: 2.423 ± 0.246
0.0GlnXaa: 0.0 ± 0.0
Arg
1.248ArgAla: 1.248 ± 0.174
0.881ArgCys: 0.881 ± 0.16
1.835ArgAsp: 1.835 ± 0.219
1.591ArgGlu: 1.591 ± 0.214
1.933ArgPhe: 1.933 ± 0.218
0.832ArgGly: 0.832 ± 0.135
1.199ArgHis: 1.199 ± 0.213
3.451ArgIle: 3.451 ± 0.288
4.576ArgLys: 4.576 ± 1.155
2.472ArgLeu: 2.472 ± 0.257
0.612ArgMet: 0.612 ± 0.108
3.524ArgAsn: 3.524 ± 0.493
1.689ArgPro: 1.689 ± 0.224
1.738ArgGln: 1.738 ± 0.235
2.765ArgArg: 2.765 ± 0.507
2.545ArgSer: 2.545 ± 0.407
2.056ArgThr: 2.056 ± 0.226
1.468ArgVal: 1.468 ± 0.171
0.196ArgTrp: 0.196 ± 0.074
1.738ArgTyr: 1.738 ± 0.264
0.0ArgXaa: 0.0 ± 0.0
Ser
2.961SerAla: 2.961 ± 0.328
1.37SerCys: 1.37 ± 0.195
4.601SerAsp: 4.601 ± 0.435
3.5SerGlu: 3.5 ± 0.292
2.154SerPhe: 2.154 ± 0.182
1.689SerGly: 1.689 ± 0.233
1.175SerHis: 1.175 ± 0.163
7.538SerIle: 7.538 ± 0.462
5.702SerLys: 5.702 ± 0.432
6.632SerLeu: 6.632 ± 0.568
1.933SerMet: 1.933 ± 0.223
7.954SerAsn: 7.954 ± 0.54
3.769SerPro: 3.769 ± 0.507
2.545SerGln: 2.545 ± 0.397
3.279SerArg: 3.279 ± 0.387
12.922SerSer: 12.922 ± 1.976
9.202SerThr: 9.202 ± 0.93
2.521SerVal: 2.521 ± 0.24
0.22SerTrp: 0.22 ± 0.071
2.814SerTyr: 2.814 ± 0.27
0.0SerXaa: 0.0 ± 0.0
Thr
2.496ThrAla: 2.496 ± 0.27
1.273ThrCys: 1.273 ± 0.206
3.867ThrAsp: 3.867 ± 0.398
2.937ThrGlu: 2.937 ± 0.316
2.692ThrPhe: 2.692 ± 0.291
1.909ThrGly: 1.909 ± 0.294
1.664ThrHis: 1.664 ± 0.199
8.541ThrIle: 8.541 ± 0.67
4.944ThrLys: 4.944 ± 0.427
5.653ThrLeu: 5.653 ± 0.409
2.08ThrMet: 2.08 ± 0.239
8.321ThrAsn: 8.321 ± 0.518
3.23ThrPro: 3.23 ± 0.352
2.398ThrGln: 2.398 ± 0.365
2.129ThrArg: 2.129 ± 0.254
7.782ThrSer: 7.782 ± 0.528
12.481ThrThr: 12.481 ± 1.968
2.888ThrVal: 2.888 ± 0.275
0.22ThrTrp: 0.22 ± 0.084
2.839ThrTyr: 2.839 ± 0.271
0.0ThrXaa: 0.0 ± 0.0
Val
2.08ValAla: 2.08 ± 0.255
0.661ValCys: 0.661 ± 0.14
3.304ValAsp: 3.304 ± 0.3
2.349ValGlu: 2.349 ± 0.283
1.811ValPhe: 1.811 ± 0.199
1.762ValGly: 1.762 ± 0.278
1.028ValHis: 1.028 ± 0.157
3.451ValIle: 3.451 ± 0.34
3.206ValLys: 3.206 ± 0.331
3.402ValLeu: 3.402 ± 0.36
1.077ValMet: 1.077 ± 0.137
3.573ValAsn: 3.573 ± 0.28
1.835ValPro: 1.835 ± 0.247
1.835ValGln: 1.835 ± 0.184
1.273ValArg: 1.273 ± 0.171
3.5ValSer: 3.5 ± 0.304
2.227ValThr: 2.227 ± 0.252
2.423ValVal: 2.423 ± 0.26
0.294ValTrp: 0.294 ± 0.085
2.349ValTyr: 2.349 ± 0.292
0.0ValXaa: 0.0 ± 0.0
Trp
0.171TrpAla: 0.171 ± 0.058
0.073TrpCys: 0.073 ± 0.04
0.294TrpAsp: 0.294 ± 0.079
0.171TrpGlu: 0.171 ± 0.061
0.367TrpPhe: 0.367 ± 0.102
0.147TrpGly: 0.147 ± 0.069
0.098TrpHis: 0.098 ± 0.043
0.416TrpIle: 0.416 ± 0.11
0.318TrpLys: 0.318 ± 0.095
0.343TrpLeu: 0.343 ± 0.09
0.073TrpMet: 0.073 ± 0.05
0.367TrpAsn: 0.367 ± 0.11
0.294TrpPro: 0.294 ± 0.095
0.171TrpGln: 0.171 ± 0.055
0.294TrpArg: 0.294 ± 0.088
0.318TrpSer: 0.318 ± 0.092
0.147TrpThr: 0.147 ± 0.065
0.22TrpVal: 0.22 ± 0.07
0.073TrpTrp: 0.073 ± 0.039
0.343TrpTyr: 0.343 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.884TyrAla: 1.884 ± 0.234
0.514TyrCys: 0.514 ± 0.124
3.867TyrAsp: 3.867 ± 0.343
2.227TyrGlu: 2.227 ± 0.23
1.517TyrPhe: 1.517 ± 0.22
1.64TyrGly: 1.64 ± 0.207
0.979TyrHis: 0.979 ± 0.183
4.087TyrIle: 4.087 ± 0.357
2.961TyrLys: 2.961 ± 0.24
3.353TyrLeu: 3.353 ± 0.339
1.199TyrMet: 1.199 ± 0.185
5.286TyrAsn: 5.286 ± 0.402
1.297TyrPro: 1.297 ± 0.21
1.689TyrGln: 1.689 ± 0.215
1.664TyrArg: 1.664 ± 0.202
3.035TyrSer: 3.035 ± 0.381
3.72TyrThr: 3.72 ± 0.325
1.517TyrVal: 1.517 ± 0.18
0.318TyrTrp: 0.318 ± 0.086
2.349TyrTyr: 2.349 ± 0.374
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (40862 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
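A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates; the page does not spell out these details, so treat the function names and these choices as assumptions.

```python
import random
import statistics

def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide in per mille, pooled over sequences."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (assumed procedure).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy input (not real Esparto virus sequences).
toy = ["MKNNS", "NNT", "MSTTK", "ANNNL"]
err = bootstrap_error(toy, "NN")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the variation between proteins is what drives the error estimate.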

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski