Amino acid dipepetide frequency for Yellowstone lake phycodnavirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.094AlaAla: 6.094 ± 0.791
1.215AlaCys: 1.215 ± 0.177
2.879AlaAsp: 2.879 ± 0.224
3.533AlaGlu: 3.533 ± 0.319
2.748AlaPhe: 2.748 ± 0.211
4.673AlaGly: 4.673 ± 0.338
1.402AlaHis: 1.402 ± 0.161
3.552AlaIle: 3.552 ± 0.269
4.318AlaLys: 4.318 ± 0.423
6.15AlaLeu: 6.15 ± 0.288
1.851AlaMet: 1.851 ± 0.19
5.047AlaAsn: 5.047 ± 0.993
3.72AlaPro: 3.72 ± 0.322
2.561AlaGln: 2.561 ± 0.209
3.552AlaArg: 3.552 ± 0.35
4.879AlaSer: 4.879 ± 0.436
4.15AlaThr: 4.15 ± 0.334
4.785AlaVal: 4.785 ± 0.353
1.178AlaTrp: 1.178 ± 0.166
2.0AlaTyr: 2.0 ± 0.177
0.0AlaXaa: 0.0 ± 0.0
Cys
1.047CysAla: 1.047 ± 0.127
0.336CysCys: 0.336 ± 0.09
0.991CysAsp: 0.991 ± 0.141
0.972CysGlu: 0.972 ± 0.151
0.542CysPhe: 0.542 ± 0.106
1.065CysGly: 1.065 ± 0.163
0.393CysHis: 0.393 ± 0.108
0.673CysIle: 0.673 ± 0.118
1.047CysLys: 1.047 ± 0.147
1.178CysLeu: 1.178 ± 0.169
0.43CysMet: 0.43 ± 0.096
0.43CysAsn: 0.43 ± 0.082
1.065CysPro: 1.065 ± 0.176
0.654CysGln: 0.654 ± 0.109
0.991CysArg: 0.991 ± 0.156
0.86CysSer: 0.86 ± 0.131
1.009CysThr: 1.009 ± 0.148
1.009CysVal: 1.009 ± 0.192
0.206CysTrp: 0.206 ± 0.059
0.505CysTyr: 0.505 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
3.346AspAla: 3.346 ± 0.234
0.804AspCys: 0.804 ± 0.121
2.692AspAsp: 2.692 ± 0.288
3.009AspGlu: 3.009 ± 0.285
2.318AspPhe: 2.318 ± 0.206
3.327AspGly: 3.327 ± 0.301
0.804AspHis: 0.804 ± 0.127
3.196AspIle: 3.196 ± 0.244
2.561AspLys: 2.561 ± 0.237
4.561AspLeu: 4.561 ± 0.258
1.196AspMet: 1.196 ± 0.175
1.589AspAsn: 1.589 ± 0.147
2.879AspPro: 2.879 ± 0.225
1.664AspGln: 1.664 ± 0.157
2.355AspArg: 2.355 ± 0.235
2.729AspSer: 2.729 ± 0.227
2.935AspThr: 2.935 ± 0.238
3.757AspVal: 3.757 ± 0.239
0.897AspTrp: 0.897 ± 0.124
1.869AspTyr: 1.869 ± 0.211
0.0AspXaa: 0.0 ± 0.0
Glu
3.383GluAla: 3.383 ± 0.266
1.009GluCys: 1.009 ± 0.146
2.841GluAsp: 2.841 ± 0.273
3.626GluGlu: 3.626 ± 0.353
3.066GluPhe: 3.066 ± 0.254
2.654GluGly: 2.654 ± 0.267
1.159GluHis: 1.159 ± 0.147
3.29GluIle: 3.29 ± 0.311
3.552GluLys: 3.552 ± 0.299
4.542GluLeu: 4.542 ± 0.317
1.514GluMet: 1.514 ± 0.193
2.785GluAsn: 2.785 ± 0.263
2.617GluPro: 2.617 ± 0.238
1.626GluGln: 1.626 ± 0.211
2.785GluArg: 2.785 ± 0.258
2.654GluSer: 2.654 ± 0.253
3.29GluThr: 3.29 ± 0.282
3.309GluVal: 3.309 ± 0.277
0.916GluTrp: 0.916 ± 0.128
2.206GluTyr: 2.206 ± 0.225
0.0GluXaa: 0.0 ± 0.0
Phe
3.084PheAla: 3.084 ± 0.295
0.841PheCys: 0.841 ± 0.108
2.916PheAsp: 2.916 ± 0.285
2.486PheGlu: 2.486 ± 0.238
2.0PhePhe: 2.0 ± 0.191
2.561PheGly: 2.561 ± 0.197
1.009PheHis: 1.009 ± 0.142
2.299PheIle: 2.299 ± 0.214
2.841PheLys: 2.841 ± 0.276
3.178PheLeu: 3.178 ± 0.252
1.701PheMet: 1.701 ± 0.226
2.654PheAsn: 2.654 ± 0.265
2.0PhePro: 2.0 ± 0.211
1.925PheGln: 1.925 ± 0.22
1.963PheArg: 1.963 ± 0.213
3.327PheSer: 3.327 ± 0.223
3.178PheThr: 3.178 ± 0.269
3.608PheVal: 3.608 ± 0.248
0.748PheTrp: 0.748 ± 0.124
1.832PheTyr: 1.832 ± 0.19
0.0PheXaa: 0.0 ± 0.0
Gly
4.748GlyAla: 4.748 ± 0.442
0.785GlyCys: 0.785 ± 0.138
2.879GlyAsp: 2.879 ± 0.232
2.43GlyGlu: 2.43 ± 0.246
3.309GlyPhe: 3.309 ± 0.204
5.383GlyGly: 5.383 ± 0.72
1.29GlyHis: 1.29 ± 0.176
3.495GlyIle: 3.495 ± 0.334
3.477GlyLys: 3.477 ± 0.335
5.253GlyLeu: 5.253 ± 0.308
1.477GlyMet: 1.477 ± 0.166
3.309GlyAsn: 3.309 ± 0.314
3.252GlyPro: 3.252 ± 0.247
2.467GlyGln: 2.467 ± 0.218
3.533GlyArg: 3.533 ± 0.329
5.29GlySer: 5.29 ± 0.477
5.496GlyThr: 5.496 ± 0.581
4.467GlyVal: 4.467 ± 0.268
1.065GlyTrp: 1.065 ± 0.131
2.692GlyTyr: 2.692 ± 0.286
0.0GlyXaa: 0.0 ± 0.0
His
1.327HisAla: 1.327 ± 0.153
0.318HisCys: 0.318 ± 0.091
0.841HisAsp: 0.841 ± 0.13
1.065HisGlu: 1.065 ± 0.157
0.972HisPhe: 0.972 ± 0.125
1.346HisGly: 1.346 ± 0.179
0.542HisHis: 0.542 ± 0.095
1.009HisIle: 1.009 ± 0.169
1.178HisLys: 1.178 ± 0.169
2.224HisLeu: 2.224 ± 0.23
0.748HisMet: 0.748 ± 0.118
0.785HisAsn: 0.785 ± 0.123
1.103HisPro: 1.103 ± 0.146
0.673HisGln: 0.673 ± 0.103
1.346HisArg: 1.346 ± 0.173
0.916HisSer: 0.916 ± 0.129
1.14HisThr: 1.14 ± 0.156
1.981HisVal: 1.981 ± 0.247
0.28HisTrp: 0.28 ± 0.062
0.748HisTyr: 0.748 ± 0.125
0.0HisXaa: 0.0 ± 0.0
Ile
3.645IleAla: 3.645 ± 0.312
0.729IleCys: 0.729 ± 0.131
3.047IleAsp: 3.047 ± 0.233
3.215IleGlu: 3.215 ± 0.253
2.486IlePhe: 2.486 ± 0.199
3.383IleGly: 3.383 ± 0.316
1.458IleHis: 1.458 ± 0.196
2.411IleIle: 2.411 ± 0.246
3.589IleLys: 3.589 ± 0.309
4.804IleLeu: 4.804 ± 0.377
1.327IleMet: 1.327 ± 0.151
2.841IleAsn: 2.841 ± 0.263
2.71IlePro: 2.71 ± 0.299
2.879IleGln: 2.879 ± 0.307
3.365IleArg: 3.365 ± 0.33
3.776IleSer: 3.776 ± 0.33
3.851IleThr: 3.851 ± 0.468
3.252IleVal: 3.252 ± 0.241
0.654IleTrp: 0.654 ± 0.121
2.112IleTyr: 2.112 ± 0.236
0.0IleXaa: 0.0 ± 0.0
Lys
5.215LysAla: 5.215 ± 0.565
0.972LysCys: 0.972 ± 0.151
3.14LysAsp: 3.14 ± 0.272
3.439LysGlu: 3.439 ± 0.333
2.86LysPhe: 2.86 ± 0.253
3.495LysGly: 3.495 ± 0.279
1.252LysHis: 1.252 ± 0.16
3.757LysIle: 3.757 ± 0.334
5.159LysLys: 5.159 ± 0.476
5.196LysLeu: 5.196 ± 0.366
2.037LysMet: 2.037 ± 0.221
3.402LysAsn: 3.402 ± 0.356
2.86LysPro: 2.86 ± 0.258
1.907LysGln: 1.907 ± 0.209
3.552LysArg: 3.552 ± 0.269
3.72LysSer: 3.72 ± 0.367
3.832LysThr: 3.832 ± 0.285
4.299LysVal: 4.299 ± 0.361
0.935LysTrp: 0.935 ± 0.131
2.355LysTyr: 2.355 ± 0.199
0.0LysXaa: 0.0 ± 0.0
Leu
6.281LeuAla: 6.281 ± 0.361
1.252LeuCys: 1.252 ± 0.192
4.467LeuAsp: 4.467 ± 0.312
5.309LeuGlu: 5.309 ± 0.365
3.552LeuPhe: 3.552 ± 0.292
4.729LeuGly: 4.729 ± 0.32
1.664LeuHis: 1.664 ± 0.191
3.514LeuIle: 3.514 ± 0.225
5.832LeuLys: 5.832 ± 0.424
6.542LeuLeu: 6.542 ± 0.522
2.224LeuMet: 2.224 ± 0.201
5.159LeuAsn: 5.159 ± 0.511
3.533LeuPro: 3.533 ± 0.276
2.561LeuGln: 2.561 ± 0.201
4.43LeuArg: 4.43 ± 0.319
5.271LeuSer: 5.271 ± 0.397
5.832LeuThr: 5.832 ± 0.439
6.879LeuVal: 6.879 ± 0.368
1.196LeuTrp: 1.196 ± 0.153
3.327LeuTyr: 3.327 ± 0.342
0.0LeuXaa: 0.0 ± 0.0
Met
1.944MetAla: 1.944 ± 0.186
0.598MetCys: 0.598 ± 0.114
1.514MetAsp: 1.514 ± 0.166
1.383MetGlu: 1.383 ± 0.183
1.346MetPhe: 1.346 ± 0.176
1.682MetGly: 1.682 ± 0.176
0.598MetHis: 0.598 ± 0.117
1.888MetIle: 1.888 ± 0.193
1.645MetLys: 1.645 ± 0.189
1.682MetLeu: 1.682 ± 0.164
0.598MetMet: 0.598 ± 0.121
1.645MetAsn: 1.645 ± 0.184
1.047MetPro: 1.047 ± 0.144
0.785MetGln: 0.785 ± 0.129
1.383MetArg: 1.383 ± 0.186
2.187MetSer: 2.187 ± 0.176
2.056MetThr: 2.056 ± 0.17
1.533MetVal: 1.533 ± 0.151
0.505MetTrp: 0.505 ± 0.097
1.271MetTyr: 1.271 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
3.626AsnAla: 3.626 ± 0.451
0.617AsnCys: 0.617 ± 0.114
1.832AsnAsp: 1.832 ± 0.172
2.206AsnGlu: 2.206 ± 0.203
3.028AsnPhe: 3.028 ± 0.236
3.701AsnGly: 3.701 ± 0.345
0.785AsnHis: 0.785 ± 0.116
3.701AsnIle: 3.701 ± 0.672
2.953AsnLys: 2.953 ± 0.307
5.776AsnLeu: 5.776 ± 0.499
1.402AsnMet: 1.402 ± 0.176
3.103AsnAsn: 3.103 ± 0.331
2.748AsnPro: 2.748 ± 0.266
1.776AsnGln: 1.776 ± 0.205
2.561AsnArg: 2.561 ± 0.283
3.776AsnSer: 3.776 ± 0.295
3.309AsnThr: 3.309 ± 0.314
4.654AsnVal: 4.654 ± 0.848
0.86AsnTrp: 0.86 ± 0.113
1.869AsnTyr: 1.869 ± 0.174
0.0AsnXaa: 0.0 ± 0.0
Pro
2.748ProAla: 2.748 ± 0.25
0.785ProCys: 0.785 ± 0.129
2.523ProAsp: 2.523 ± 0.228
3.29ProGlu: 3.29 ± 0.311
1.981ProPhe: 1.981 ± 0.217
3.701ProGly: 3.701 ± 0.278
1.122ProHis: 1.122 ± 0.154
2.449ProIle: 2.449 ± 0.204
3.327ProLys: 3.327 ± 0.311
3.495ProLeu: 3.495 ± 0.298
1.551ProMet: 1.551 ± 0.16
1.944ProAsn: 1.944 ± 0.194
2.673ProPro: 2.673 ± 0.268
1.57ProGln: 1.57 ± 0.159
2.299ProArg: 2.299 ± 0.238
3.589ProSer: 3.589 ± 0.36
2.916ProThr: 2.916 ± 0.229
4.281ProVal: 4.281 ± 0.336
0.766ProTrp: 0.766 ± 0.13
1.701ProTyr: 1.701 ± 0.181
0.0ProXaa: 0.0 ± 0.0
Gln
2.299GlnAla: 2.299 ± 0.256
0.617GlnCys: 0.617 ± 0.115
1.981GlnAsp: 1.981 ± 0.186
1.757GlnGlu: 1.757 ± 0.178
1.589GlnPhe: 1.589 ± 0.144
2.075GlnGly: 2.075 ± 0.215
0.617GlnHis: 0.617 ± 0.12
2.224GlnIle: 2.224 ± 0.202
2.299GlnLys: 2.299 ± 0.248
3.196GlnLeu: 3.196 ± 0.263
1.009GlnMet: 1.009 ± 0.141
2.15GlnAsn: 2.15 ± 0.167
1.57GlnPro: 1.57 ± 0.185
1.29GlnGln: 1.29 ± 0.181
1.869GlnArg: 1.869 ± 0.21
2.019GlnSer: 2.019 ± 0.269
2.318GlnThr: 2.318 ± 0.258
2.486GlnVal: 2.486 ± 0.229
0.598GlnTrp: 0.598 ± 0.094
1.327GlnTyr: 1.327 ± 0.146
0.0GlnXaa: 0.0 ± 0.0
Arg
3.346ArgAla: 3.346 ± 0.347
0.766ArgCys: 0.766 ± 0.132
2.748ArgAsp: 2.748 ± 0.243
3.009ArgGlu: 3.009 ± 0.267
2.393ArgPhe: 2.393 ± 0.212
3.383ArgGly: 3.383 ± 0.262
1.178ArgHis: 1.178 ± 0.152
3.196ArgIle: 3.196 ± 0.244
3.365ArgLys: 3.365 ± 0.334
4.841ArgLeu: 4.841 ± 0.331
1.608ArgMet: 1.608 ± 0.205
2.486ArgAsn: 2.486 ± 0.237
2.187ArgPro: 2.187 ± 0.238
1.738ArgGln: 1.738 ± 0.203
3.72ArgArg: 3.72 ± 0.368
2.972ArgSer: 2.972 ± 0.255
2.766ArgThr: 2.766 ± 0.26
3.757ArgVal: 3.757 ± 0.287
0.766ArgTrp: 0.766 ± 0.129
1.963ArgTyr: 1.963 ± 0.207
0.0ArgXaa: 0.0 ± 0.0
Ser
4.841SerAla: 4.841 ± 0.388
0.785SerCys: 0.785 ± 0.118
2.729SerAsp: 2.729 ± 0.226
3.159SerGlu: 3.159 ± 0.269
2.935SerPhe: 2.935 ± 0.264
5.29SerGly: 5.29 ± 0.52
1.159SerHis: 1.159 ± 0.156
3.925SerIle: 3.925 ± 0.328
4.337SerLys: 4.337 ± 0.394
5.327SerLeu: 5.327 ± 0.31
1.608SerMet: 1.608 ± 0.153
4.561SerAsn: 4.561 ± 0.498
2.785SerPro: 2.785 ± 0.4
2.131SerGln: 2.131 ± 0.206
3.234SerArg: 3.234 ± 0.213
5.421SerSer: 5.421 ± 0.467
4.785SerThr: 4.785 ± 0.453
4.598SerVal: 4.598 ± 0.364
1.084SerTrp: 1.084 ± 0.143
2.206SerTyr: 2.206 ± 0.238
0.0SerXaa: 0.0 ± 0.0
Thr
5.066ThrAla: 5.066 ± 0.578
1.009ThrCys: 1.009 ± 0.144
3.066ThrAsp: 3.066 ± 0.261
2.748ThrGlu: 2.748 ± 0.233
2.897ThrPhe: 2.897 ± 0.239
5.608ThrGly: 5.608 ± 0.522
1.608ThrHis: 1.608 ± 0.172
3.851ThrIle: 3.851 ± 0.299
3.795ThrLys: 3.795 ± 0.317
5.963ThrLeu: 5.963 ± 0.486
1.327ThrMet: 1.327 ± 0.134
3.57ThrAsn: 3.57 ± 0.288
3.57ThrPro: 3.57 ± 0.296
2.449ThrGln: 2.449 ± 0.366
3.365ThrArg: 3.365 ± 0.254
4.841ThrSer: 4.841 ± 0.539
5.234ThrThr: 5.234 ± 0.576
4.337ThrVal: 4.337 ± 0.341
1.196ThrTrp: 1.196 ± 0.148
2.337ThrTyr: 2.337 ± 0.249
0.0ThrXaa: 0.0 ± 0.0
Val
4.916ValAla: 4.916 ± 0.305
0.953ValCys: 0.953 ± 0.132
3.028ValAsp: 3.028 ± 0.299
3.495ValGlu: 3.495 ± 0.317
3.178ValPhe: 3.178 ± 0.22
4.224ValGly: 4.224 ± 0.392
1.421ValHis: 1.421 ± 0.215
4.094ValIle: 4.094 ± 0.273
4.467ValLys: 4.467 ± 0.302
5.439ValLeu: 5.439 ± 0.358
1.794ValMet: 1.794 ± 0.181
3.72ValAsn: 3.72 ± 0.223
4.168ValPro: 4.168 ± 0.298
2.692ValGln: 2.692 ± 0.239
3.813ValArg: 3.813 ± 0.309
5.439ValSer: 5.439 ± 0.561
6.243ValThr: 6.243 ± 0.669
5.178ValVal: 5.178 ± 0.389
1.122ValTrp: 1.122 ± 0.152
2.897ValTyr: 2.897 ± 0.281
0.0ValXaa: 0.0 ± 0.0
Trp
1.084TrpAla: 1.084 ± 0.149
0.374TrpCys: 0.374 ± 0.084
0.654TrpAsp: 0.654 ± 0.098
0.729TrpGlu: 0.729 ± 0.103
0.916TrpPhe: 0.916 ± 0.148
1.009TrpGly: 1.009 ± 0.132
0.374TrpHis: 0.374 ± 0.09
1.065TrpIle: 1.065 ± 0.132
1.14TrpLys: 1.14 ± 0.165
1.009TrpLeu: 1.009 ± 0.129
0.467TrpMet: 0.467 ± 0.104
0.991TrpAsn: 0.991 ± 0.139
0.822TrpPro: 0.822 ± 0.137
0.523TrpGln: 0.523 ± 0.085
0.841TrpArg: 0.841 ± 0.138
0.841TrpSer: 0.841 ± 0.126
1.028TrpThr: 1.028 ± 0.119
1.065TrpVal: 1.065 ± 0.164
0.093TrpTrp: 0.093 ± 0.051
0.542TrpTyr: 0.542 ± 0.103
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.243TyrAla: 2.243 ± 0.184
0.636TyrCys: 0.636 ± 0.112
1.72TyrAsp: 1.72 ± 0.181
1.963TyrGlu: 1.963 ± 0.192
2.037TyrPhe: 2.037 ± 0.184
2.804TyrGly: 2.804 ± 0.271
0.71TyrHis: 0.71 ± 0.107
1.944TyrIle: 1.944 ± 0.207
2.523TyrLys: 2.523 ± 0.257
3.196TyrLeu: 3.196 ± 0.252
1.402TyrMet: 1.402 ± 0.156
2.112TyrAsn: 2.112 ± 0.188
1.402TyrPro: 1.402 ± 0.174
1.421TyrGln: 1.421 ± 0.144
1.234TyrArg: 1.234 ± 0.162
2.393TyrSer: 2.393 ± 0.246
2.467TyrThr: 2.467 ± 0.286
3.047TyrVal: 3.047 ± 0.254
0.505TyrTrp: 0.505 ± 0.1
1.626TyrTyr: 1.626 ± 0.21
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 225 proteins (53499 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski