Amino acid dipepetide frequency for Yellowstone lake phycodnavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.64AlaAla: 6.64 ± 0.656
0.969AlaCys: 0.969 ± 0.131
3.047AlaAsp: 3.047 ± 0.258
3.434AlaGlu: 3.434 ± 0.304
3.364AlaPhe: 3.364 ± 0.433
5.407AlaGly: 5.407 ± 0.575
1.215AlaHis: 1.215 ± 0.156
4.333AlaIle: 4.333 ± 0.273
4.315AlaLys: 4.315 ± 0.405
6.552AlaLeu: 6.552 ± 0.437
1.796AlaMet: 1.796 ± 0.164
4.386AlaAsn: 4.386 ± 0.516
4.245AlaPro: 4.245 ± 0.444
2.924AlaGln: 2.924 ± 0.281
4.35AlaArg: 4.35 ± 0.409
4.614AlaSer: 4.614 ± 0.428
4.579AlaThr: 4.579 ± 0.494
4.896AlaVal: 4.896 ± 0.34
1.074AlaTrp: 1.074 ± 0.166
2.483AlaTyr: 2.483 ± 0.232
0.0AlaXaa: 0.0 ± 0.0
Cys
1.162CysAla: 1.162 ± 0.148
0.317CysCys: 0.317 ± 0.082
0.881CysAsp: 0.881 ± 0.11
0.81CysGlu: 0.81 ± 0.147
0.599CysPhe: 0.599 ± 0.105
1.162CysGly: 1.162 ± 0.183
0.211CysHis: 0.211 ± 0.06
0.951CysIle: 0.951 ± 0.134
1.022CysLys: 1.022 ± 0.191
1.215CysLeu: 1.215 ± 0.139
0.423CysMet: 0.423 ± 0.092
0.564CysAsn: 0.564 ± 0.102
1.215CysPro: 1.215 ± 0.208
0.74CysGln: 0.74 ± 0.133
1.004CysArg: 1.004 ± 0.154
0.969CysSer: 0.969 ± 0.173
0.845CysThr: 0.845 ± 0.13
0.986CysVal: 0.986 ± 0.169
0.247CysTrp: 0.247 ± 0.082
0.493CysTyr: 0.493 ± 0.092
0.0CysXaa: 0.0 ± 0.0
Asp
3.487AspAla: 3.487 ± 0.293
0.669AspCys: 0.669 ± 0.103
2.977AspAsp: 2.977 ± 0.332
3.857AspGlu: 3.857 ± 0.363
2.712AspPhe: 2.712 ± 0.196
3.751AspGly: 3.751 ± 0.272
0.828AspHis: 0.828 ± 0.12
3.399AspIle: 3.399 ± 0.241
2.36AspLys: 2.36 ± 0.232
4.474AspLeu: 4.474 ± 0.335
1.286AspMet: 1.286 ± 0.183
1.673AspAsn: 1.673 ± 0.18
2.959AspPro: 2.959 ± 0.262
1.479AspGln: 1.479 ± 0.149
2.29AspArg: 2.29 ± 0.224
1.761AspSer: 1.761 ± 0.205
2.977AspThr: 2.977 ± 0.257
3.487AspVal: 3.487 ± 0.235
0.722AspTrp: 0.722 ± 0.108
1.849AspTyr: 1.849 ± 0.202
0.0AspXaa: 0.0 ± 0.0
Glu
3.434GluAla: 3.434 ± 0.275
1.198GluCys: 1.198 ± 0.146
2.607GluAsp: 2.607 ± 0.298
3.434GluGlu: 3.434 ± 0.419
2.941GluPhe: 2.941 ± 0.255
2.712GluGly: 2.712 ± 0.242
1.198GluHis: 1.198 ± 0.151
3.751GluIle: 3.751 ± 0.247
3.223GluLys: 3.223 ± 0.297
4.491GluLeu: 4.491 ± 0.329
1.303GluMet: 1.303 ± 0.16
3.065GluAsn: 3.065 ± 0.238
2.29GluPro: 2.29 ± 0.215
1.832GluGln: 1.832 ± 0.206
2.536GluArg: 2.536 ± 0.254
1.532GluSer: 1.532 ± 0.188
3.082GluThr: 3.082 ± 0.272
3.241GluVal: 3.241 ± 0.313
0.775GluTrp: 0.775 ± 0.122
2.202GluTyr: 2.202 ± 0.233
0.0GluXaa: 0.0 ± 0.0
Phe
3.029PheAla: 3.029 ± 0.261
0.898PheCys: 0.898 ± 0.153
2.571PheAsp: 2.571 ± 0.244
2.378PheGlu: 2.378 ± 0.238
1.832PhePhe: 1.832 ± 0.252
3.382PheGly: 3.382 ± 0.379
0.986PheHis: 0.986 ± 0.159
2.765PheIle: 2.765 ± 0.251
2.395PheLys: 2.395 ± 0.235
3.364PheLeu: 3.364 ± 0.261
1.427PheMet: 1.427 ± 0.191
2.836PheAsn: 2.836 ± 0.221
1.673PhePro: 1.673 ± 0.155
1.532PheGln: 1.532 ± 0.186
1.867PheArg: 1.867 ± 0.184
3.065PheSer: 3.065 ± 0.219
3.399PheThr: 3.399 ± 0.272
3.17PheVal: 3.17 ± 0.258
0.564PheTrp: 0.564 ± 0.103
1.796PheTyr: 1.796 ± 0.219
0.0PheXaa: 0.0 ± 0.0
Gly
5.389GlyAla: 5.389 ± 0.579
0.986GlyCys: 0.986 ± 0.15
3.029GlyAsp: 3.029 ± 0.26
2.536GlyGlu: 2.536 ± 0.21
3.082GlyPhe: 3.082 ± 0.231
8.789GlyGly: 8.789 ± 1.596
1.339GlyHis: 1.339 ± 0.129
3.54GlyIle: 3.54 ± 0.277
3.646GlyLys: 3.646 ± 0.279
5.354GlyLeu: 5.354 ± 0.323
1.515GlyMet: 1.515 ± 0.179
4.967GlyAsn: 4.967 ± 0.676
3.522GlyPro: 3.522 ± 0.32
2.501GlyGln: 2.501 ± 0.292
3.47GlyArg: 3.47 ± 0.269
4.967GlySer: 4.967 ± 0.512
5.601GlyThr: 5.601 ± 0.62
4.421GlyVal: 4.421 ± 0.298
0.986GlyTrp: 0.986 ± 0.151
2.431GlyTyr: 2.431 ± 0.211
0.0GlyXaa: 0.0 ± 0.0
His
1.127HisAla: 1.127 ± 0.136
0.247HisCys: 0.247 ± 0.056
1.039HisAsp: 1.039 ± 0.145
1.127HisGlu: 1.127 ± 0.188
0.933HisPhe: 0.933 ± 0.123
1.004HisGly: 1.004 ± 0.139
0.546HisHis: 0.546 ± 0.091
1.057HisIle: 1.057 ± 0.172
1.162HisLys: 1.162 ± 0.152
1.356HisLeu: 1.356 ± 0.161
0.74HisMet: 0.74 ± 0.121
0.757HisAsn: 0.757 ± 0.124
0.986HisPro: 0.986 ± 0.143
0.722HisGln: 0.722 ± 0.113
0.863HisArg: 0.863 ± 0.122
0.81HisSer: 0.81 ± 0.115
1.127HisThr: 1.127 ± 0.146
1.585HisVal: 1.585 ± 0.194
0.335HisTrp: 0.335 ± 0.078
0.476HisTyr: 0.476 ± 0.102
0.0HisXaa: 0.0 ± 0.0
Ile
3.681IleAla: 3.681 ± 0.284
0.74IleCys: 0.74 ± 0.107
3.505IleAsp: 3.505 ± 0.262
3.364IleGlu: 3.364 ± 0.275
2.536IlePhe: 2.536 ± 0.247
3.487IleGly: 3.487 ± 0.27
1.233IleHis: 1.233 ± 0.151
3.822IleIle: 3.822 ± 0.282
4.227IleLys: 4.227 ± 0.317
4.791IleLeu: 4.791 ± 0.333
1.427IleMet: 1.427 ± 0.155
3.294IleAsn: 3.294 ± 0.25
3.153IlePro: 3.153 ± 0.239
2.994IleGln: 2.994 ± 0.29
3.153IleArg: 3.153 ± 0.267
4.544IleSer: 4.544 ± 0.434
3.963IleThr: 3.963 ± 0.357
3.364IleVal: 3.364 ± 0.247
0.898IleTrp: 0.898 ± 0.129
2.378IleTyr: 2.378 ± 0.262
0.0IleXaa: 0.0 ± 0.0
Lys
4.121LysAla: 4.121 ± 0.42
0.969LysCys: 0.969 ± 0.13
3.117LysAsp: 3.117 ± 0.291
2.977LysGlu: 2.977 ± 0.305
3.241LysPhe: 3.241 ± 0.275
3.117LysGly: 3.117 ± 0.281
1.057LysHis: 1.057 ± 0.145
3.716LysIle: 3.716 ± 0.3
4.597LysLys: 4.597 ± 0.366
5.037LysLeu: 5.037 ± 0.359
2.113LysMet: 2.113 ± 0.198
4.068LysAsn: 4.068 ± 0.358
2.677LysPro: 2.677 ± 0.269
1.744LysGln: 1.744 ± 0.173
3.188LysArg: 3.188 ± 0.325
4.051LysSer: 4.051 ± 0.323
3.928LysThr: 3.928 ± 0.296
4.033LysVal: 4.033 ± 0.326
0.986LysTrp: 0.986 ± 0.143
2.413LysTyr: 2.413 ± 0.223
0.0LysXaa: 0.0 ± 0.0
Leu
6.446LeuAla: 6.446 ± 0.386
1.18LeuCys: 1.18 ± 0.173
3.98LeuAsp: 3.98 ± 0.303
4.333LeuGlu: 4.333 ± 0.325
2.941LeuPhe: 2.941 ± 0.275
5.037LeuGly: 5.037 ± 0.34
1.585LeuHis: 1.585 ± 0.203
4.755LeuIle: 4.755 ± 0.303
5.354LeuLys: 5.354 ± 0.482
6.728LeuLeu: 6.728 ± 0.526
2.025LeuMet: 2.025 ± 0.19
4.632LeuAsn: 4.632 ± 0.422
4.068LeuPro: 4.068 ± 0.334
2.659LeuGln: 2.659 ± 0.247
4.368LeuArg: 4.368 ± 0.307
5.777LeuSer: 5.777 ± 0.588
5.425LeuThr: 5.425 ± 0.462
5.513LeuVal: 5.513 ± 0.279
1.039LeuTrp: 1.039 ± 0.14
2.853LeuTyr: 2.853 ± 0.229
0.0LeuXaa: 0.0 ± 0.0
Met
2.378MetAla: 2.378 ± 0.243
0.652MetCys: 0.652 ± 0.113
1.444MetAsp: 1.444 ± 0.158
1.391MetGlu: 1.391 ± 0.155
1.303MetPhe: 1.303 ± 0.176
1.568MetGly: 1.568 ± 0.157
0.423MetHis: 0.423 ± 0.087
1.339MetIle: 1.339 ± 0.169
1.673MetLys: 1.673 ± 0.204
1.55MetLeu: 1.55 ± 0.186
0.828MetMet: 0.828 ± 0.172
1.832MetAsn: 1.832 ± 0.215
1.039MetPro: 1.039 ± 0.143
0.704MetGln: 0.704 ± 0.121
1.162MetArg: 1.162 ± 0.162
2.131MetSer: 2.131 ± 0.162
1.867MetThr: 1.867 ± 0.162
1.427MetVal: 1.427 ± 0.157
0.387MetTrp: 0.387 ± 0.081
1.145MetTyr: 1.145 ± 0.158
0.0MetXaa: 0.0 ± 0.0
Asn
5.108AsnAla: 5.108 ± 0.67
0.775AsnCys: 0.775 ± 0.142
1.814AsnAsp: 1.814 ± 0.172
2.061AsnGlu: 2.061 ± 0.202
3.17AsnPhe: 3.17 ± 0.204
4.685AsnGly: 4.685 ± 0.308
0.775AsnHis: 0.775 ± 0.11
4.526AsnIle: 4.526 ± 0.65
3.065AsnLys: 3.065 ± 0.341
6.217AsnLeu: 6.217 ± 0.673
1.55AsnMet: 1.55 ± 0.167
3.276AsnAsn: 3.276 ± 0.363
2.624AsnPro: 2.624 ± 0.221
2.149AsnGln: 2.149 ± 0.221
2.519AsnArg: 2.519 ± 0.252
4.509AsnSer: 4.509 ± 0.348
3.699AsnThr: 3.699 ± 0.603
4.068AsnVal: 4.068 ± 0.498
0.845AsnTrp: 0.845 ± 0.15
2.166AsnTyr: 2.166 ± 0.197
0.0AsnXaa: 0.0 ± 0.0
Pro
3.716ProAla: 3.716 ± 0.344
0.828ProCys: 0.828 ± 0.121
2.871ProAsp: 2.871 ± 0.266
3.223ProGlu: 3.223 ± 0.324
1.955ProPhe: 1.955 ± 0.216
3.875ProGly: 3.875 ± 0.394
0.757ProHis: 0.757 ± 0.128
2.483ProIle: 2.483 ± 0.193
3.029ProLys: 3.029 ± 0.366
3.399ProLeu: 3.399 ± 0.268
1.391ProMet: 1.391 ± 0.179
2.413ProAsn: 2.413 ± 0.261
3.522ProPro: 3.522 ± 0.434
1.708ProGln: 1.708 ± 0.209
2.237ProArg: 2.237 ± 0.293
4.209ProSer: 4.209 ± 0.382
3.223ProThr: 3.223 ± 0.343
3.963ProVal: 3.963 ± 0.296
0.669ProTrp: 0.669 ± 0.105
1.568ProTyr: 1.568 ± 0.216
0.0ProXaa: 0.0 ± 0.0
Gln
2.8GlnAla: 2.8 ± 0.507
0.387GlnCys: 0.387 ± 0.078
1.374GlnAsp: 1.374 ± 0.141
1.744GlnGlu: 1.744 ± 0.171
1.673GlnPhe: 1.673 ± 0.188
2.501GlnGly: 2.501 ± 0.323
0.528GlnHis: 0.528 ± 0.088
1.973GlnIle: 1.973 ± 0.186
2.202GlnLys: 2.202 ± 0.213
2.659GlnLeu: 2.659 ± 0.23
0.986GlnMet: 0.986 ± 0.176
2.254GlnAsn: 2.254 ± 0.201
1.832GlnPro: 1.832 ± 0.227
1.268GlnGln: 1.268 ± 0.181
1.779GlnArg: 1.779 ± 0.166
1.973GlnSer: 1.973 ± 0.215
2.695GlnThr: 2.695 ± 0.245
2.888GlnVal: 2.888 ± 0.225
0.616GlnTrp: 0.616 ± 0.096
1.515GlnTyr: 1.515 ± 0.191
0.0GlnXaa: 0.0 ± 0.0
Arg
4.121ArgAla: 4.121 ± 0.415
0.828ArgCys: 0.828 ± 0.131
2.325ArgAsp: 2.325 ± 0.238
2.536ArgGlu: 2.536 ± 0.244
2.061ArgPhe: 2.061 ± 0.218
2.941ArgGly: 2.941 ± 0.235
1.233ArgHis: 1.233 ± 0.163
3.223ArgIle: 3.223 ± 0.268
3.188ArgLys: 3.188 ± 0.271
4.192ArgLeu: 4.192 ± 0.379
1.18ArgMet: 1.18 ± 0.162
2.871ArgAsn: 2.871 ± 0.301
2.378ArgPro: 2.378 ± 0.288
1.673ArgGln: 1.673 ± 0.182
3.487ArgArg: 3.487 ± 0.358
2.642ArgSer: 2.642 ± 0.235
3.1ArgThr: 3.1 ± 0.266
3.294ArgVal: 3.294 ± 0.269
0.669ArgTrp: 0.669 ± 0.127
1.973ArgTyr: 1.973 ± 0.177
0.0ArgXaa: 0.0 ± 0.0
Ser
4.755SerAla: 4.755 ± 0.425
0.986SerCys: 0.986 ± 0.141
3.575SerAsp: 3.575 ± 0.299
2.924SerGlu: 2.924 ± 0.272
2.554SerPhe: 2.554 ± 0.255
5.231SerGly: 5.231 ± 0.497
1.004SerHis: 1.004 ± 0.126
3.505SerIle: 3.505 ± 0.267
4.121SerLys: 4.121 ± 0.303
5.143SerLeu: 5.143 ± 0.319
1.603SerMet: 1.603 ± 0.203
5.407SerAsn: 5.407 ± 1.197
3.029SerPro: 3.029 ± 0.31
1.99SerGln: 1.99 ± 0.161
2.836SerArg: 2.836 ± 0.236
4.879SerSer: 4.879 ± 0.387
4.333SerThr: 4.333 ± 0.462
3.928SerVal: 3.928 ± 0.351
0.74SerTrp: 0.74 ± 0.122
2.043SerTyr: 2.043 ± 0.229
0.0SerXaa: 0.0 ± 0.0
Thr
4.896ThrAla: 4.896 ± 0.369
1.127ThrCys: 1.127 ± 0.133
2.959ThrAsp: 2.959 ± 0.233
2.695ThrGlu: 2.695 ± 0.216
2.571ThrPhe: 2.571 ± 0.242
5.83ThrGly: 5.83 ± 0.596
1.022ThrHis: 1.022 ± 0.141
4.086ThrIle: 4.086 ± 0.378
3.822ThrLys: 3.822 ± 0.316
5.389ThrLeu: 5.389 ± 0.622
1.25ThrMet: 1.25 ± 0.168
4.068ThrAsn: 4.068 ± 0.465
4.016ThrPro: 4.016 ± 0.511
2.483ThrGln: 2.483 ± 0.258
2.906ThrArg: 2.906 ± 0.237
4.579ThrSer: 4.579 ± 0.506
4.914ThrThr: 4.914 ± 0.628
4.755ThrVal: 4.755 ± 0.406
1.145ThrTrp: 1.145 ± 0.14
2.483ThrTyr: 2.483 ± 0.31
0.0ThrXaa: 0.0 ± 0.0
Val
4.667ValAla: 4.667 ± 0.354
1.18ValCys: 1.18 ± 0.147
2.818ValAsp: 2.818 ± 0.241
3.311ValGlu: 3.311 ± 0.27
2.712ValPhe: 2.712 ± 0.261
4.421ValGly: 4.421 ± 0.386
1.198ValHis: 1.198 ± 0.174
4.297ValIle: 4.297 ± 0.31
4.227ValLys: 4.227 ± 0.319
4.72ValLeu: 4.72 ± 0.356
1.656ValMet: 1.656 ± 0.182
4.509ValAsn: 4.509 ± 0.317
3.822ValPro: 3.822 ± 0.265
2.906ValGln: 2.906 ± 0.405
3.804ValArg: 3.804 ± 0.293
3.875ValSer: 3.875 ± 0.301
5.108ValThr: 5.108 ± 0.548
3.822ValVal: 3.822 ± 0.269
0.845ValTrp: 0.845 ± 0.116
2.783ValTyr: 2.783 ± 0.253
0.0ValXaa: 0.0 ± 0.0
Trp
0.828TrpAla: 0.828 ± 0.136
0.299TrpCys: 0.299 ± 0.078
0.881TrpAsp: 0.881 ± 0.133
0.722TrpGlu: 0.722 ± 0.11
0.916TrpPhe: 0.916 ± 0.15
0.863TrpGly: 0.863 ± 0.128
0.317TrpHis: 0.317 ± 0.083
0.793TrpIle: 0.793 ± 0.117
0.951TrpLys: 0.951 ± 0.138
1.11TrpLeu: 1.11 ± 0.156
0.493TrpMet: 0.493 ± 0.109
0.793TrpAsn: 0.793 ± 0.115
0.493TrpPro: 0.493 ± 0.094
0.387TrpGln: 0.387 ± 0.074
0.511TrpArg: 0.511 ± 0.087
0.933TrpSer: 0.933 ± 0.141
0.881TrpThr: 0.881 ± 0.115
1.127TrpVal: 1.127 ± 0.15
0.229TrpTrp: 0.229 ± 0.069
0.687TrpTyr: 0.687 ± 0.117
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.836TyrAla: 2.836 ± 0.231
0.581TyrCys: 0.581 ± 0.096
2.184TyrAsp: 2.184 ± 0.231
1.99TyrGlu: 1.99 ± 0.191
1.832TyrPhe: 1.832 ± 0.202
2.307TyrGly: 2.307 ± 0.156
0.546TyrHis: 0.546 ± 0.101
2.237TyrIle: 2.237 ± 0.197
2.571TyrLys: 2.571 ± 0.271
2.959TyrLeu: 2.959 ± 0.239
1.233TyrMet: 1.233 ± 0.128
1.937TyrAsn: 1.937 ± 0.222
1.497TyrPro: 1.497 ± 0.14
1.25TyrGln: 1.25 ± 0.143
1.585TyrArg: 1.585 ± 0.192
2.783TyrSer: 2.783 ± 0.264
2.254TyrThr: 2.254 ± 0.238
2.712TyrVal: 2.712 ± 0.207
0.44TyrTrp: 0.44 ± 0.073
1.656TyrTyr: 1.656 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 248 proteins (56779 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski