Amino acid dipepetide frequency for Micromonas sp. RCC1109 virus MpV1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.699AlaAla: 4.699 ± 0.599
0.916AlaCys: 0.916 ± 0.138
2.885AlaAsp: 2.885 ± 0.212
3.213AlaGlu: 3.213 ± 0.307
2.177AlaPhe: 2.177 ± 0.199
3.352AlaGly: 3.352 ± 0.331
1.037AlaHis: 1.037 ± 0.149
3.645AlaIle: 3.645 ± 0.232
4.026AlaLys: 4.026 ± 0.376
4.388AlaLeu: 4.388 ± 0.3
1.607AlaMet: 1.607 ± 0.192
3.041AlaAsn: 3.041 ± 0.293
1.9AlaPro: 1.9 ± 0.232
1.693AlaGln: 1.693 ± 0.165
2.782AlaArg: 2.782 ± 0.267
3.749AlaSer: 3.749 ± 0.266
3.196AlaThr: 3.196 ± 0.306
3.438AlaVal: 3.438 ± 0.291
0.397AlaTrp: 0.397 ± 0.084
2.004AlaTyr: 2.004 ± 0.185
0.0AlaXaa: 0.0 ± 0.0
Cys
0.968CysAla: 0.968 ± 0.126
0.311CysCys: 0.311 ± 0.095
1.123CysAsp: 1.123 ± 0.137
1.365CysGlu: 1.365 ± 0.186
0.898CysPhe: 0.898 ± 0.128
1.192CysGly: 1.192 ± 0.154
0.259CysHis: 0.259 ± 0.065
0.933CysIle: 0.933 ± 0.124
1.278CysLys: 1.278 ± 0.191
1.296CysLeu: 1.296 ± 0.16
0.708CysMet: 0.708 ± 0.114
0.881CysAsn: 0.881 ± 0.137
1.019CysPro: 1.019 ± 0.178
0.432CysGln: 0.432 ± 0.089
0.985CysArg: 0.985 ± 0.174
0.812CysSer: 0.812 ± 0.141
0.985CysThr: 0.985 ± 0.146
1.002CysVal: 1.002 ± 0.137
0.104CysTrp: 0.104 ± 0.037
0.76CysTyr: 0.76 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
3.334AspAla: 3.334 ± 0.266
1.088AspCys: 1.088 ± 0.183
4.734AspAsp: 4.734 ± 0.324
5.01AspGlu: 5.01 ± 0.331
3.283AspPhe: 3.283 ± 0.226
4.336AspGly: 4.336 ± 0.326
1.192AspHis: 1.192 ± 0.161
4.872AspIle: 4.872 ± 0.301
3.127AspLys: 3.127 ± 0.29
4.768AspLeu: 4.768 ± 0.26
1.728AspMet: 1.728 ± 0.206
2.609AspAsn: 2.609 ± 0.232
2.229AspPro: 2.229 ± 0.214
1.261AspGln: 1.261 ± 0.159
2.661AspArg: 2.661 ± 0.175
2.782AspSer: 2.782 ± 0.28
4.336AspThr: 4.336 ± 0.314
4.561AspVal: 4.561 ± 0.298
0.795AspTrp: 0.795 ± 0.119
2.488AspTyr: 2.488 ± 0.211
0.0AspXaa: 0.0 ± 0.0
Glu
3.352GluAla: 3.352 ± 0.329
1.382GluCys: 1.382 ± 0.185
3.939GluAsp: 3.939 ± 0.307
5.736GluGlu: 5.736 ± 0.511
3.023GluPhe: 3.023 ± 0.296
3.006GluGly: 3.006 ± 0.26
1.486GluHis: 1.486 ± 0.144
5.01GluIle: 5.01 ± 0.392
5.477GluLys: 5.477 ± 0.447
5.183GluLeu: 5.183 ± 0.342
1.883GluMet: 1.883 ± 0.198
4.043GluAsn: 4.043 ± 0.271
2.315GluPro: 2.315 ± 0.309
2.004GluGln: 2.004 ± 0.223
3.853GluArg: 3.853 ± 0.35
3.196GluSer: 3.196 ± 0.247
3.559GluThr: 3.559 ± 0.262
4.181GluVal: 4.181 ± 0.273
0.812GluTrp: 0.812 ± 0.117
3.023GluTyr: 3.023 ± 0.245
0.0GluXaa: 0.0 ± 0.0
Phe
2.211PheAla: 2.211 ± 0.198
0.968PheCys: 0.968 ± 0.154
3.265PheAsp: 3.265 ± 0.253
2.972PheGlu: 2.972 ± 0.207
2.142PhePhe: 2.142 ± 0.276
2.643PheGly: 2.643 ± 0.216
1.192PheHis: 1.192 ± 0.147
3.283PheIle: 3.283 ± 0.258
3.386PheLys: 3.386 ± 0.243
3.749PheLeu: 3.749 ± 0.233
1.33PheMet: 1.33 ± 0.16
2.401PheAsn: 2.401 ± 0.238
1.417PhePro: 1.417 ± 0.132
1.33PheGln: 1.33 ± 0.169
2.039PheArg: 2.039 ± 0.207
3.144PheSer: 3.144 ± 0.233
2.643PheThr: 2.643 ± 0.205
3.559PheVal: 3.559 ± 0.295
0.518PheTrp: 0.518 ± 0.092
1.883PheTyr: 1.883 ± 0.192
0.0PheXaa: 0.0 ± 0.0
Gly
3.404GlyAla: 3.404 ± 0.332
1.071GlyCys: 1.071 ± 0.164
4.095GlyAsp: 4.095 ± 0.323
3.265GlyGlu: 3.265 ± 0.217
2.799GlyPhe: 2.799 ± 0.235
6.064GlyGly: 6.064 ± 1.537
1.244GlyHis: 1.244 ± 0.171
4.181GlyIle: 4.181 ± 0.312
4.302GlyLys: 4.302 ± 0.316
5.079GlyLeu: 5.079 ± 0.296
1.469GlyMet: 1.469 ± 0.163
3.835GlyAsn: 3.835 ± 0.43
1.918GlyPro: 1.918 ± 0.212
1.952GlyGln: 1.952 ± 0.178
2.782GlyArg: 2.782 ± 0.281
3.887GlySer: 3.887 ± 0.332
4.043GlyThr: 4.043 ± 0.527
3.922GlyVal: 3.922 ± 0.369
0.657GlyTrp: 0.657 ± 0.122
2.574GlyTyr: 2.574 ± 0.208
0.0GlyXaa: 0.0 ± 0.0
His
1.192HisAla: 1.192 ± 0.156
0.415HisCys: 0.415 ± 0.095
1.278HisAsp: 1.278 ± 0.159
1.417HisGlu: 1.417 ± 0.156
1.175HisPhe: 1.175 ± 0.166
1.538HisGly: 1.538 ± 0.171
0.622HisHis: 0.622 ± 0.11
1.348HisIle: 1.348 ± 0.186
1.382HisLys: 1.382 ± 0.178
1.693HisLeu: 1.693 ± 0.173
0.812HisMet: 0.812 ± 0.115
0.916HisAsn: 0.916 ± 0.143
0.968HisPro: 0.968 ± 0.136
0.743HisGln: 0.743 ± 0.121
0.933HisArg: 0.933 ± 0.121
0.898HisSer: 0.898 ± 0.13
1.399HisThr: 1.399 ± 0.161
1.728HisVal: 1.728 ± 0.288
0.311HisTrp: 0.311 ± 0.071
0.916HisTyr: 0.916 ± 0.142
0.0HisXaa: 0.0 ± 0.0
Ile
3.749IleAla: 3.749 ± 0.253
1.054IleCys: 1.054 ± 0.149
4.855IleAsp: 4.855 ± 0.296
4.907IleGlu: 4.907 ± 0.369
3.369IlePhe: 3.369 ± 0.231
3.818IleGly: 3.818 ± 0.297
2.073IleHis: 2.073 ± 0.233
4.509IleIle: 4.509 ± 0.317
5.287IleLys: 5.287 ± 0.413
5.356IleLeu: 5.356 ± 0.269
1.78IleMet: 1.78 ± 0.171
3.974IleAsn: 3.974 ± 0.393
3.162IlePro: 3.162 ± 0.242
2.35IleGln: 2.35 ± 0.17
3.179IleArg: 3.179 ± 0.235
4.008IleSer: 4.008 ± 0.286
4.146IleThr: 4.146 ± 0.282
4.164IleVal: 4.164 ± 0.306
0.553IleTrp: 0.553 ± 0.09
2.471IleTyr: 2.471 ± 0.193
0.0IleXaa: 0.0 ± 0.0
Lys
3.991LysAla: 3.991 ± 0.376
1.555LysCys: 1.555 ± 0.169
3.663LysAsp: 3.663 ± 0.32
5.114LysGlu: 5.114 ± 0.376
3.283LysPhe: 3.283 ± 0.261
3.663LysGly: 3.663 ± 0.258
1.313LysHis: 1.313 ± 0.16
5.2LysIle: 5.2 ± 0.396
8.155LysLys: 8.155 ± 0.681
5.909LysLeu: 5.909 ± 0.417
2.522LysMet: 2.522 ± 0.226
5.166LysAsn: 5.166 ± 0.646
3.213LysPro: 3.213 ± 0.264
2.574LysGln: 2.574 ± 0.323
4.043LysArg: 4.043 ± 0.378
4.63LysSer: 4.63 ± 0.357
4.578LysThr: 4.578 ± 0.312
4.388LysVal: 4.388 ± 0.307
0.674LysTrp: 0.674 ± 0.116
3.645LysTyr: 3.645 ± 0.277
0.0LysXaa: 0.0 ± 0.0
Leu
3.922LeuAla: 3.922 ± 0.309
1.382LeuCys: 1.382 ± 0.195
5.01LeuAsp: 5.01 ± 0.309
4.872LeuGlu: 4.872 ± 0.338
3.386LeuPhe: 3.386 ± 0.286
4.146LeuGly: 4.146 ± 0.266
2.021LeuHis: 2.021 ± 0.272
5.39LeuIle: 5.39 ± 0.313
6.289LeuLys: 6.289 ± 0.399
5.65LeuLeu: 5.65 ± 0.381
2.263LeuMet: 2.263 ± 0.247
4.63LeuAsn: 4.63 ± 0.509
3.3LeuPro: 3.3 ± 0.27
2.522LeuGln: 2.522 ± 0.208
4.44LeuArg: 4.44 ± 0.286
5.546LeuSer: 5.546 ± 0.362
4.613LeuThr: 4.613 ± 0.312
4.958LeuVal: 4.958 ± 0.342
0.864LeuTrp: 0.864 ± 0.132
2.954LeuTyr: 2.954 ± 0.25
0.0LeuXaa: 0.0 ± 0.0
Met
1.451MetAla: 1.451 ± 0.139
0.829MetCys: 0.829 ± 0.126
1.883MetAsp: 1.883 ± 0.191
1.78MetGlu: 1.78 ± 0.217
1.451MetPhe: 1.451 ± 0.173
1.624MetGly: 1.624 ± 0.163
0.57MetHis: 0.57 ± 0.113
2.056MetIle: 2.056 ± 0.212
2.678MetLys: 2.678 ± 0.234
1.9MetLeu: 1.9 ± 0.154
0.985MetMet: 0.985 ± 0.165
1.883MetAsn: 1.883 ± 0.219
1.019MetPro: 1.019 ± 0.126
0.743MetGln: 0.743 ± 0.108
1.296MetArg: 1.296 ± 0.167
2.367MetSer: 2.367 ± 0.196
1.693MetThr: 1.693 ± 0.193
1.469MetVal: 1.469 ± 0.156
0.363MetTrp: 0.363 ± 0.081
1.503MetTyr: 1.503 ± 0.176
0.0MetXaa: 0.0 ± 0.0
Asn
3.334AsnAla: 3.334 ± 0.346
0.795AsnCys: 0.795 ± 0.135
3.023AsnAsp: 3.023 ± 0.229
3.732AsnGlu: 3.732 ± 0.267
2.661AsnPhe: 2.661 ± 0.185
3.715AsnGly: 3.715 ± 0.308
1.244AsnHis: 1.244 ± 0.152
4.647AsnIle: 4.647 ± 0.392
4.095AsnLys: 4.095 ± 0.485
4.855AsnLeu: 4.855 ± 0.517
1.866AsnMet: 1.866 ± 0.192
3.473AsnAsn: 3.473 ± 0.486
2.592AsnPro: 2.592 ± 0.225
1.918AsnGln: 1.918 ± 0.219
2.263AsnArg: 2.263 ± 0.315
3.11AsnSer: 3.11 ± 0.226
3.732AsnThr: 3.732 ± 0.392
4.889AsnVal: 4.889 ± 0.576
0.847AsnTrp: 0.847 ± 0.114
2.039AsnTyr: 2.039 ± 0.234
0.0AsnXaa: 0.0 ± 0.0
Pro
1.451ProAla: 1.451 ± 0.189
0.587ProCys: 0.587 ± 0.11
2.782ProAsp: 2.782 ± 0.267
3.144ProGlu: 3.144 ± 0.359
1.693ProPhe: 1.693 ± 0.155
2.384ProGly: 2.384 ± 0.214
0.726ProHis: 0.726 ± 0.128
2.401ProIle: 2.401 ± 0.21
3.404ProLys: 3.404 ± 0.295
2.972ProLeu: 2.972 ± 0.3
1.175ProMet: 1.175 ± 0.168
2.177ProAsn: 2.177 ± 0.24
2.54ProPro: 2.54 ± 0.327
1.52ProGln: 1.52 ± 0.181
1.97ProArg: 1.97 ± 0.206
2.522ProSer: 2.522 ± 0.193
2.712ProThr: 2.712 ± 0.231
2.609ProVal: 2.609 ± 0.21
0.432ProTrp: 0.432 ± 0.091
1.486ProTyr: 1.486 ± 0.166
0.0ProXaa: 0.0 ± 0.0
Gln
1.693GlnAla: 1.693 ± 0.143
0.294GlnCys: 0.294 ± 0.064
1.659GlnAsp: 1.659 ± 0.189
2.125GlnGlu: 2.125 ± 0.21
1.486GlnPhe: 1.486 ± 0.147
1.728GlnGly: 1.728 ± 0.183
0.657GlnHis: 0.657 ± 0.103
2.16GlnIle: 2.16 ± 0.203
2.436GlnLys: 2.436 ± 0.241
2.972GlnLeu: 2.972 ± 0.243
1.088GlnMet: 1.088 ± 0.166
1.814GlnAsn: 1.814 ± 0.178
1.572GlnPro: 1.572 ± 0.19
0.933GlnGln: 0.933 ± 0.114
1.399GlnArg: 1.399 ± 0.225
1.78GlnSer: 1.78 ± 0.198
2.073GlnThr: 2.073 ± 0.217
1.78GlnVal: 1.78 ± 0.169
0.311GlnTrp: 0.311 ± 0.076
1.019GlnTyr: 1.019 ± 0.132
0.0GlnXaa: 0.0 ± 0.0
Arg
2.332ArgAla: 2.332 ± 0.27
0.674ArgCys: 0.674 ± 0.125
2.851ArgAsp: 2.851 ± 0.248
3.334ArgGlu: 3.334 ± 0.298
2.108ArgPhe: 2.108 ± 0.179
2.643ArgGly: 2.643 ± 0.203
1.244ArgHis: 1.244 ± 0.178
3.231ArgIle: 3.231 ± 0.207
4.008ArgLys: 4.008 ± 0.34
3.801ArgLeu: 3.801 ± 0.244
1.417ArgMet: 1.417 ± 0.172
2.557ArgAsn: 2.557 ± 0.321
1.952ArgPro: 1.952 ± 0.174
1.486ArgGln: 1.486 ± 0.156
2.712ArgArg: 2.712 ± 0.262
2.592ArgSer: 2.592 ± 0.199
2.315ArgThr: 2.315 ± 0.192
4.008ArgVal: 4.008 ± 0.248
0.518ArgTrp: 0.518 ± 0.101
1.676ArgTyr: 1.676 ± 0.165
0.0ArgXaa: 0.0 ± 0.0
Ser
3.732SerAla: 3.732 ± 0.296
0.95SerCys: 0.95 ± 0.15
3.628SerAsp: 3.628 ± 0.287
3.645SerGlu: 3.645 ± 0.315
2.712SerPhe: 2.712 ± 0.244
4.509SerGly: 4.509 ± 0.476
1.14SerHis: 1.14 ± 0.151
4.198SerIle: 4.198 ± 0.313
4.146SerLys: 4.146 ± 0.255
4.665SerLeu: 4.665 ± 0.315
1.659SerMet: 1.659 ± 0.191
4.423SerAsn: 4.423 ± 0.533
2.194SerPro: 2.194 ± 0.173
1.9SerGln: 1.9 ± 0.197
2.764SerArg: 2.764 ± 0.216
4.855SerSer: 4.855 ± 0.384
3.922SerThr: 3.922 ± 0.393
3.956SerVal: 3.956 ± 0.265
0.501SerTrp: 0.501 ± 0.095
2.056SerTyr: 2.056 ± 0.179
0.0SerXaa: 0.0 ± 0.0
Thr
3.058ThrAla: 3.058 ± 0.249
0.847ThrCys: 0.847 ± 0.139
3.628ThrAsp: 3.628 ± 0.352
3.179ThrGlu: 3.179 ± 0.241
2.989ThrPhe: 2.989 ± 0.228
4.803ThrGly: 4.803 ± 0.45
1.313ThrHis: 1.313 ± 0.161
4.008ThrIle: 4.008 ± 0.299
4.699ThrLys: 4.699 ± 0.317
4.993ThrLeu: 4.993 ± 0.367
1.572ThrMet: 1.572 ± 0.162
4.077ThrAsn: 4.077 ± 0.409
2.954ThrPro: 2.954 ± 0.177
1.987ThrGln: 1.987 ± 0.217
2.419ThrArg: 2.419 ± 0.184
4.233ThrSer: 4.233 ± 0.424
4.44ThrThr: 4.44 ± 0.388
3.887ThrVal: 3.887 ± 0.367
0.622ThrTrp: 0.622 ± 0.109
1.952ThrTyr: 1.952 ± 0.148
0.0ThrXaa: 0.0 ± 0.0
Val
3.715ValAla: 3.715 ± 0.268
1.227ValCys: 1.227 ± 0.149
3.974ValAsp: 3.974 ± 0.327
4.492ValGlu: 4.492 ± 0.26
2.73ValPhe: 2.73 ± 0.206
4.198ValGly: 4.198 ± 0.599
1.227ValHis: 1.227 ± 0.146
4.164ValIle: 4.164 ± 0.304
5.218ValLys: 5.218 ± 0.32
4.924ValLeu: 4.924 ± 0.323
1.762ValMet: 1.762 ± 0.176
3.697ValAsn: 3.697 ± 0.279
2.799ValPro: 2.799 ± 0.257
2.35ValGln: 2.35 ± 0.257
2.833ValArg: 2.833 ± 0.218
4.388ValSer: 4.388 ± 0.375
4.112ValThr: 4.112 ± 0.349
4.216ValVal: 4.216 ± 0.277
0.708ValTrp: 0.708 ± 0.111
3.162ValTyr: 3.162 ± 0.277
0.0ValXaa: 0.0 ± 0.0
Trp
0.553TrpAla: 0.553 ± 0.086
0.259TrpCys: 0.259 ± 0.069
0.536TrpAsp: 0.536 ± 0.096
0.605TrpGlu: 0.605 ± 0.107
0.726TrpPhe: 0.726 ± 0.128
0.57TrpGly: 0.57 ± 0.103
0.138TrpHis: 0.138 ± 0.047
0.708TrpIle: 0.708 ± 0.106
0.985TrpLys: 0.985 ± 0.137
0.829TrpLeu: 0.829 ± 0.116
0.242TrpMet: 0.242 ± 0.067
0.898TrpAsn: 0.898 ± 0.117
0.311TrpPro: 0.311 ± 0.07
0.259TrpGln: 0.259 ± 0.065
0.311TrpArg: 0.311 ± 0.084
0.639TrpSer: 0.639 ± 0.113
0.639TrpThr: 0.639 ± 0.11
0.605TrpVal: 0.605 ± 0.098
0.155TrpTrp: 0.155 ± 0.049
0.432TrpTyr: 0.432 ± 0.089
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.004TyrAla: 2.004 ± 0.198
0.708TyrCys: 0.708 ± 0.113
2.384TyrAsp: 2.384 ± 0.224
2.557TyrGlu: 2.557 ± 0.229
1.987TyrPhe: 1.987 ± 0.193
2.609TyrGly: 2.609 ± 0.258
0.985TyrHis: 0.985 ± 0.124
2.816TyrIle: 2.816 ± 0.226
2.92TyrLys: 2.92 ± 0.281
3.179TyrLeu: 3.179 ± 0.257
1.659TyrMet: 1.659 ± 0.172
2.332TyrAsn: 2.332 ± 0.214
1.313TyrPro: 1.313 ± 0.152
1.019TyrGln: 1.019 ± 0.134
1.797TyrArg: 1.797 ± 0.185
2.419TyrSer: 2.419 ± 0.246
2.522TyrThr: 2.522 ± 0.235
2.574TyrVal: 2.574 ± 0.33
0.259TyrTrp: 0.259 ± 0.075
1.33TyrTyr: 1.33 ± 0.163
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 244 proteins (57882 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski