Amino acid dipepetide frequency for Ostreococcus tauri virus RT-2011

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.345AlaAla: 4.345 ± 0.363
1.044AlaCys: 1.044 ± 0.167
2.779AlaAsp: 2.779 ± 0.271
2.762AlaGlu: 2.762 ± 0.264
2.274AlaPhe: 2.274 ± 0.172
3.385AlaGly: 3.385 ± 0.281
1.128AlaHis: 1.128 ± 0.142
3.368AlaIle: 3.368 ± 0.249
4.244AlaLys: 4.244 ± 0.519
4.665AlaLeu: 4.665 ± 0.303
1.566AlaMet: 1.566 ± 0.18
3.048AlaAsn: 3.048 ± 0.567
1.971AlaPro: 1.971 ± 0.235
2.459AlaGln: 2.459 ± 0.318
2.914AlaArg: 2.914 ± 0.295
3.571AlaSer: 3.571 ± 0.257
2.947AlaThr: 2.947 ± 0.29
3.436AlaVal: 3.436 ± 0.249
0.539AlaTrp: 0.539 ± 0.125
2.526AlaTyr: 2.526 ± 0.251
0.0AlaXaa: 0.0 ± 0.0
Cys
0.792CysAla: 0.792 ± 0.117
0.455CysCys: 0.455 ± 0.11
1.28CysAsp: 1.28 ± 0.146
1.179CysGlu: 1.179 ± 0.159
0.707CysPhe: 0.707 ± 0.108
1.128CysGly: 1.128 ± 0.147
0.354CysHis: 0.354 ± 0.079
0.859CysIle: 0.859 ± 0.126
1.128CysLys: 1.128 ± 0.165
0.977CysLeu: 0.977 ± 0.128
0.825CysMet: 0.825 ± 0.13
0.775CysAsn: 0.775 ± 0.148
1.263CysPro: 1.263 ± 0.209
0.286CysGln: 0.286 ± 0.065
1.044CysArg: 1.044 ± 0.151
1.061CysSer: 1.061 ± 0.153
0.859CysThr: 0.859 ± 0.116
0.926CysVal: 0.926 ± 0.145
0.101CysTrp: 0.101 ± 0.04
0.792CysTyr: 0.792 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
3.368AspAla: 3.368 ± 0.282
0.758AspCys: 0.758 ± 0.123
4.396AspAsp: 4.396 ± 0.321
5.086AspGlu: 5.086 ± 0.357
2.829AspPhe: 2.829 ± 0.244
3.655AspGly: 3.655 ± 0.334
1.246AspHis: 1.246 ± 0.168
4.16AspIle: 4.16 ± 0.271
4.042AspLys: 4.042 ± 0.275
5.154AspLeu: 5.154 ± 0.297
1.785AspMet: 1.785 ± 0.184
2.375AspAsn: 2.375 ± 0.179
2.358AspPro: 2.358 ± 0.231
1.314AspGln: 1.314 ± 0.145
2.678AspArg: 2.678 ± 0.191
3.116AspSer: 3.116 ± 0.304
3.874AspThr: 3.874 ± 0.288
4.632AspVal: 4.632 ± 0.274
0.96AspTrp: 0.96 ± 0.138
2.829AspTyr: 2.829 ± 0.273
0.0AspXaa: 0.0 ± 0.0
Glu
3.823GluAla: 3.823 ± 0.349
1.112GluCys: 1.112 ± 0.167
3.958GluAsp: 3.958 ± 0.273
5.827GluGlu: 5.827 ± 0.496
2.712GluPhe: 2.712 ± 0.191
3.116GluGly: 3.116 ± 0.228
1.634GluHis: 1.634 ± 0.205
4.8GluIle: 4.8 ± 0.35
6.046GluLys: 6.046 ± 0.62
5.575GluLeu: 5.575 ± 0.372
1.92GluMet: 1.92 ± 0.197
3.823GluAsn: 3.823 ± 0.254
2.375GluPro: 2.375 ± 0.297
1.954GluGln: 1.954 ± 0.216
3.571GluArg: 3.571 ± 0.382
3.385GluSer: 3.385 ± 0.242
3.655GluThr: 3.655 ± 0.275
4.025GluVal: 4.025 ± 0.263
0.842GluTrp: 0.842 ± 0.12
2.678GluTyr: 2.678 ± 0.25
0.0GluXaa: 0.0 ± 0.0
Phe
2.459PheAla: 2.459 ± 0.251
0.842PheCys: 0.842 ± 0.123
2.627PheAsp: 2.627 ± 0.245
2.678PheGlu: 2.678 ± 0.227
2.274PhePhe: 2.274 ± 0.264
2.493PheGly: 2.493 ± 0.2
1.229PheHis: 1.229 ± 0.176
2.728PheIle: 2.728 ± 0.219
3.267PheLys: 3.267 ± 0.255
3.975PheLeu: 3.975 ± 0.273
1.482PheMet: 1.482 ± 0.164
2.055PheAsn: 2.055 ± 0.18
1.448PhePro: 1.448 ± 0.177
1.078PheGln: 1.078 ± 0.142
2.072PheArg: 2.072 ± 0.207
3.149PheSer: 3.149 ± 0.284
2.257PheThr: 2.257 ± 0.242
3.301PheVal: 3.301 ± 0.259
0.32PheTrp: 0.32 ± 0.081
1.886PheTyr: 1.886 ± 0.169
0.0PheXaa: 0.0 ± 0.0
Gly
3.773GlyAla: 3.773 ± 0.309
0.909GlyCys: 0.909 ± 0.13
3.402GlyAsp: 3.402 ± 0.267
3.402GlyGlu: 3.402 ± 0.193
2.964GlyPhe: 2.964 ± 0.234
4.244GlyGly: 4.244 ± 0.496
1.095GlyHis: 1.095 ± 0.135
3.924GlyIle: 3.924 ± 0.264
4.008GlyLys: 4.008 ± 0.285
5.154GlyLeu: 5.154 ± 0.307
1.735GlyMet: 1.735 ± 0.171
3.402GlyAsn: 3.402 ± 0.501
2.274GlyPro: 2.274 ± 0.255
1.768GlyGln: 1.768 ± 0.164
2.509GlyArg: 2.509 ± 0.251
3.638GlySer: 3.638 ± 0.422
3.638GlyThr: 3.638 ± 0.332
4.497GlyVal: 4.497 ± 0.425
0.657GlyTrp: 0.657 ± 0.107
2.459GlyTyr: 2.459 ± 0.236
0.0GlyXaa: 0.0 ± 0.0
His
0.926HisAla: 0.926 ± 0.124
0.455HisCys: 0.455 ± 0.093
0.994HisAsp: 0.994 ± 0.145
1.364HisGlu: 1.364 ± 0.126
0.859HisPhe: 0.859 ± 0.145
1.415HisGly: 1.415 ± 0.174
0.657HisHis: 0.657 ± 0.094
1.549HisIle: 1.549 ± 0.183
1.331HisLys: 1.331 ± 0.181
1.718HisLeu: 1.718 ± 0.183
0.825HisMet: 0.825 ± 0.12
1.044HisAsn: 1.044 ± 0.131
1.095HisPro: 1.095 ± 0.157
0.758HisGln: 0.758 ± 0.113
1.179HisArg: 1.179 ± 0.147
1.112HisSer: 1.112 ± 0.14
1.533HisThr: 1.533 ± 0.155
1.196HisVal: 1.196 ± 0.171
0.438HisTrp: 0.438 ± 0.092
0.96HisTyr: 0.96 ± 0.147
0.0HisXaa: 0.0 ± 0.0
Ile
3.318IleAla: 3.318 ± 0.251
1.061IleCys: 1.061 ± 0.13
4.581IleAsp: 4.581 ± 0.311
4.733IleGlu: 4.733 ± 0.26
2.223IlePhe: 2.223 ± 0.177
3.419IleGly: 3.419 ± 0.289
1.6IleHis: 1.6 ± 0.179
3.419IleIle: 3.419 ± 0.276
5.322IleLys: 5.322 ± 0.328
5.086IleLeu: 5.086 ± 0.325
1.684IleMet: 1.684 ± 0.155
3.604IleAsn: 3.604 ± 0.283
3.065IlePro: 3.065 ± 0.249
2.408IleGln: 2.408 ± 0.187
2.829IleArg: 2.829 ± 0.19
3.571IleSer: 3.571 ± 0.276
4.076IleThr: 4.076 ± 0.29
4.396IleVal: 4.396 ± 0.325
0.522IleTrp: 0.522 ± 0.089
2.358IleTyr: 2.358 ± 0.231
0.0IleXaa: 0.0 ± 0.0
Lys
3.638LysAla: 3.638 ± 0.517
1.499LysCys: 1.499 ± 0.209
4.312LysAsp: 4.312 ± 0.307
5.002LysGlu: 5.002 ± 0.452
3.554LysPhe: 3.554 ± 0.284
3.385LysGly: 3.385 ± 0.278
1.566LysHis: 1.566 ± 0.193
5.103LysIle: 5.103 ± 0.288
8.505LysLys: 8.505 ± 0.959
6.821LysLeu: 6.821 ± 0.534
2.594LysMet: 2.594 ± 0.232
5.036LysAsn: 5.036 ± 0.431
3.251LysPro: 3.251 ± 0.254
2.678LysGln: 2.678 ± 0.359
4.295LysArg: 4.295 ± 0.591
4.227LysSer: 4.227 ± 0.323
5.137LysThr: 5.137 ± 0.348
4.985LysVal: 4.985 ± 0.312
0.741LysTrp: 0.741 ± 0.116
3.301LysTyr: 3.301 ± 0.258
0.0LysXaa: 0.0 ± 0.0
Leu
4.867LeuAla: 4.867 ± 0.291
1.415LeuCys: 1.415 ± 0.163
5.305LeuAsp: 5.305 ± 0.278
5.221LeuGlu: 5.221 ± 0.343
3.082LeuPhe: 3.082 ± 0.217
4.783LeuGly: 4.783 ± 0.335
1.634LeuHis: 1.634 ± 0.169
5.406LeuIle: 5.406 ± 0.334
6.989LeuLys: 6.989 ± 0.562
6.602LeuLeu: 6.602 ± 0.364
2.189LeuMet: 2.189 ± 0.201
5.12LeuAsn: 5.12 ± 0.799
3.368LeuPro: 3.368 ± 0.282
2.425LeuGln: 2.425 ± 0.275
4.278LeuArg: 4.278 ± 0.305
4.884LeuSer: 4.884 ± 0.295
5.12LeuThr: 5.12 ± 0.383
5.204LeuVal: 5.204 ± 0.285
0.792LeuTrp: 0.792 ± 0.116
3.133LeuTyr: 3.133 ± 0.26
0.0LeuXaa: 0.0 ± 0.0
Met
1.482MetAla: 1.482 ± 0.171
0.657MetCys: 0.657 ± 0.11
1.718MetAsp: 1.718 ± 0.151
1.954MetGlu: 1.954 ± 0.215
1.381MetPhe: 1.381 ± 0.162
1.718MetGly: 1.718 ± 0.174
0.623MetHis: 0.623 ± 0.105
1.853MetIle: 1.853 ± 0.196
3.116MetLys: 3.116 ± 0.269
2.004MetLeu: 2.004 ± 0.186
1.128MetMet: 1.128 ± 0.16
2.088MetAsn: 2.088 ± 0.278
0.926MetPro: 0.926 ± 0.119
0.909MetGln: 0.909 ± 0.114
1.297MetArg: 1.297 ± 0.148
2.476MetSer: 2.476 ± 0.194
1.246MetThr: 1.246 ± 0.14
1.583MetVal: 1.583 ± 0.156
0.522MetTrp: 0.522 ± 0.092
1.465MetTyr: 1.465 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
3.537AsnAla: 3.537 ± 0.413
0.674AsnCys: 0.674 ± 0.11
2.762AsnAsp: 2.762 ± 0.221
3.234AsnGlu: 3.234 ± 0.234
2.947AsnPhe: 2.947 ± 0.284
3.52AsnGly: 3.52 ± 0.236
1.095AsnHis: 1.095 ± 0.128
3.975AsnIle: 3.975 ± 0.285
4.093AsnLys: 4.093 ± 0.631
4.918AsnLeu: 4.918 ± 0.479
1.533AsnMet: 1.533 ± 0.178
4.093AsnAsn: 4.093 ± 0.561
2.408AsnPro: 2.408 ± 0.223
1.735AsnGln: 1.735 ± 0.295
2.661AsnArg: 2.661 ± 0.675
3.251AsnSer: 3.251 ± 0.344
3.874AsnThr: 3.874 ± 0.596
5.171AsnVal: 5.171 ± 0.624
0.589AsnTrp: 0.589 ± 0.109
2.156AsnTyr: 2.156 ± 0.223
0.0AsnXaa: 0.0 ± 0.0
Pro
1.634ProAla: 1.634 ± 0.19
0.556ProCys: 0.556 ± 0.135
2.425ProAsp: 2.425 ± 0.275
3.453ProGlu: 3.453 ± 0.293
1.869ProPhe: 1.869 ± 0.195
2.611ProGly: 2.611 ± 0.207
0.691ProHis: 0.691 ± 0.102
2.341ProIle: 2.341 ± 0.209
3.318ProLys: 3.318 ± 0.321
2.728ProLeu: 2.728 ± 0.23
1.297ProMet: 1.297 ± 0.177
2.611ProAsn: 2.611 ± 0.238
2.324ProPro: 2.324 ± 0.308
1.735ProGln: 1.735 ± 0.178
1.92ProArg: 1.92 ± 0.172
2.56ProSer: 2.56 ± 0.228
2.964ProThr: 2.964 ± 0.247
2.88ProVal: 2.88 ± 0.255
0.337ProTrp: 0.337 ± 0.07
1.516ProTyr: 1.516 ± 0.176
0.0ProXaa: 0.0 ± 0.0
Gln
1.802GlnAla: 1.802 ± 0.226
0.505GlnCys: 0.505 ± 0.1
1.836GlnAsp: 1.836 ± 0.171
2.189GlnGlu: 2.189 ± 0.305
1.398GlnPhe: 1.398 ± 0.164
1.549GlnGly: 1.549 ± 0.17
0.606GlnHis: 0.606 ± 0.092
2.206GlnIle: 2.206 ± 0.203
2.644GlnLys: 2.644 ± 0.276
2.998GlnLeu: 2.998 ± 0.282
1.078GlnMet: 1.078 ± 0.143
2.088GlnAsn: 2.088 ± 0.283
1.667GlnPro: 1.667 ± 0.17
1.162GlnGln: 1.162 ± 0.129
1.836GlnArg: 1.836 ± 0.245
1.819GlnSer: 1.819 ± 0.198
1.954GlnThr: 1.954 ± 0.196
1.92GlnVal: 1.92 ± 0.189
0.404GlnTrp: 0.404 ± 0.076
1.095GlnTyr: 1.095 ± 0.135
0.0GlnXaa: 0.0 ± 0.0
Arg
2.661ArgAla: 2.661 ± 0.331
0.741ArgCys: 0.741 ± 0.123
3.486ArgAsp: 3.486 ± 0.265
3.891ArgGlu: 3.891 ± 0.441
1.954ArgPhe: 1.954 ± 0.172
2.627ArgGly: 2.627 ± 0.182
0.977ArgHis: 0.977 ± 0.138
2.897ArgIle: 2.897 ± 0.243
3.621ArgLys: 3.621 ± 0.409
3.907ArgLeu: 3.907 ± 0.286
1.415ArgMet: 1.415 ± 0.191
2.796ArgAsn: 2.796 ± 0.455
1.92ArgPro: 1.92 ± 0.182
1.903ArgGln: 1.903 ± 0.311
2.947ArgArg: 2.947 ± 0.306
2.712ArgSer: 2.712 ± 0.211
2.543ArgThr: 2.543 ± 0.203
3.773ArgVal: 3.773 ± 0.255
0.505ArgTrp: 0.505 ± 0.097
1.819ArgTyr: 1.819 ± 0.201
0.0ArgXaa: 0.0 ± 0.0
Ser
3.015SerAla: 3.015 ± 0.227
0.96SerCys: 0.96 ± 0.138
3.773SerAsp: 3.773 ± 0.305
3.638SerGlu: 3.638 ± 0.314
2.577SerPhe: 2.577 ± 0.211
4.733SerGly: 4.733 ± 0.429
1.246SerHis: 1.246 ± 0.155
3.722SerIle: 3.722 ± 0.276
4.093SerLys: 4.093 ± 0.275
4.918SerLeu: 4.918 ± 0.303
1.684SerMet: 1.684 ± 0.179
3.891SerAsn: 3.891 ± 0.444
2.324SerPro: 2.324 ± 0.256
2.257SerGln: 2.257 ± 0.211
2.88SerArg: 2.88 ± 0.218
3.975SerSer: 3.975 ± 0.324
4.008SerThr: 4.008 ± 0.377
4.076SerVal: 4.076 ± 0.328
0.556SerTrp: 0.556 ± 0.1
1.937SerTyr: 1.937 ± 0.161
0.0SerXaa: 0.0 ± 0.0
Thr
3.099ThrAla: 3.099 ± 0.286
0.926ThrCys: 0.926 ± 0.136
3.453ThrAsp: 3.453 ± 0.435
3.621ThrGlu: 3.621 ± 0.249
2.712ThrPhe: 2.712 ± 0.252
4.177ThrGly: 4.177 ± 0.448
1.516ThrHis: 1.516 ± 0.174
3.385ThrIle: 3.385 ± 0.238
4.48ThrLys: 4.48 ± 0.273
5.356ThrLeu: 5.356 ± 0.285
1.701ThrMet: 1.701 ± 0.174
4.093ThrAsn: 4.093 ± 0.451
2.981ThrPro: 2.981 ± 0.25
2.291ThrGln: 2.291 ± 0.219
2.897ThrArg: 2.897 ± 0.226
3.84ThrSer: 3.84 ± 0.349
4.312ThrThr: 4.312 ± 0.541
3.621ThrVal: 3.621 ± 0.361
0.556ThrTrp: 0.556 ± 0.095
2.324ThrTyr: 2.324 ± 0.208
0.0ThrXaa: 0.0 ± 0.0
Val
3.907ValAla: 3.907 ± 0.292
1.398ValCys: 1.398 ± 0.175
4.278ValAsp: 4.278 ± 0.241
4.261ValGlu: 4.261 ± 0.257
3.133ValPhe: 3.133 ± 0.289
4.093ValGly: 4.093 ± 0.542
1.432ValHis: 1.432 ± 0.177
3.688ValIle: 3.688 ± 0.279
5.44ValLys: 5.44 ± 0.349
5.086ValLeu: 5.086 ± 0.309
1.937ValMet: 1.937 ± 0.165
3.638ValAsn: 3.638 ± 0.419
2.947ValPro: 2.947 ± 0.31
2.358ValGln: 2.358 ± 0.202
3.065ValArg: 3.065 ± 0.243
4.531ValSer: 4.531 ± 0.46
4.396ValThr: 4.396 ± 0.466
4.194ValVal: 4.194 ± 0.476
0.64ValTrp: 0.64 ± 0.102
2.914ValTyr: 2.914 ± 0.328
0.0ValXaa: 0.0 ± 0.0
Trp
0.32TrpAla: 0.32 ± 0.071
0.303TrpCys: 0.303 ± 0.074
0.674TrpAsp: 0.674 ± 0.121
0.589TrpGlu: 0.589 ± 0.108
0.505TrpPhe: 0.505 ± 0.075
0.808TrpGly: 0.808 ± 0.103
0.168TrpHis: 0.168 ± 0.053
0.893TrpIle: 0.893 ± 0.141
0.876TrpLys: 0.876 ± 0.126
0.96TrpLeu: 0.96 ± 0.125
0.202TrpMet: 0.202 ± 0.061
0.573TrpAsn: 0.573 ± 0.094
0.371TrpPro: 0.371 ± 0.092
0.253TrpGln: 0.253 ± 0.072
0.421TrpArg: 0.421 ± 0.096
0.707TrpSer: 0.707 ± 0.138
0.623TrpThr: 0.623 ± 0.102
0.724TrpVal: 0.724 ± 0.133
0.202TrpTrp: 0.202 ± 0.064
0.387TrpTyr: 0.387 ± 0.088
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.206TyrAla: 2.206 ± 0.223
0.522TyrCys: 0.522 ± 0.097
2.577TyrAsp: 2.577 ± 0.241
2.745TyrGlu: 2.745 ± 0.244
1.667TyrPhe: 1.667 ± 0.208
2.594TyrGly: 2.594 ± 0.248
0.909TyrHis: 0.909 ± 0.126
2.998TyrIle: 2.998 ± 0.265
3.065TyrLys: 3.065 ± 0.301
3.217TyrLeu: 3.217 ± 0.259
1.549TyrMet: 1.549 ± 0.177
2.072TyrAsn: 2.072 ± 0.19
1.432TyrPro: 1.432 ± 0.176
1.027TyrGln: 1.027 ± 0.149
1.802TyrArg: 1.802 ± 0.214
2.678TyrSer: 2.678 ± 0.229
2.341TyrThr: 2.341 ± 0.233
2.796TyrVal: 2.796 ± 0.315
0.337TyrTrp: 0.337 ± 0.066
1.566TyrTyr: 1.566 ± 0.164
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 249 proteins (59376 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski