Amino acid dipepetide frequency for Lymphocystis disease virus Sa

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.61AlaAla: 2.61 ± 0.225
1.224AlaCys: 1.224 ± 0.185
2.887AlaAsp: 2.887 ± 0.292
2.979AlaGlu: 2.979 ± 0.308
2.91AlaPhe: 2.91 ± 0.31
2.31AlaGly: 2.31 ± 0.242
0.716AlaHis: 0.716 ± 0.114
2.402AlaIle: 2.402 ± 0.244
2.379AlaLys: 2.379 ± 0.242
5.243AlaLeu: 5.243 ± 0.369
0.947AlaMet: 0.947 ± 0.149
1.062AlaAsn: 1.062 ± 0.157
1.316AlaPro: 1.316 ± 0.224
1.293AlaGln: 1.293 ± 0.183
1.848AlaArg: 1.848 ± 0.207
2.61AlaSer: 2.61 ± 0.206
1.432AlaThr: 1.432 ± 0.213
5.497AlaVal: 5.497 ± 0.432
0.323AlaTrp: 0.323 ± 0.098
2.148AlaTyr: 2.148 ± 0.219
0.0AlaXaa: 0.0 ± 0.0
Cys
0.647CysAla: 0.647 ± 0.123
0.554CysCys: 0.554 ± 0.135
1.34CysAsp: 1.34 ± 0.177
1.247CysGlu: 1.247 ± 0.167
1.178CysPhe: 1.178 ± 0.179
1.062CysGly: 1.062 ± 0.197
0.393CysHis: 0.393 ± 0.091
1.201CysIle: 1.201 ± 0.212
3.233CysLys: 3.233 ± 0.331
2.91CysLeu: 2.91 ± 0.302
0.323CysMet: 0.323 ± 0.088
1.062CysAsn: 1.062 ± 0.182
0.924CysPro: 0.924 ± 0.185
0.831CysGln: 0.831 ± 0.155
0.924CysArg: 0.924 ± 0.155
1.57CysSer: 1.57 ± 0.251
0.993CysThr: 0.993 ± 0.245
2.009CysVal: 2.009 ± 0.265
0.346CysTrp: 0.346 ± 0.093
1.155CysTyr: 1.155 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
3.049AspAla: 3.049 ± 0.266
1.224AspCys: 1.224 ± 0.18
3.534AspAsp: 3.534 ± 0.29
3.372AspGlu: 3.372 ± 0.359
3.141AspPhe: 3.141 ± 0.303
2.356AspGly: 2.356 ± 0.215
0.831AspHis: 0.831 ± 0.113
2.471AspIle: 2.471 ± 0.205
4.711AspLys: 4.711 ± 0.399
7.206AspLeu: 7.206 ± 0.417
0.531AspMet: 0.531 ± 0.102
1.594AspAsn: 1.594 ± 0.224
2.517AspPro: 2.517 ± 0.264
2.055AspGln: 2.055 ± 0.189
3.88AspArg: 3.88 ± 0.3
3.141AspSer: 3.141 ± 0.286
1.363AspThr: 1.363 ± 0.19
6.605AspVal: 6.605 ± 0.438
0.693AspTrp: 0.693 ± 0.151
3.164AspTyr: 3.164 ± 0.26
0.0AspXaa: 0.0 ± 0.0
Glu
3.002GluAla: 3.002 ± 0.355
1.409GluCys: 1.409 ± 0.223
3.903GluAsp: 3.903 ± 0.321
4.665GluGlu: 4.665 ± 0.445
2.656GluPhe: 2.656 ± 0.295
2.54GluGly: 2.54 ± 0.252
0.531GluHis: 0.531 ± 0.091
5.104GluIle: 5.104 ± 0.396
5.335GluLys: 5.335 ± 0.409
6.628GluLeu: 6.628 ± 0.506
1.039GluMet: 1.039 ± 0.143
3.372GluAsn: 3.372 ± 0.283
2.402GluPro: 2.402 ± 0.256
1.178GluGln: 1.178 ± 0.175
2.91GluArg: 2.91 ± 0.293
3.88GluSer: 3.88 ± 0.276
4.735GluThr: 4.735 ± 0.307
2.171GluVal: 2.171 ± 0.285
0.716GluTrp: 0.716 ± 0.143
2.933GluTyr: 2.933 ± 0.295
0.0GluXaa: 0.0 ± 0.0
Phe
1.917PheAla: 1.917 ± 0.219
1.132PheCys: 1.132 ± 0.156
3.233PheAsp: 3.233 ± 0.249
3.072PheGlu: 3.072 ± 0.186
2.679PhePhe: 2.679 ± 0.27
2.009PheGly: 2.009 ± 0.319
0.993PheHis: 0.993 ± 0.153
4.85PheIle: 4.85 ± 0.342
7.414PheLys: 7.414 ± 0.464
5.358PheLeu: 5.358 ± 0.365
1.409PheMet: 1.409 ± 0.181
4.088PheAsn: 4.088 ± 0.333
1.963PhePro: 1.963 ± 0.193
1.224PheGln: 1.224 ± 0.148
1.247PheArg: 1.247 ± 0.14
3.464PheSer: 3.464 ± 0.289
2.61PheThr: 2.61 ± 0.27
3.418PheVal: 3.418 ± 0.273
0.462PheTrp: 0.462 ± 0.096
2.864PheTyr: 2.864 ± 0.284
0.0PheXaa: 0.0 ± 0.0
Gly
1.524GlyAla: 1.524 ± 0.184
1.062GlyCys: 1.062 ± 0.155
2.517GlyAsp: 2.517 ± 0.295
2.471GlyGlu: 2.471 ± 0.343
2.425GlyPhe: 2.425 ± 0.231
1.848GlyGly: 1.848 ± 0.253
0.577GlyHis: 0.577 ± 0.151
2.795GlyIle: 2.795 ± 0.269
3.626GlyLys: 3.626 ± 0.364
4.573GlyLeu: 4.573 ± 0.409
0.508GlyMet: 0.508 ± 0.126
1.524GlyAsn: 1.524 ± 0.229
1.57GlyPro: 1.57 ± 0.522
1.34GlyGln: 1.34 ± 0.204
2.055GlyArg: 2.055 ± 0.269
3.28GlySer: 3.28 ± 0.367
2.24GlyThr: 2.24 ± 0.241
2.91GlyVal: 2.91 ± 0.282
0.508GlyTrp: 0.508 ± 0.115
2.517GlyTyr: 2.517 ± 0.217
0.0GlyXaa: 0.0 ± 0.0
His
0.485HisAla: 0.485 ± 0.111
0.416HisCys: 0.416 ± 0.09
0.554HisAsp: 0.554 ± 0.121
0.554HisGlu: 0.554 ± 0.134
0.924HisPhe: 0.924 ± 0.144
0.716HisGly: 0.716 ± 0.131
0.508HisHis: 0.508 ± 0.121
1.224HisIle: 1.224 ± 0.17
1.755HisLys: 1.755 ± 0.205
1.778HisLeu: 1.778 ± 0.192
0.162HisMet: 0.162 ± 0.054
0.67HisAsn: 0.67 ± 0.108
0.97HisPro: 0.97 ± 0.176
0.508HisGln: 0.508 ± 0.113
1.062HisArg: 1.062 ± 0.16
0.808HisSer: 0.808 ± 0.173
0.393HisThr: 0.393 ± 0.1
1.178HisVal: 1.178 ± 0.165
0.139HisTrp: 0.139 ± 0.065
0.831HisTyr: 0.831 ± 0.12
0.0HisXaa: 0.0 ± 0.0
Ile
3.626IleAla: 3.626 ± 0.264
1.894IleCys: 1.894 ± 0.23
4.411IleAsp: 4.411 ± 0.307
4.781IleGlu: 4.781 ± 0.38
4.296IlePhe: 4.296 ± 0.344
2.633IleGly: 2.633 ± 0.251
1.132IleHis: 1.132 ± 0.146
4.504IleIle: 4.504 ± 0.394
7.991IleLys: 7.991 ± 0.44
8.106IleLeu: 8.106 ± 0.474
1.247IleMet: 1.247 ± 0.142
3.972IleAsn: 3.972 ± 0.277
2.656IlePro: 2.656 ± 0.283
1.801IleGln: 1.801 ± 0.19
3.025IleArg: 3.025 ± 0.278
4.504IleSer: 4.504 ± 0.292
2.956IleThr: 2.956 ± 0.244
5.358IleVal: 5.358 ± 0.365
0.624IleTrp: 0.624 ± 0.112
3.464IleTyr: 3.464 ± 0.293
0.0IleXaa: 0.0 ± 0.0
Lys
3.788LysAla: 3.788 ± 0.321
2.194LysCys: 2.194 ± 0.284
4.573LysAsp: 4.573 ± 0.415
5.82LysGlu: 5.82 ± 0.5
4.735LysPhe: 4.735 ± 0.294
3.765LysGly: 3.765 ± 0.615
1.64LysHis: 1.64 ± 0.252
9.954LysIle: 9.954 ± 0.549
8.176LysLys: 8.176 ± 0.492
8.822LysLeu: 8.822 ± 0.413
1.594LysMet: 1.594 ± 0.224
6.651LysAsn: 6.651 ± 0.384
4.504LysPro: 4.504 ± 0.347
2.356LysGln: 2.356 ± 0.275
3.28LysArg: 3.28 ± 0.318
7.229LysSer: 7.229 ± 0.548
7.945LysThr: 7.945 ± 0.415
3.025LysVal: 3.025 ± 0.269
0.924LysTrp: 0.924 ± 0.17
4.642LysTyr: 4.642 ± 0.355
0.0LysXaa: 0.0 ± 0.0
Leu
3.857LeuAla: 3.857 ± 0.263
2.125LeuCys: 2.125 ± 0.224
4.365LeuAsp: 4.365 ± 0.281
5.982LeuGlu: 5.982 ± 0.413
4.919LeuPhe: 4.919 ± 0.429
3.418LeuGly: 3.418 ± 0.285
1.686LeuHis: 1.686 ± 0.228
9.746LeuIle: 9.746 ± 0.525
13.141LeuLys: 13.141 ± 0.72
8.638LeuLeu: 8.638 ± 0.523
2.171LeuMet: 2.171 ± 0.241
8.106LeuAsn: 8.106 ± 0.442
4.111LeuPro: 4.111 ± 0.311
2.979LeuGln: 2.979 ± 0.287
4.25LeuArg: 4.25 ± 0.305
6.213LeuSer: 6.213 ± 0.356
9.007LeuThr: 9.007 ± 0.462
3.626LeuVal: 3.626 ± 0.349
0.947LeuTrp: 0.947 ± 0.121
3.926LeuTyr: 3.926 ± 0.342
0.0LeuXaa: 0.0 ± 0.0
Met
0.693MetAla: 0.693 ± 0.103
0.577MetCys: 0.577 ± 0.117
0.878MetAsp: 0.878 ± 0.132
1.27MetGlu: 1.27 ± 0.185
1.062MetPhe: 1.062 ± 0.168
0.855MetGly: 0.855 ± 0.137
0.346MetHis: 0.346 ± 0.079
1.386MetIle: 1.386 ± 0.186
1.455MetLys: 1.455 ± 0.215
1.34MetLeu: 1.34 ± 0.171
0.416MetMet: 0.416 ± 0.095
1.709MetAsn: 1.709 ± 0.181
0.531MetPro: 0.531 ± 0.115
0.416MetGln: 0.416 ± 0.098
0.762MetArg: 0.762 ± 0.119
1.34MetSer: 1.34 ± 0.181
1.34MetThr: 1.34 ± 0.149
0.462MetVal: 0.462 ± 0.083
0.139MetTrp: 0.139 ± 0.059
0.808MetTyr: 0.808 ± 0.124
0.0MetXaa: 0.0 ± 0.0
Asn
2.356AsnAla: 2.356 ± 0.195
1.501AsnCys: 1.501 ± 0.257
2.841AsnAsp: 2.841 ± 0.238
2.402AsnGlu: 2.402 ± 0.238
3.672AsnPhe: 3.672 ± 0.349
1.894AsnGly: 1.894 ± 0.223
1.039AsnHis: 1.039 ± 0.167
3.072AsnIle: 3.072 ± 0.27
5.081AsnLys: 5.081 ± 0.38
6.444AsnLeu: 6.444 ± 0.389
1.247AsnMet: 1.247 ± 0.167
2.494AsnAsn: 2.494 ± 0.242
2.633AsnPro: 2.633 ± 0.28
1.524AsnGln: 1.524 ± 0.2
2.979AsnArg: 2.979 ± 0.268
3.256AsnSer: 3.256 ± 0.265
1.894AsnThr: 1.894 ± 0.228
5.012AsnVal: 5.012 ± 0.359
0.439AsnTrp: 0.439 ± 0.101
3.002AsnTyr: 3.002 ± 0.325
0.0AsnXaa: 0.0 ± 0.0
Pro
1.27ProAla: 1.27 ± 0.184
1.27ProCys: 1.27 ± 0.216
2.286ProAsp: 2.286 ± 0.23
2.633ProGlu: 2.633 ± 0.267
2.425ProPhe: 2.425 ± 0.206
2.286ProGly: 2.286 ± 0.771
0.647ProHis: 0.647 ± 0.115
2.564ProIle: 2.564 ± 0.256
3.095ProLys: 3.095 ± 0.304
3.903ProLeu: 3.903 ± 0.353
0.762ProMet: 0.762 ± 0.132
2.079ProAsn: 2.079 ± 0.308
1.94ProPro: 1.94 ± 0.345
1.201ProGln: 1.201 ± 0.176
1.501ProArg: 1.501 ± 0.182
2.402ProSer: 2.402 ± 0.216
1.409ProThr: 1.409 ± 0.187
3.21ProVal: 3.21 ± 0.278
0.254ProTrp: 0.254 ± 0.081
1.986ProTyr: 1.986 ± 0.261
0.0ProXaa: 0.0 ± 0.0
Gln
1.455GlnAla: 1.455 ± 0.192
0.346GlnCys: 0.346 ± 0.097
1.594GlnAsp: 1.594 ± 0.227
1.64GlnGlu: 1.64 ± 0.225
1.594GlnPhe: 1.594 ± 0.194
1.178GlnGly: 1.178 ± 0.22
0.346GlnHis: 0.346 ± 0.088
2.032GlnIle: 2.032 ± 0.249
2.702GlnLys: 2.702 ± 0.195
2.448GlnLeu: 2.448 ± 0.221
0.323GlnMet: 0.323 ± 0.088
1.871GlnAsn: 1.871 ± 0.18
1.178GlnPro: 1.178 ± 0.203
0.785GlnGln: 0.785 ± 0.161
1.34GlnArg: 1.34 ± 0.194
1.686GlnSer: 1.686 ± 0.189
2.194GlnThr: 2.194 ± 0.213
1.178GlnVal: 1.178 ± 0.168
0.254GlnTrp: 0.254 ± 0.067
1.363GlnTyr: 1.363 ± 0.154
0.0GlnXaa: 0.0 ± 0.0
Arg
1.617ArgAla: 1.617 ± 0.197
1.224ArgCys: 1.224 ± 0.153
1.547ArgAsp: 1.547 ± 0.228
1.871ArgGlu: 1.871 ± 0.227
2.933ArgPhe: 2.933 ± 0.295
1.825ArgGly: 1.825 ± 0.301
0.508ArgHis: 0.508 ± 0.121
4.042ArgIle: 4.042 ± 0.307
2.887ArgLys: 2.887 ± 0.25
5.012ArgLeu: 5.012 ± 0.318
0.739ArgMet: 0.739 ± 0.129
2.148ArgAsn: 2.148 ± 0.216
1.709ArgPro: 1.709 ± 0.189
1.062ArgGln: 1.062 ± 0.18
2.633ArgArg: 2.633 ± 0.249
3.557ArgSer: 3.557 ± 0.295
2.148ArgThr: 2.148 ± 0.188
2.148ArgVal: 2.148 ± 0.24
0.647ArgTrp: 0.647 ± 0.118
2.956ArgTyr: 2.956 ± 0.227
0.0ArgXaa: 0.0 ± 0.0
Ser
2.771SerAla: 2.771 ± 0.245
1.201SerCys: 1.201 ± 0.182
4.965SerAsp: 4.965 ± 0.402
3.972SerGlu: 3.972 ± 0.245
3.834SerPhe: 3.834 ± 0.314
2.702SerGly: 2.702 ± 0.273
0.924SerHis: 0.924 ± 0.156
4.088SerIle: 4.088 ± 0.309
5.728SerLys: 5.728 ± 0.399
5.935SerLeu: 5.935 ± 0.338
1.247SerMet: 1.247 ± 0.15
2.91SerAsn: 2.91 ± 0.272
2.125SerPro: 2.125 ± 0.359
1.755SerGln: 1.755 ± 0.213
2.517SerArg: 2.517 ± 0.203
3.534SerSer: 3.534 ± 0.29
3.303SerThr: 3.303 ± 0.302
5.589SerVal: 5.589 ± 0.362
0.647SerTrp: 0.647 ± 0.133
3.395SerTyr: 3.395 ± 0.288
0.0SerXaa: 0.0 ± 0.0
Thr
3.51ThrAla: 3.51 ± 0.306
1.155ThrCys: 1.155 ± 0.172
3.834ThrAsp: 3.834 ± 0.303
4.157ThrGlu: 4.157 ± 0.338
3.072ThrPhe: 3.072 ± 0.28
3.049ThrGly: 3.049 ± 0.292
0.993ThrHis: 0.993 ± 0.129
3.418ThrIle: 3.418 ± 0.296
3.233ThrLys: 3.233 ± 0.272
5.566ThrLeu: 5.566 ± 0.364
0.924ThrMet: 0.924 ± 0.124
2.356ThrAsn: 2.356 ± 0.23
2.102ThrPro: 2.102 ± 0.251
1.894ThrGln: 1.894 ± 0.204
1.917ThrArg: 1.917 ± 0.201
3.256ThrSer: 3.256 ± 0.25
3.51ThrThr: 3.51 ± 0.514
5.474ThrVal: 5.474 ± 0.361
0.393ThrTrp: 0.393 ± 0.1
2.055ThrTyr: 2.055 ± 0.222
0.0ThrXaa: 0.0 ± 0.0
Val
3.049ValAla: 3.049 ± 0.272
1.917ValCys: 1.917 ± 0.186
4.55ValAsp: 4.55 ± 0.277
5.127ValGlu: 5.127 ± 0.379
3.372ValPhe: 3.372 ± 0.244
2.61ValGly: 2.61 ± 0.267
0.924ValHis: 0.924 ± 0.136
4.758ValIle: 4.758 ± 0.331
7.668ValLys: 7.668 ± 0.418
7.229ValLeu: 7.229 ± 0.376
1.085ValMet: 1.085 ± 0.166
3.811ValAsn: 3.811 ± 0.283
2.24ValPro: 2.24 ± 0.289
1.594ValGln: 1.594 ± 0.231
2.887ValArg: 2.887 ± 0.218
3.834ValSer: 3.834 ± 0.32
3.303ValThr: 3.303 ± 0.363
4.25ValVal: 4.25 ± 0.37
0.462ValTrp: 0.462 ± 0.108
3.303ValTyr: 3.303 ± 0.28
0.0ValXaa: 0.0 ± 0.0
Trp
0.346TrpAla: 0.346 ± 0.091
0.208TrpCys: 0.208 ± 0.065
0.439TrpAsp: 0.439 ± 0.109
0.462TrpGlu: 0.462 ± 0.105
0.67TrpPhe: 0.67 ± 0.115
0.6TrpGly: 0.6 ± 0.124
0.115TrpHis: 0.115 ± 0.045
0.716TrpIle: 0.716 ± 0.112
0.808TrpLys: 0.808 ± 0.154
0.831TrpLeu: 0.831 ± 0.147
0.277TrpMet: 0.277 ± 0.077
0.554TrpAsn: 0.554 ± 0.091
0.323TrpPro: 0.323 ± 0.09
0.439TrpGln: 0.439 ± 0.102
0.231TrpArg: 0.231 ± 0.077
0.647TrpSer: 0.647 ± 0.111
0.831TrpThr: 0.831 ± 0.134
0.439TrpVal: 0.439 ± 0.103
0.139TrpTrp: 0.139 ± 0.056
0.439TrpTyr: 0.439 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.564TyrAla: 2.564 ± 0.246
1.27TyrCys: 1.27 ± 0.183
3.187TyrAsp: 3.187 ± 0.264
2.633TyrGlu: 2.633 ± 0.243
2.933TyrPhe: 2.933 ± 0.263
2.217TyrGly: 2.217 ± 0.23
0.785TyrHis: 0.785 ± 0.155
2.656TyrIle: 2.656 ± 0.279
5.196TyrLys: 5.196 ± 0.34
5.243TyrLeu: 5.243 ± 0.417
0.808TyrMet: 0.808 ± 0.146
2.748TyrAsn: 2.748 ± 0.267
1.409TyrPro: 1.409 ± 0.162
1.27TyrGln: 1.27 ± 0.151
1.963TyrArg: 1.963 ± 0.208
2.933TyrSer: 2.933 ± 0.257
2.263TyrThr: 2.263 ± 0.233
4.411TyrVal: 4.411 ± 0.329
0.462TyrTrp: 0.462 ± 0.103
2.841TyrTyr: 2.841 ± 0.273
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 183 proteins (43300 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski