Amino acid dipeptide frequency for Hyposoter fugitivus ichnovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
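As a minimal sketch of how such a frequency can be computed (not necessarily the procedure used to build this page), the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, one-letter codes are used for brevity, and normalising by the total number of counted pairs is an assumption.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

toy = ["MALLA", "LLK"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_frequencies(toy)
print(round(freqs["LL"], 1))
```

By construction, the frequencies returned for one set of sequences sum to 1000 ‰.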

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
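The expected value can be checked with a few lines of Python. This is only a sketch: the `classify` helper is hypothetical, and comparing against the uniform 1/400 baseline without taking the error into account is an assumption, since the exact colouring rule used for the table is not stated here.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

expected_fraction = 1 / N_STANDARD ** 2         # 1/400
expected_percent = 100 * expected_fraction      # 0.25 %
expected_per_mille = 1000 * expected_fraction   # 2.5 per mille

def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "overrepresented"
    if observed_per_mille < expected:
        return "underrepresented"
    return "as expected"

# Two values taken from the table on this page
print(classify(11.267))  # LeuLeu
print(classify(0.15))    # TrpTrp
```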

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
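For illustration, the unit conversion can be written as two small helpers (hypothetical names, not part of any published tool):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10

# AlaAla is listed at 5.689 per mille in the table on this page
print(f"{per_mille_to_fraction(5.689):.6f}")
print(f"{per_mille_to_percent(5.689):.4f}")
```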

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.689 ± 0.842
AlaCys: 1.385 ± 0.25
AlaAsp: 3.069 ± 0.34
AlaGlu: 3.743 ± 0.479
AlaPhe: 2.882 ± 0.448
AlaGly: 2.021 ± 0.255
AlaHis: 1.46 ± 0.288
AlaIle: 2.92 ± 0.388
AlaLys: 3.256 ± 0.397
AlaLeu: 4.903 ± 0.439
AlaMet: 1.16 ± 0.23
AlaAsn: 1.872 ± 0.266
AlaPro: 2.658 ± 0.419
AlaGln: 2.283 ± 0.328
AlaArg: 3.818 ± 0.377
AlaSer: 5.278 ± 0.638
AlaThr: 3.93 ± 0.324
AlaVal: 3.818 ± 0.366
AlaTrp: 0.674 ± 0.171
AlaTyr: 2.545 ± 0.294
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.535 ± 0.266
CysCys: 1.085 ± 0.177
CysAsp: 1.46 ± 0.419
CysGlu: 1.984 ± 0.252
CysPhe: 1.61 ± 0.224
CysGly: 1.198 ± 0.215
CysHis: 1.535 ± 0.227
CysIle: 1.797 ± 0.256
CysLys: 1.31 ± 0.219
CysLeu: 2.77 ± 0.303
CysMet: 0.636 ± 0.146
CysAsn: 1.16 ± 0.207
CysPro: 1.46 ± 0.228
CysGln: 1.273 ± 0.245
CysArg: 1.872 ± 0.283
CysSer: 3.893 ± 0.374
CysThr: 2.396 ± 0.301
CysVal: 1.834 ± 0.401
CysTrp: 0.823 ± 0.171
CysTyr: 0.861 ± 0.172
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.369 ± 0.524
AspCys: 1.872 ± 0.253
AspAsp: 2.62 ± 0.444
AspGlu: 4.005 ± 0.497
AspPhe: 2.059 ± 0.312
AspGly: 3.406 ± 0.481
AspHis: 1.048 ± 0.224
AspIle: 2.433 ± 0.328
AspLys: 1.909 ± 0.278
AspLeu: 2.957 ± 0.367
AspMet: 1.422 ± 0.222
AspAsn: 1.647 ± 0.246
AspPro: 1.984 ± 0.266
AspGln: 1.31 ± 0.277
AspArg: 2.433 ± 0.332
AspSer: 3.219 ± 0.403
AspThr: 2.396 ± 0.308
AspVal: 3.518 ± 0.67
AspTrp: 0.636 ± 0.138
AspTyr: 2.246 ± 0.266
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.481 ± 0.428
GluCys: 2.134 ± 0.268
GluAsp: 3.855 ± 0.389
GluGlu: 5.877 ± 0.894
GluPhe: 2.508 ± 0.278
GluGly: 2.208 ± 0.353
GluHis: 1.797 ± 0.282
GluIle: 3.144 ± 0.37
GluLys: 3.032 ± 0.368
GluLeu: 6.438 ± 0.656
GluMet: 1.385 ± 0.242
GluAsn: 2.695 ± 0.347
GluPro: 2.396 ± 0.41
GluGln: 3.444 ± 0.471
GluArg: 5.839 ± 0.94
GluSer: 4.155 ± 0.455
GluThr: 2.77 ± 0.343
GluVal: 2.658 ± 0.316
GluTrp: 0.561 ± 0.148
GluTyr: 2.77 ± 0.303
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.171 ± 0.269
PheCys: 1.759 ± 0.251
PheAsp: 2.545 ± 0.402
PheGlu: 1.984 ± 0.304
PhePhe: 3.107 ± 0.382
PheGly: 2.882 ± 0.31
PheHis: 3.219 ± 0.288
PheIle: 3.331 ± 0.341
PheLys: 1.535 ± 0.219
PheLeu: 5.353 ± 0.524
PheMet: 1.385 ± 0.232
PheAsn: 2.134 ± 0.247
PhePro: 1.872 ± 0.273
PheGln: 1.61 ± 0.253
PheArg: 3.219 ± 0.276
PheSer: 3.182 ± 0.359
PheThr: 2.695 ± 0.269
PheVal: 4.342 ± 0.412
PheTrp: 0.823 ± 0.156
PheTyr: 2.059 ± 0.291
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.92 ± 0.422
GlyCys: 1.085 ± 0.224
GlyAsp: 2.545 ± 0.359
GlyGlu: 3.032 ± 0.375
GlyPhe: 1.834 ± 0.3
GlyGly: 2.358 ± 0.258
GlyHis: 1.46 ± 0.252
GlyIle: 2.77 ± 0.329
GlyLys: 3.069 ± 0.355
GlyLeu: 2.845 ± 0.388
GlyMet: 1.048 ± 0.23
GlyAsn: 2.059 ± 0.278
GlyPro: 1.909 ± 0.238
GlyGln: 1.46 ± 0.241
GlyArg: 2.433 ± 0.286
GlySer: 2.957 ± 0.468
GlyThr: 2.171 ± 0.276
GlyVal: 2.246 ± 0.329
GlyTrp: 0.449 ± 0.13
GlyTyr: 1.123 ± 0.206
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.021 ± 0.264
HisCys: 0.936 ± 0.172
HisAsp: 1.16 ± 0.219
HisGlu: 1.422 ± 0.284
HisPhe: 2.957 ± 0.353
HisGly: 1.348 ± 0.233
HisHis: 1.872 ± 0.265
HisIle: 1.535 ± 0.255
HisLys: 0.823 ± 0.181
HisLeu: 3.93 ± 0.48
HisMet: 0.749 ± 0.175
HisAsn: 1.61 ± 0.282
HisPro: 1.422 ± 0.225
HisGln: 0.749 ± 0.185
HisArg: 1.872 ± 0.271
HisSer: 2.545 ± 0.401
HisThr: 1.46 ± 0.216
HisVal: 3.706 ± 0.382
HisTrp: 0.674 ± 0.145
HisTyr: 2.47 ± 0.318
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.069 ± 0.296
IleCys: 2.171 ± 0.291
IleAsp: 2.545 ± 0.278
IleGlu: 2.807 ± 0.325
IlePhe: 3.593 ± 0.385
IleGly: 2.396 ± 0.303
IleHis: 1.722 ± 0.274
IleIle: 3.069 ± 0.314
IleLys: 1.797 ± 0.221
IleLeu: 6.026 ± 0.446
IleMet: 1.235 ± 0.207
IleAsn: 3.219 ± 0.387
IlePro: 2.508 ± 0.409
IleGln: 2.283 ± 0.295
IleArg: 2.882 ± 0.319
IleSer: 3.481 ± 0.364
IleThr: 3.369 ± 0.395
IleVal: 3.444 ± 0.338
IleTrp: 0.374 ± 0.127
IleTyr: 1.834 ± 0.277
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.321 ± 0.376
LysCys: 1.422 ± 0.234
LysAsp: 1.61 ± 0.266
LysGlu: 3.294 ± 0.432
LysPhe: 2.433 ± 0.342
LysGly: 1.235 ± 0.235
LysHis: 1.647 ± 0.242
LysIle: 2.321 ± 0.313
LysLys: 2.433 ± 0.354
LysLeu: 4.903 ± 0.47
LysMet: 1.759 ± 0.25
LysAsn: 2.807 ± 0.391
LysPro: 2.807 ± 0.316
LysGln: 1.946 ± 0.297
LysArg: 3.294 ± 0.416
LysSer: 3.069 ± 0.433
LysThr: 2.62 ± 0.358
LysVal: 2.396 ± 0.351
LysTrp: 0.599 ± 0.149
LysTyr: 1.759 ± 0.264
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.727 ± 0.579
LeuCys: 2.658 ± 0.346
LeuAsp: 4.23 ± 0.48
LeuGlu: 4.567 ± 0.535
LeuPhe: 4.829 ± 0.459
LeuGly: 3.107 ± 0.426
LeuHis: 4.305 ± 0.451
LeuIle: 4.978 ± 0.442
LeuLys: 3.556 ± 0.381
LeuLeu: 11.267 ± 0.836
LeuMet: 2.583 ± 0.236
LeuAsn: 5.353 ± 0.475
LeuPro: 4.791 ± 0.438
LeuGln: 3.743 ± 0.402
LeuArg: 7.074 ± 0.491
LeuSer: 7.711 ± 0.596
LeuThr: 4.679 ± 0.492
LeuVal: 4.903 ± 0.424
LeuTrp: 2.845 ± 0.324
LeuTyr: 3.593 ± 0.301
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.497 ± 0.226
MetCys: 0.973 ± 0.172
MetAsp: 1.385 ± 0.241
MetGlu: 2.096 ± 0.271
MetPhe: 1.235 ± 0.195
MetGly: 0.636 ± 0.145
MetHis: 1.198 ± 0.193
MetIle: 1.422 ± 0.208
MetLys: 1.572 ± 0.251
MetLeu: 2.62 ± 0.332
MetMet: 0.861 ± 0.187
MetAsn: 1.946 ± 0.255
MetPro: 1.348 ± 0.22
MetGln: 0.823 ± 0.209
MetArg: 1.048 ± 0.171
MetSer: 2.583 ± 0.27
MetThr: 1.348 ± 0.212
MetVal: 0.973 ± 0.213
MetTrp: 0.412 ± 0.119
MetTyr: 0.599 ± 0.181
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.433 ± 0.401
AsnCys: 0.861 ± 0.177
AsnAsp: 1.872 ± 0.223
AsnGlu: 2.47 ± 0.387
AsnPhe: 2.92 ± 0.36
AsnGly: 2.994 ± 0.329
AsnHis: 1.31 ± 0.318
AsnIle: 2.321 ± 0.304
AsnLys: 2.134 ± 0.286
AsnLeu: 3.481 ± 0.339
AsnMet: 1.46 ± 0.261
AsnAsn: 1.759 ± 0.265
AsnPro: 2.059 ± 0.277
AsnGln: 0.936 ± 0.177
AsnArg: 3.069 ± 0.355
AsnSer: 3.406 ± 0.342
AsnThr: 2.658 ± 0.307
AsnVal: 3.032 ± 0.329
AsnTrp: 0.449 ± 0.17
AsnTyr: 2.47 ± 0.31
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.508 ± 0.293
ProCys: 1.797 ± 0.236
ProAsp: 1.759 ± 0.253
ProGlu: 3.107 ± 0.311
ProPhe: 1.422 ± 0.249
ProGly: 2.021 ± 0.269
ProHis: 1.16 ± 0.191
ProIle: 2.246 ± 0.235
ProLys: 1.797 ± 0.338
ProLeu: 4.342 ± 0.399
ProMet: 1.497 ± 0.197
ProAsn: 1.348 ± 0.233
ProPro: 2.545 ± 0.359
ProGln: 1.647 ± 0.233
ProArg: 2.208 ± 0.31
ProSer: 4.941 ± 0.58
ProThr: 3.144 ± 0.377
ProVal: 3.256 ± 0.395
ProTrp: 0.225 ± 0.083
ProTyr: 1.422 ± 0.207
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.583 ± 0.323
GlnCys: 1.123 ± 0.211
GlnAsp: 1.722 ± 0.269
GlnGlu: 3.182 ± 0.523
GlnPhe: 1.647 ± 0.325
GlnGly: 1.011 ± 0.215
GlnHis: 1.759 ± 0.246
GlnIle: 2.358 ± 0.268
GlnLys: 1.797 ± 0.283
GlnLeu: 3.893 ± 0.409
GlnMet: 0.749 ± 0.215
GlnAsn: 1.759 ± 0.298
GlnPro: 2.208 ± 0.282
GlnGln: 1.984 ± 0.337
GlnArg: 3.144 ± 0.648
GlnSer: 2.62 ± 0.331
GlnThr: 1.61 ± 0.283
GlnVal: 2.021 ± 0.285
GlnTrp: 0.299 ± 0.093
GlnTyr: 1.61 ± 0.22
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.043 ± 0.499
ArgCys: 1.684 ± 0.282
ArgAsp: 3.069 ± 0.372
ArgGlu: 5.39 ± 0.832
ArgPhe: 2.396 ± 0.27
ArgGly: 2.508 ± 0.324
ArgHis: 2.171 ± 0.284
ArgIle: 4.192 ± 0.438
ArgLys: 4.043 ± 0.572
ArgLeu: 6.101 ± 0.41
ArgMet: 2.321 ± 0.3
ArgAsn: 3.107 ± 0.391
ArgPro: 1.909 ± 0.294
ArgGln: 2.994 ± 0.49
ArgArg: 6.812 ± 1.015
ArgSer: 4.903 ± 0.603
ArgThr: 3.219 ± 0.419
ArgVal: 3.93 ± 0.402
ArgTrp: 0.936 ± 0.232
ArgTyr: 2.283 ± 0.275
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.978 ± 0.506
SerCys: 3.481 ± 0.371
SerAsp: 2.994 ± 0.415
SerGlu: 5.053 ± 0.658
SerPhe: 2.845 ± 0.337
SerGly: 3.743 ± 0.438
SerHis: 2.059 ± 0.275
SerIle: 4.417 ± 0.474
SerLys: 3.743 ± 0.566
SerLeu: 6.663 ± 0.51
SerMet: 1.647 ± 0.269
SerAsn: 2.62 ± 0.364
SerPro: 3.444 ± 0.434
SerGln: 4.043 ± 0.578
SerArg: 5.577 ± 0.481
SerSer: 7.524 ± 0.934
SerThr: 6.625 ± 1.212
SerVal: 4.978 ± 0.436
SerTrp: 1.273 ± 0.219
SerTyr: 1.684 ± 0.24
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.069 ± 0.398
ThrCys: 1.684 ± 0.259
ThrAsp: 2.62 ± 0.306
ThrGlu: 2.957 ± 0.456
ThrPhe: 3.182 ± 0.353
ThrGly: 2.283 ± 0.305
ThrHis: 2.096 ± 0.281
ThrIle: 3.182 ± 0.367
ThrLys: 2.807 ± 0.337
ThrLeu: 4.529 ± 0.417
ThrMet: 1.235 ± 0.215
ThrAsn: 1.872 ± 0.299
ThrPro: 2.171 ± 0.309
ThrGln: 1.984 ± 0.327
ThrArg: 2.807 ± 0.302
ThrSer: 6.663 ± 1.112
ThrThr: 4.679 ± 0.701
ThrVal: 4.604 ± 0.499
ThrTrp: 0.524 ± 0.151
ThrTyr: 1.722 ± 0.286
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.406 ± 0.358
ValCys: 1.946 ± 0.417
ValAsp: 3.406 ± 0.701
ValGlu: 3.406 ± 0.306
ValPhe: 3.93 ± 0.411
ValGly: 2.134 ± 0.329
ValHis: 2.433 ± 0.321
ValIle: 3.032 ± 0.388
ValLys: 3.256 ± 0.463
ValLeu: 7.187 ± 0.486
ValMet: 1.348 ± 0.226
ValAsn: 2.583 ± 0.281
ValPro: 2.732 ± 0.342
ValGln: 2.957 ± 0.37
ValArg: 4.192 ± 0.408
ValSer: 4.192 ± 0.45
ValThr: 3.219 ± 0.415
ValVal: 4.454 ± 0.633
ValTrp: 0.599 ± 0.149
ValTyr: 2.47 ± 0.339
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.374 ± 0.122
TrpCys: 0.262 ± 0.096
TrpAsp: 0.749 ± 0.172
TrpGlu: 0.636 ± 0.141
TrpPhe: 1.273 ± 0.259
TrpGly: 0.412 ± 0.129
TrpHis: 0.225 ± 0.094
TrpIle: 0.636 ± 0.145
TrpLys: 1.123 ± 0.179
TrpLeu: 2.882 ± 0.289
TrpMet: 0.524 ± 0.137
TrpAsn: 0.449 ± 0.133
TrpPro: 0.898 ± 0.15
TrpGln: 0.299 ± 0.118
TrpArg: 1.198 ± 0.212
TrpSer: 0.561 ± 0.132
TrpThr: 0.412 ± 0.112
TrpVal: 0.636 ± 0.151
TrpTrp: 0.15 ± 0.068
TrpTyr: 0.299 ± 0.12
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.171 ± 0.276
TyrCys: 2.171 ± 0.224
TyrAsp: 1.273 ± 0.187
TyrGlu: 2.134 ± 0.299
TyrPhe: 2.545 ± 0.342
TyrGly: 1.872 ± 0.33
TyrHis: 0.786 ± 0.185
TyrIle: 1.722 ± 0.289
TyrLys: 1.684 ± 0.259
TyrLeu: 3.406 ± 0.362
TyrMet: 1.535 ± 0.277
TyrAsn: 2.096 ± 0.267
TyrPro: 1.273 ± 0.231
TyrGln: 1.31 ± 0.221
TyrArg: 3.219 ± 0.259
TyrSer: 2.508 ± 0.322
TyrThr: 1.273 ± 0.227
TyrVal: 2.171 ± 0.234
TyrTrp: 0.674 ± 0.144
TyrTyr: 2.358 ± 0.679
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 143 proteins (26717 amino acids)
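
Each table entry carries a label of the form `AlaAla: 5.689 ± 0.842`, so the values can be read back into a lookup table. The sketch below is only one possible way to do this; the names `ENTRY` and `parse_entries` are hypothetical, and lines without such a label (for example the row headers) are simply skipped.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")

def parse_entries(lines):
    """Map (first, second) residue codes to (value, error), both in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table

sample = ["Leu", "LeuLeu: 11.267 ± 0.836", "TrpTrp: 0.15 ± 0.068"]
table = parse_entries(sample)
print(table[("Leu", "Leu")])
```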

Note: The error was estimated by bootstrapping (x100) at the protein level.
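
A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is an illustration under stated assumptions rather than the exact procedure behind the table; in particular, using the standard deviation as the error, the function names, and the toy sequences (one-letter codes) are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequency(sequences, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(resample, pair))
    return statistics.stdev(estimates)

toy = ["MALLA", "LLK", "MKLLLA", "AAK"]  # hypothetical sequences
print(bootstrap_error(toy, "LL"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.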

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski