Amino acid dipeptide frequency for Neodiprion sertifer nucleopolyhedrovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
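The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_frequencies`, the use of overlapping windows within each protein, the one-letter residue codes, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are counted as overlapping windows of length 2,
    and a window never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
toy_proteins = ["MKLLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With real data, the input would be the protein sequences of the proteome, and each resulting value could be compared against `EXPECTED_PER_MILLE`.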

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
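As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical, and treating exactly 1/400 as the cut-off between over- and underrepresented is an assumption based on the description above.

```python
EXPECTED_FRACTION = 1 / 400  # uniform expectation for 400 dipeptides (0.25%)


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def representation(value_per_mille, expected=EXPECTED_FRACTION):
    """Label a frequency relative to the uniform expectation."""
    fraction = per_mille_to_fraction(value_per_mille)
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "expected"


# Two values from the table: LeuLeu (8.202 per mille) and TrpTrp (0.082 per mille).
print("LeuLeu", per_mille_to_fraction(8.202), representation(8.202))
print("TrpTrp", per_mille_to_fraction(0.082), representation(0.082))
```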

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second residue. Each cell shows frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 1.731 ± 0.309 | 0.907 ± 0.204 | 1.937 ± 0.304 | 1.731 ± 0.194 | 1.443 ± 0.257 | 1.237 ± 0.208 | 0.989 ± 0.226 | 2.885 ± 0.354 | 2.597 ± 0.349 | 3.174 ± 0.43 | 1.03 ± 0.191 | 3.132 ± 0.326 | 0.701 ± 0.205 | 1.237 ± 0.244 | 1.278 ± 0.23 | 2.308 ± 0.295 | 2.061 ± 0.338 | 2.061 ± 0.353 | 0.371 ± 0.11 | 1.649 ± 0.271 | 0.0 ± 0.0
Cys | 0.989 ± 0.188 | 0.412 ± 0.179 | 2.143 ± 0.344 | 1.772 ± 0.34 | 1.195 ± 0.213 | 1.69 ± 0.289 | 0.577 ± 0.168 | 2.184 ± 0.281 | 1.978 ± 0.314 | 1.896 ± 0.247 | 0.577 ± 0.159 | 2.102 ± 0.235 | 1.03 ± 0.242 | 1.072 ± 0.176 | 0.824 ± 0.213 | 2.02 ± 0.314 | 1.525 ± 0.319 | 2.226 ± 0.309 | 0.041 ± 0.048 | 1.484 ± 0.223 | 0.0 ± 0.0
Asp | 2.555 ± 0.386 | 1.607 ± 0.291 | 5.07 ± 0.408 | 4.204 ± 0.419 | 3.339 ± 0.375 | 2.143 ± 0.349 | 1.113 ± 0.197 | 7.048 ± 0.421 | 4.822 ± 0.425 | 4.245 ± 0.373 | 1.731 ± 0.241 | 6.636 ± 0.444 | 1.484 ± 0.236 | 1.607 ± 0.26 | 1.937 ± 0.34 | 4.41 ± 0.476 | 4.287 ± 0.528 | 5.317 ± 0.466 | 0.577 ± 0.12 | 2.555 ± 0.309 | 0.0 ± 0.0
Glu | 1.607 ± 0.245 | 1.36 ± 0.254 | 3.05 ± 0.403 | 1.855 ± 0.278 | 2.72 ± 0.294 | 0.948 ± 0.204 | 1.731 ± 0.246 | 5.935 ± 0.693 | 3.916 ± 0.45 | 4.287 ± 0.417 | 1.484 ± 0.231 | 5.976 ± 0.602 | 0.989 ± 0.211 | 1.69 ± 0.288 | 1.36 ± 0.246 | 4.08 ± 0.545 | 4.039 ± 0.364 | 1.525 ± 0.239 | 0.412 ± 0.123 | 2.885 ± 0.299 | 0.0 ± 0.0
Phe | 2.061 ± 0.399 | 1.772 ± 0.307 | 4.699 ± 0.526 | 2.514 ± 0.316 | 2.143 ± 0.243 | 1.443 ± 0.282 | 1.154 ± 0.189 | 4.74 ± 0.387 | 3.256 ± 0.338 | 4.74 ± 0.385 | 0.948 ± 0.196 | 3.38 ± 0.384 | 1.03 ± 0.222 | 1.237 ± 0.206 | 1.772 ± 0.278 | 3.503 ± 0.515 | 2.968 ± 0.283 | 3.751 ± 0.421 | 0.371 ± 0.122 | 2.102 ± 0.272 | 0.0 ± 0.0
Gly | 1.154 ± 0.214 | 0.701 ± 0.207 | 2.061 ± 0.319 | 1.113 ± 0.19 | 1.896 ± 0.286 | 1.278 ± 0.357 | 0.618 ± 0.135 | 2.184 ± 0.298 | 1.937 ± 0.302 | 3.132 ± 0.5 | 0.412 ± 0.141 | 1.855 ± 0.26 | 0.577 ± 0.152 | 1.113 ± 0.222 | 1.195 ± 0.245 | 1.978 ± 0.265 | 1.401 ± 0.263 | 2.473 ± 0.315 | 0.289 ± 0.129 | 1.278 ± 0.234 | 0.0 ± 0.0
His | 0.948 ± 0.219 | 0.742 ± 0.178 | 1.855 ± 0.276 | 1.319 ± 0.287 | 1.072 ± 0.179 | 0.866 ± 0.178 | 0.206 ± 0.087 | 2.597 ± 0.367 | 1.649 ± 0.271 | 2.308 ± 0.387 | 0.783 ± 0.204 | 2.061 ± 0.251 | 0.618 ± 0.144 | 0.824 ± 0.181 | 1.03 ± 0.254 | 1.36 ± 0.219 | 1.237 ± 0.236 | 1.937 ± 0.387 | 0.165 ± 0.081 | 1.484 ± 0.254 | 0.0 ± 0.0
Ile | 3.132 ± 0.441 | 3.05 ± 0.342 | 7.914 ± 0.448 | 5.647 ± 0.453 | 4.74 ± 0.337 | 2.391 ± 0.304 | 2.061 ± 0.385 | 7.13 ± 0.674 | 5.894 ± 0.566 | 8.12 ± 0.468 | 2.184 ± 0.312 | 7.007 ± 0.636 | 3.256 ± 0.42 | 3.091 ± 0.399 | 3.462 ± 0.339 | 6.677 ± 0.494 | 6.471 ± 0.512 | 5.193 ± 0.485 | 0.412 ± 0.136 | 4.616 ± 0.493 | 0.0 ± 0.0
Lys | 1.978 ± 0.278 | 1.978 ± 0.315 | 2.597 ± 0.363 | 2.762 ± 0.472 | 3.627 ± 0.497 | 1.278 ± 0.244 | 2.267 ± 0.403 | 8.202 ± 0.679 | 6.471 ± 0.619 | 6.924 ± 0.608 | 2.143 ± 0.255 | 6.347 ± 0.583 | 2.391 ± 0.387 | 2.844 ± 0.364 | 3.668 ± 0.414 | 4.493 ± 0.445 | 4.822 ± 0.473 | 2.514 ± 0.365 | 0.783 ± 0.169 | 4.287 ± 0.483 | 0.0 ± 0.0
Leu | 3.132 ± 0.323 | 2.762 ± 0.333 | 4.039 ± 0.489 | 3.462 ± 0.424 | 4.204 ± 0.438 | 2.184 ± 0.314 | 2.885 ± 0.383 | 7.708 ± 0.687 | 6.924 ± 0.623 | 8.202 ± 0.738 | 2.184 ± 0.257 | 6.265 ± 0.451 | 2.679 ± 0.293 | 4.122 ± 0.415 | 3.215 ± 0.437 | 7.749 ± 0.534 | 4.575 ± 0.564 | 4.204 ± 0.454 | 0.577 ± 0.139 | 5.729 ± 0.484 | 0.0 ± 0.0
Met | 0.783 ± 0.17 | 0.659 ± 0.185 | 1.237 ± 0.224 | 1.319 ± 0.25 | 1.772 ± 0.248 | 0.412 ± 0.131 | 0.371 ± 0.124 | 2.597 ± 0.257 | 2.391 ± 0.354 | 2.885 ± 0.408 | 0.866 ± 0.203 | 1.772 ± 0.278 | 0.742 ± 0.146 | 0.824 ± 0.199 | 0.824 ± 0.164 | 2.143 ± 0.312 | 1.195 ± 0.219 | 0.989 ± 0.229 | 0.247 ± 0.101 | 1.525 ± 0.231 | 0.0 ± 0.0
Asn | 2.638 ± 0.286 | 1.896 ± 0.276 | 6.059 ± 0.536 | 4.987 ± 0.65 | 3.421 ± 0.41 | 2.885 ± 0.342 | 2.102 ± 0.284 | 8.532 ± 0.648 | 5.77 ± 0.692 | 6.43 ± 0.598 | 2.102 ± 0.321 | 7.543 ± 0.638 | 1.978 ± 0.296 | 2.679 ± 0.313 | 3.009 ± 0.389 | 4.369 ± 0.404 | 6.018 ± 0.514 | 7.13 ± 0.643 | 0.412 ± 0.124 | 3.091 ± 0.379 | 0.0 ± 0.0
Pro | 0.824 ± 0.198 | 0.989 ± 0.22 | 1.896 ± 0.293 | 1.649 ± 0.256 | 1.525 ± 0.238 | 0.618 ± 0.16 | 0.659 ± 0.161 | 2.514 ± 0.289 | 1.566 ± 0.214 | 2.391 ± 0.264 | 0.536 ± 0.167 | 1.978 ± 0.296 | 0.659 ± 0.18 | 1.401 ± 0.26 | 1.278 ± 0.267 | 1.937 ± 0.287 | 1.69 ± 0.34 | 1.937 ± 0.297 | 0.041 ± 0.032 | 1.36 ± 0.289 | 0.0 ± 0.0
Gln | 0.824 ± 0.282 | 0.989 ± 0.207 | 1.607 ± 0.249 | 1.69 ± 0.266 | 1.072 ± 0.225 | 0.412 ± 0.123 | 1.072 ± 0.234 | 3.792 ± 0.442 | 2.803 ± 0.401 | 3.916 ± 0.429 | 0.989 ± 0.202 | 3.71 ± 0.458 | 1.36 ± 0.186 | 1.896 ± 0.307 | 1.607 ± 0.271 | 2.555 ± 0.276 | 2.72 ± 0.315 | 1.072 ± 0.235 | 0.412 ± 0.157 | 2.308 ± 0.299 | 0.0 ± 0.0
Arg | 0.948 ± 0.205 | 1.154 ± 0.222 | 1.937 ± 0.291 | 1.937 ± 0.287 | 2.061 ± 0.317 | 0.866 ± 0.209 | 1.36 ± 0.259 | 2.968 ± 0.356 | 2.968 ± 0.349 | 3.998 ± 0.32 | 1.195 ± 0.239 | 3.132 ± 0.44 | 0.701 ± 0.182 | 2.391 ± 0.335 | 1.607 ± 0.318 | 3.339 ± 0.675 | 2.349 ± 0.338 | 1.443 ± 0.206 | 0.165 ± 0.084 | 1.69 ± 0.249 | 0.0 ± 0.0
Ser | 2.267 ± 0.314 | 1.401 ± 0.323 | 5.193 ± 0.482 | 4.328 ± 0.419 | 3.503 ± 0.35 | 2.02 ± 0.336 | 1.566 ± 0.242 | 6.1 ± 0.561 | 5.111 ± 0.474 | 5.77 ± 0.449 | 1.896 ± 0.233 | 5.235 ± 0.528 | 2.061 ± 0.33 | 2.555 ± 0.337 | 3.38 ± 0.552 | 4.699 ± 0.586 | 5.482 ± 0.582 | 4.493 ± 0.456 | 0.701 ± 0.188 | 3.091 ± 0.338 | 0.0 ± 0.0
Thr | 2.597 ± 0.412 | 1.401 ± 0.288 | 5.523 ± 0.502 | 2.926 ± 0.353 | 3.545 ± 0.385 | 1.978 ± 0.34 | 1.113 ± 0.229 | 5.235 ± 0.397 | 3.668 ± 0.403 | 5.647 ± 0.403 | 1.278 ± 0.22 | 5.853 ± 0.467 | 2.061 ± 0.32 | 2.061 ± 0.34 | 2.803 ± 0.417 | 4.946 ± 0.543 | 5.523 ± 0.757 | 4.287 ± 0.427 | 0.33 ± 0.119 | 2.679 ± 0.426 | 0.0 ± 0.0
Val | 1.978 ± 0.317 | 2.391 ± 0.351 | 3.916 ± 0.546 | 3.751 ± 0.404 | 3.421 ± 0.466 | 1.649 ± 0.227 | 1.401 ± 0.22 | 5.111 ± 0.437 | 4.74 ± 0.457 | 4.328 ± 0.418 | 1.607 ± 0.264 | 4.575 ± 0.404 | 1.649 ± 0.256 | 2.061 ± 0.293 | 1.937 ± 0.203 | 4.905 ± 0.465 | 3.751 ± 0.378 | 4.204 ± 0.362 | 0.536 ± 0.139 | 2.844 ± 0.37 | 0.0 ± 0.0
Trp | 0.289 ± 0.097 | 0.124 ± 0.076 | 0.206 ± 0.088 | 0.33 ± 0.132 | 0.371 ± 0.124 | 0.577 ± 0.201 | 0.165 ± 0.08 | 0.495 ± 0.155 | 0.453 ± 0.148 | 0.701 ± 0.168 | 0.165 ± 0.074 | 0.412 ± 0.115 | 0.247 ± 0.108 | 0.412 ± 0.128 | 0.33 ± 0.113 | 0.701 ± 0.135 | 0.412 ± 0.156 | 0.412 ± 0.154 | 0.082 ± 0.057 | 0.289 ± 0.112 | 0.0 ± 0.0
Tyr | 2.02 ± 0.314 | 1.443 ± 0.265 | 3.751 ± 0.442 | 2.555 ± 0.342 | 2.514 ± 0.32 | 1.772 ± 0.293 | 1.649 ± 0.279 | 4.493 ± 0.349 | 3.462 ± 0.35 | 3.71 ± 0.401 | 1.401 ± 0.229 | 4.08 ± 0.474 | 1.237 ± 0.219 | 1.69 ± 0.261 | 1.69 ± 0.271 | 2.72 ± 0.292 | 2.926 ± 0.37 | 3.586 ± 0.318 | 0.247 ± 0.107 | 2.102 ± 0.314 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 90 proteins (24263 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
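One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The exact procedure used by the database is not described here, so the function names, the use of the sample standard deviation, the fixed seed, and the toy sequences are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Assumption: each round draws len(sequences) proteins with replacement,
    and the error is the standard deviation over n_rounds estimates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
toy_proteins = ["MKLLA", "MKA", "LLLLK", "AKLM"]
print(bootstrap_error(toy_proteins, "LL"))
```

Resampling whole proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.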

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski