Amino acid dipepetide frequency for Neodiprion abietis NPV

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.896AlaAla: 1.896 ± 0.302
0.968AlaCys: 0.968 ± 0.219
2.138AlaAsp: 2.138 ± 0.269
1.735AlaGlu: 1.735 ± 0.264
1.371AlaPhe: 1.371 ± 0.216
1.371AlaGly: 1.371 ± 0.279
0.887AlaHis: 0.887 ± 0.179
3.187AlaIle: 3.187 ± 0.405
3.066AlaLys: 3.066 ± 0.374
2.945AlaLeu: 2.945 ± 0.339
0.807AlaMet: 0.807 ± 0.155
3.509AlaAsn: 3.509 ± 0.331
0.686AlaPro: 0.686 ± 0.194
1.17AlaGln: 1.17 ± 0.198
1.17AlaArg: 1.17 ± 0.233
2.098AlaSer: 2.098 ± 0.309
2.824AlaThr: 2.824 ± 0.39
2.098AlaVal: 2.098 ± 0.321
0.04AlaTrp: 0.04 ± 0.034
1.775AlaTyr: 1.775 ± 0.292
0.0AlaXaa: 0.0 ± 0.0
Cys
0.766CysAla: 0.766 ± 0.191
0.484CysCys: 0.484 ± 0.166
2.34CysAsp: 2.34 ± 0.323
1.412CysGlu: 1.412 ± 0.301
1.371CysPhe: 1.371 ± 0.271
1.17CysGly: 1.17 ± 0.278
0.887CysHis: 0.887 ± 0.199
1.654CysIle: 1.654 ± 0.294
1.412CysLys: 1.412 ± 0.273
2.299CysLeu: 2.299 ± 0.317
0.363CysMet: 0.363 ± 0.123
2.299CysAsn: 2.299 ± 0.363
1.21CysPro: 1.21 ± 0.205
1.089CysGln: 1.089 ± 0.22
1.331CysArg: 1.331 ± 0.223
1.371CysSer: 1.371 ± 0.244
1.694CysThr: 1.694 ± 0.199
1.977CysVal: 1.977 ± 0.275
0.807CysTrp: 0.807 ± 0.31
0.928CysTyr: 0.928 ± 0.189
0.0CysXaa: 0.0 ± 0.0
Asp
2.582AspAla: 2.582 ± 0.272
1.775AspCys: 1.775 ± 0.311
5.365AspAsp: 5.365 ± 0.495
4.316AspGlu: 4.316 ± 0.413
2.662AspPhe: 2.662 ± 0.346
2.299AspGly: 2.299 ± 0.358
0.968AspHis: 0.968 ± 0.197
6.696AspIle: 6.696 ± 0.605
5.405AspLys: 5.405 ± 0.485
4.8AspLeu: 4.8 ± 0.431
1.694AspMet: 1.694 ± 0.292
5.204AspAsn: 5.204 ± 0.359
1.654AspPro: 1.654 ± 0.284
1.694AspGln: 1.694 ± 0.278
1.735AspArg: 1.735 ± 0.268
3.872AspSer: 3.872 ± 0.392
3.913AspThr: 3.913 ± 0.567
5.526AspVal: 5.526 ± 0.634
0.444AspTrp: 0.444 ± 0.122
3.388AspTyr: 3.388 ± 0.362
0.0AspXaa: 0.0 ± 0.0
Glu
1.412GluAla: 1.412 ± 0.26
1.21GluCys: 1.21 ± 0.211
2.824GluAsp: 2.824 ± 0.347
2.219GluGlu: 2.219 ± 0.416
2.299GluPhe: 2.299 ± 0.283
0.686GluGly: 0.686 ± 0.205
1.291GluHis: 1.291 ± 0.236
5.889GluIle: 5.889 ± 0.481
3.993GluLys: 3.993 ± 0.473
4.558GluLeu: 4.558 ± 0.369
1.291GluMet: 1.291 ± 0.217
5.244GluAsn: 5.244 ± 0.515
1.492GluPro: 1.492 ± 0.249
1.694GluGln: 1.694 ± 0.278
2.501GluArg: 2.501 ± 0.317
3.953GluSer: 3.953 ± 0.395
4.235GluThr: 4.235 ± 0.495
1.694GluVal: 1.694 ± 0.27
0.484GluTrp: 0.484 ± 0.125
3.066GluTyr: 3.066 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
2.34PheAla: 2.34 ± 0.316
1.331PheCys: 1.331 ± 0.255
4.598PheAsp: 4.598 ± 0.483
2.703PheGlu: 2.703 ± 0.365
2.299PhePhe: 2.299 ± 0.285
1.291PheGly: 1.291 ± 0.251
0.686PheHis: 0.686 ± 0.149
3.671PheIle: 3.671 ± 0.401
3.509PheLys: 3.509 ± 0.387
4.195PheLeu: 4.195 ± 0.404
1.25PheMet: 1.25 ± 0.19
2.824PheAsn: 2.824 ± 0.421
1.008PhePro: 1.008 ± 0.208
1.412PheGln: 1.412 ± 0.264
2.057PheArg: 2.057 ± 0.296
2.662PheSer: 2.662 ± 0.37
3.227PheThr: 3.227 ± 0.374
4.76PheVal: 4.76 ± 0.457
0.323PheTrp: 0.323 ± 0.101
2.38PheTyr: 2.38 ± 0.25
0.0PheXaa: 0.0 ± 0.0
Gly
0.928GlyAla: 0.928 ± 0.181
1.089GlyCys: 1.089 ± 0.193
2.098GlyAsp: 2.098 ± 0.304
1.21GlyGlu: 1.21 ± 0.193
1.291GlyPhe: 1.291 ± 0.22
1.129GlyGly: 1.129 ± 0.244
0.524GlyHis: 0.524 ± 0.2
2.582GlyIle: 2.582 ± 0.358
1.573GlyLys: 1.573 ± 0.245
2.703GlyLeu: 2.703 ± 0.367
0.403GlyMet: 0.403 ± 0.121
2.017GlyAsn: 2.017 ± 0.222
0.565GlyPro: 0.565 ± 0.164
1.008GlyGln: 1.008 ± 0.223
1.089GlyArg: 1.089 ± 0.18
2.42GlySer: 2.42 ± 0.392
1.856GlyThr: 1.856 ± 0.271
1.936GlyVal: 1.936 ± 0.312
0.524GlyTrp: 0.524 ± 0.131
1.412GlyTyr: 1.412 ± 0.233
0.0GlyXaa: 0.0 ± 0.0
His
0.847HisAla: 0.847 ± 0.193
0.484HisCys: 0.484 ± 0.19
1.654HisAsp: 1.654 ± 0.272
1.452HisGlu: 1.452 ± 0.291
1.008HisPhe: 1.008 ± 0.197
0.887HisGly: 0.887 ± 0.179
0.524HisHis: 0.524 ± 0.129
2.017HisIle: 2.017 ± 0.373
1.492HisLys: 1.492 ± 0.265
2.178HisLeu: 2.178 ± 0.415
0.726HisMet: 0.726 ± 0.169
1.533HisAsn: 1.533 ± 0.27
0.565HisPro: 0.565 ± 0.156
0.484HisGln: 0.484 ± 0.121
1.17HisArg: 1.17 ± 0.196
1.291HisSer: 1.291 ± 0.235
1.452HisThr: 1.452 ± 0.208
2.622HisVal: 2.622 ± 0.368
0.161HisTrp: 0.161 ± 0.087
0.847HisTyr: 0.847 ± 0.169
0.0HisXaa: 0.0 ± 0.0
Ile
3.751IleAla: 3.751 ± 0.407
3.187IleCys: 3.187 ± 0.439
5.849IleAsp: 5.849 ± 0.465
4.598IleGlu: 4.598 ± 0.461
4.397IlePhe: 4.397 ± 0.327
2.259IleGly: 2.259 ± 0.299
2.017IleHis: 2.017 ± 0.306
6.01IleIle: 6.01 ± 0.442
6.172IleLys: 6.172 ± 0.618
7.462IleLeu: 7.462 ± 0.588
2.219IleMet: 2.219 ± 0.316
6.777IleAsn: 6.777 ± 0.457
2.904IlePro: 2.904 ± 0.314
2.703IleGln: 2.703 ± 0.376
3.469IleArg: 3.469 ± 0.404
5.849IleSer: 5.849 ± 0.478
5.889IleThr: 5.889 ± 0.43
6.615IleVal: 6.615 ± 0.434
0.323IleTrp: 0.323 ± 0.154
4.598IleTyr: 4.598 ± 0.529
0.0IleXaa: 0.0 ± 0.0
Lys
1.896LysAla: 1.896 ± 0.282
1.896LysCys: 1.896 ± 0.322
2.945LysAsp: 2.945 ± 0.399
2.461LysGlu: 2.461 ± 0.314
4.558LysPhe: 4.558 ± 0.547
1.17LysGly: 1.17 ± 0.204
2.662LysHis: 2.662 ± 0.347
7.785LysIle: 7.785 ± 0.686
6.01LysLys: 6.01 ± 0.561
6.494LysLeu: 6.494 ± 0.659
2.178LysMet: 2.178 ± 0.361
6.333LysAsn: 6.333 ± 0.474
1.654LysPro: 1.654 ± 0.269
2.904LysGln: 2.904 ± 0.467
3.146LysArg: 3.146 ± 0.374
5.042LysSer: 5.042 ± 0.481
5.567LysThr: 5.567 ± 0.427
1.856LysVal: 1.856 ± 0.298
0.645LysTrp: 0.645 ± 0.152
4.558LysTyr: 4.558 ± 0.426
0.0LysXaa: 0.0 ± 0.0
Leu
3.348LeuAla: 3.348 ± 0.325
2.178LeuCys: 2.178 ± 0.326
4.598LeuAsp: 4.598 ± 0.381
4.477LeuGlu: 4.477 ± 0.437
4.195LeuPhe: 4.195 ± 0.334
2.138LeuGly: 2.138 ± 0.295
2.501LeuHis: 2.501 ± 0.435
8.027LeuIle: 8.027 ± 0.53
6.736LeuLys: 6.736 ± 0.517
7.866LeuLeu: 7.866 ± 0.603
2.259LeuMet: 2.259 ± 0.227
6.575LeuAsn: 6.575 ± 0.578
3.509LeuPro: 3.509 ± 0.391
4.639LeuGln: 4.639 ± 0.508
3.832LeuArg: 3.832 ± 0.557
7.301LeuSer: 7.301 ± 0.587
5.526LeuThr: 5.526 ± 0.587
4.316LeuVal: 4.316 ± 0.382
0.847LeuTrp: 0.847 ± 0.128
5.889LeuTyr: 5.889 ± 0.574
0.0LeuXaa: 0.0 ± 0.0
Met
0.524MetAla: 0.524 ± 0.132
1.17MetCys: 1.17 ± 0.25
1.412MetAsp: 1.412 ± 0.241
1.089MetGlu: 1.089 ± 0.209
1.533MetPhe: 1.533 ± 0.216
0.403MetGly: 0.403 ± 0.132
0.524MetHis: 0.524 ± 0.153
1.856MetIle: 1.856 ± 0.292
1.331MetLys: 1.331 ± 0.258
2.824MetLeu: 2.824 ± 0.363
0.766MetMet: 0.766 ± 0.193
1.654MetAsn: 1.654 ± 0.274
0.807MetPro: 0.807 ± 0.151
0.807MetGln: 0.807 ± 0.163
0.968MetArg: 0.968 ± 0.175
2.461MetSer: 2.461 ± 0.306
1.775MetThr: 1.775 ± 0.243
0.968MetVal: 0.968 ± 0.172
0.242MetTrp: 0.242 ± 0.095
1.089MetTyr: 1.089 ± 0.212
0.0MetXaa: 0.0 ± 0.0
Asn
3.025AsnAla: 3.025 ± 0.316
1.856AsnCys: 1.856 ± 0.231
5.688AsnAsp: 5.688 ± 0.483
4.679AsnGlu: 4.679 ± 0.543
3.59AsnPhe: 3.59 ± 0.386
2.461AsnGly: 2.461 ± 0.296
1.452AsnHis: 1.452 ± 0.219
8.269AsnIle: 8.269 ± 0.495
6.01AsnLys: 6.01 ± 0.568
7.503AsnLeu: 7.503 ± 0.678
1.977AsnMet: 1.977 ± 0.297
6.817AsnAsn: 6.817 ± 0.759
2.017AsnPro: 2.017 ± 0.297
2.34AsnGln: 2.34 ± 0.319
3.066AsnArg: 3.066 ± 0.38
3.792AsnSer: 3.792 ± 0.429
5.325AsnThr: 5.325 ± 0.496
7.543AsnVal: 7.543 ± 0.642
0.484AsnTrp: 0.484 ± 0.138
3.429AsnTyr: 3.429 ± 0.317
0.0AsnXaa: 0.0 ± 0.0
Pro
1.291ProAla: 1.291 ± 0.231
0.484ProCys: 0.484 ± 0.164
1.654ProAsp: 1.654 ± 0.227
1.492ProGlu: 1.492 ± 0.261
1.371ProPhe: 1.371 ± 0.218
0.847ProGly: 0.847 ± 0.188
0.766ProHis: 0.766 ± 0.158
2.178ProIle: 2.178 ± 0.303
1.977ProLys: 1.977 ± 0.273
2.299ProLeu: 2.299 ± 0.336
0.363ProMet: 0.363 ± 0.129
2.38ProAsn: 2.38 ± 0.318
0.766ProPro: 0.766 ± 0.196
1.291ProGln: 1.291 ± 0.237
1.331ProArg: 1.331 ± 0.266
1.613ProSer: 1.613 ± 0.273
2.178ProThr: 2.178 ± 0.336
1.815ProVal: 1.815 ± 0.291
0.161ProTrp: 0.161 ± 0.078
1.492ProTyr: 1.492 ± 0.26
0.0ProXaa: 0.0 ± 0.0
Gln
1.129GlnAla: 1.129 ± 0.223
0.887GlnCys: 0.887 ± 0.228
1.089GlnAsp: 1.089 ± 0.187
2.098GlnGlu: 2.098 ± 0.256
1.492GlnPhe: 1.492 ± 0.272
0.605GlnGly: 0.605 ± 0.134
1.17GlnHis: 1.17 ± 0.244
3.993GlnIle: 3.993 ± 0.423
3.106GlnLys: 3.106 ± 0.377
3.348GlnLeu: 3.348 ± 0.364
1.089GlnMet: 1.089 ± 0.224
3.509GlnAsn: 3.509 ± 0.333
0.968GlnPro: 0.968 ± 0.201
2.017GlnGln: 2.017 ± 0.327
1.331GlnArg: 1.331 ± 0.212
2.259GlnSer: 2.259 ± 0.319
2.541GlnThr: 2.541 ± 0.292
1.049GlnVal: 1.049 ± 0.193
0.282GlnTrp: 0.282 ± 0.103
1.654GlnTyr: 1.654 ± 0.22
0.0GlnXaa: 0.0 ± 0.0
Arg
1.129ArgAla: 1.129 ± 0.183
1.492ArgCys: 1.492 ± 0.346
2.34ArgAsp: 2.34 ± 0.405
2.138ArgGlu: 2.138 ± 0.347
2.299ArgPhe: 2.299 ± 0.35
1.089ArgGly: 1.089 ± 0.25
0.968ArgHis: 0.968 ± 0.191
2.985ArgIle: 2.985 ± 0.323
2.582ArgLys: 2.582 ± 0.338
4.598ArgLeu: 4.598 ± 0.449
1.412ArgMet: 1.412 ± 0.205
3.025ArgAsn: 3.025 ± 0.4
1.008ArgPro: 1.008 ± 0.201
1.775ArgGln: 1.775 ± 0.273
1.452ArgArg: 1.452 ± 0.246
3.066ArgSer: 3.066 ± 0.385
2.461ArgThr: 2.461 ± 0.277
1.977ArgVal: 1.977 ± 0.365
0.242ArgTrp: 0.242 ± 0.089
2.098ArgTyr: 2.098 ± 0.312
0.0ArgXaa: 0.0 ± 0.0
Ser
1.815SerAla: 1.815 ± 0.301
1.492SerCys: 1.492 ± 0.255
4.76SerAsp: 4.76 ± 0.486
3.509SerGlu: 3.509 ± 0.406
3.671SerPhe: 3.671 ± 0.358
2.461SerGly: 2.461 ± 0.419
1.25SerHis: 1.25 ± 0.25
5.486SerIle: 5.486 ± 0.412
4.719SerLys: 4.719 ± 0.45
6.293SerLeu: 6.293 ± 0.391
1.533SerMet: 1.533 ± 0.241
5.889SerAsn: 5.889 ± 0.501
1.936SerPro: 1.936 ± 0.316
2.259SerGln: 2.259 ± 0.319
2.34SerArg: 2.34 ± 0.373
5.768SerSer: 5.768 ± 0.572
5.365SerThr: 5.365 ± 0.543
4.961SerVal: 4.961 ± 0.436
0.766SerTrp: 0.766 ± 0.183
2.501SerTyr: 2.501 ± 0.285
0.0SerXaa: 0.0 ± 0.0
Thr
2.622ThrAla: 2.622 ± 0.421
1.331ThrCys: 1.331 ± 0.261
5.365ThrAsp: 5.365 ± 0.459
3.59ThrGlu: 3.59 ± 0.42
3.429ThrPhe: 3.429 ± 0.451
2.259ThrGly: 2.259 ± 0.297
1.412ThrHis: 1.412 ± 0.244
5.809ThrIle: 5.809 ± 0.505
3.953ThrLys: 3.953 ± 0.383
7.22ThrLeu: 7.22 ± 0.485
1.371ThrMet: 1.371 ± 0.212
6.01ThrAsn: 6.01 ± 0.475
1.815ThrPro: 1.815 ± 0.261
2.299ThrGln: 2.299 ± 0.347
2.864ThrArg: 2.864 ± 0.328
5.204ThrSer: 5.204 ± 0.481
5.526ThrThr: 5.526 ± 0.645
3.832ThrVal: 3.832 ± 0.406
0.524ThrTrp: 0.524 ± 0.106
3.106ThrTyr: 3.106 ± 0.425
0.0ThrXaa: 0.0 ± 0.0
Val
2.017ValAla: 2.017 ± 0.345
1.896ValCys: 1.896 ± 0.308
5.325ValAsp: 5.325 ± 0.466
3.711ValGlu: 3.711 ± 0.451
3.187ValPhe: 3.187 ± 0.427
1.694ValGly: 1.694 ± 0.302
1.533ValHis: 1.533 ± 0.244
4.316ValIle: 4.316 ± 0.432
4.719ValLys: 4.719 ± 0.449
5.042ValLeu: 5.042 ± 0.469
1.21ValMet: 1.21 ± 0.18
5.405ValAsn: 5.405 ± 0.471
1.775ValPro: 1.775 ± 0.303
2.178ValGln: 2.178 ± 0.291
2.541ValArg: 2.541 ± 0.265
4.921ValSer: 4.921 ± 0.444
4.276ValThr: 4.276 ± 0.529
4.114ValVal: 4.114 ± 0.428
0.444ValTrp: 0.444 ± 0.133
3.509ValTyr: 3.509 ± 0.396
0.0ValXaa: 0.0 ± 0.0
Trp
0.605TrpAla: 0.605 ± 0.274
0.202TrpCys: 0.202 ± 0.101
0.323TrpAsp: 0.323 ± 0.116
0.282TrpGlu: 0.282 ± 0.097
0.444TrpPhe: 0.444 ± 0.149
0.323TrpGly: 0.323 ± 0.127
0.403TrpHis: 0.403 ± 0.143
0.403TrpIle: 0.403 ± 0.107
0.524TrpLys: 0.524 ± 0.17
0.847TrpLeu: 0.847 ± 0.194
0.202TrpMet: 0.202 ± 0.103
0.524TrpAsn: 0.524 ± 0.144
0.323TrpPro: 0.323 ± 0.109
0.403TrpGln: 0.403 ± 0.118
0.323TrpArg: 0.323 ± 0.112
0.726TrpSer: 0.726 ± 0.191
0.726TrpThr: 0.726 ± 0.208
0.202TrpVal: 0.202 ± 0.082
0.081TrpTrp: 0.081 ± 0.065
0.242TrpTyr: 0.242 ± 0.091
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.815TyrAla: 1.815 ± 0.271
1.452TyrCys: 1.452 ± 0.227
3.63TyrAsp: 3.63 ± 0.348
2.864TyrGlu: 2.864 ± 0.402
2.057TyrPhe: 2.057 ± 0.284
1.815TyrGly: 1.815 ± 0.295
0.968TyrHis: 0.968 ± 0.194
4.235TyrIle: 4.235 ± 0.378
3.469TyrLys: 3.469 ± 0.336
5.284TyrLeu: 5.284 ± 0.476
1.089TyrMet: 1.089 ± 0.189
3.792TyrAsn: 3.792 ± 0.462
1.089TyrPro: 1.089 ± 0.216
1.452TyrGln: 1.452 ± 0.228
2.42TyrArg: 2.42 ± 0.327
3.106TyrSer: 3.106 ± 0.316
3.187TyrThr: 3.187 ± 0.375
3.953TyrVal: 3.953 ± 0.423
0.282TyrTrp: 0.282 ± 0.116
2.299TyrTyr: 2.299 ± 0.311
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (24792 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski