Amino acid dipeptide frequency for Adoxophyes orana granulovirus (AoGV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
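As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides within each protein and reports their frequency in per mille next to the uniform expectation of 1/400. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalize by the total number of dipeptides are assumptions made for this example; the exact procedure used by the database is not stated on this page.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are counted as overlapping windows inside each protein, so no
    pair spans two proteins. Any non-standard letter is mapped to 'X' (Xaa).
    Normalizing by the total dipeptide count is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 20 * 20 = 400 possible dipeptides (441 with Xaa).
EXPECTED_400 = 1000.0 / 400   # 2.5 per mille, i.e. 0.25 %
EXPECTED_441 = 1000.0 / 441   # about 2.27 per mille

if __name__ == "__main__":
    toy_proteome = ["MKNNLLA", "MNNK"]  # made-up sequences, not AoGV data
    freqs = dipeptide_frequencies(toy_proteome)
    for pair, value in sorted(freqs.items()):
        label = "over" if value > EXPECTED_400 else "under"
        print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only a handful of residues every observed dipeptide looks overrepresented, so the comparison against 2.5‰ becomes meaningful only for a whole proteome.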

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
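For example, the AlaAla entry of 1.616‰ corresponds to a fraction of 0.001616, or 0.1616%. A minimal Python sketch of this conversion follows; the helper name `per_mille_to_fraction` is made up for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = per_mille_to_fraction(1.616)   # AlaAla entry from the table
uniform = per_mille_to_fraction(2.5)     # uniform expectation, 1/400

print(f"AlaAla fraction: {ala_ala:.6f}")
print(f"AlaAla percent:  {ala_ala * 100:.4f} %")
print(f"Uniform percent: {uniform * 100:.2f} %")
```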

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.616 ± 0.281
AlaCys: 0.937 ± 0.168
AlaAsp: 2.359 ± 0.232
AlaGlu: 1.745 ± 0.269
AlaPhe: 1.778 ± 0.203
AlaGly: 1.131 ± 0.256
AlaHis: 1.002 ± 0.163
AlaIle: 3.07 ± 0.265
AlaLys: 2.036 ± 0.195
AlaLeu: 3.782 ± 0.362
AlaMet: 0.808 ± 0.185
AlaAsn: 3.006 ± 0.306
AlaPro: 1.099 ± 0.186
AlaGln: 1.519 ± 0.177
AlaArg: 1.487 ± 0.233
AlaSer: 2.327 ± 0.256
AlaThr: 2.23 ± 0.282
AlaVal: 2.133 ± 0.281
AlaTrp: 0.291 ± 0.09
AlaTyr: 2.165 ± 0.224
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.034 ± 0.177
CysCys: 0.776 ± 0.203
CysAsp: 2.036 ± 0.319
CysGlu: 1.519 ± 0.21
CysPhe: 1.034 ± 0.214
CysGly: 0.97 ± 0.186
CysHis: 0.517 ± 0.152
CysIle: 1.681 ± 0.251
CysLys: 1.778 ± 0.245
CysLeu: 2.489 ± 0.285
CysMet: 0.517 ± 0.115
CysAsn: 2.359 ± 0.267
CysPro: 0.97 ± 0.174
CysGln: 0.743 ± 0.177
CysArg: 1.196 ± 0.187
CysSer: 1.681 ± 0.311
CysThr: 0.937 ± 0.174
CysVal: 2.295 ± 0.262
CysTrp: 0.065 ± 0.046
CysTyr: 1.519 ± 0.214
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.747 ± 0.315
AspCys: 1.681 ± 0.264
AspAsp: 5.236 ± 0.641
AspGlu: 4.008 ± 0.849
AspPhe: 2.521 ± 0.292
AspGly: 2.101 ± 0.27
AspHis: 1.099 ± 0.179
AspIle: 4.848 ± 0.413
AspLys: 5.365 ± 0.427
AspLeu: 5.333 ± 0.476
AspMet: 1.551 ± 0.236
AspAsn: 7.046 ± 0.682
AspPro: 1.584 ± 0.308
AspGln: 1.842 ± 0.275
AspArg: 2.069 ± 0.222
AspSer: 3.749 ± 0.373
AspThr: 3.361 ± 0.28
AspVal: 4.686 ± 0.431
AspTrp: 0.485 ± 0.126
AspTyr: 3.458 ± 0.31
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.745 ± 0.245
GluCys: 1.551 ± 0.302
GluAsp: 3.458 ± 0.729
GluGlu: 4.363 ± 0.923
GluPhe: 2.295 ± 0.272
GluGly: 1.454 ± 0.207
GluHis: 1.034 ± 0.162
GluIle: 4.59 ± 0.388
GluLys: 4.977 ± 0.501
GluLeu: 4.622 ± 0.349
GluMet: 2.101 ± 0.299
GluAsn: 5.43 ± 0.779
GluPro: 1.487 ± 0.219
GluGln: 2.359 ± 0.334
GluArg: 1.907 ± 0.224
GluSer: 3.62 ± 0.348
GluThr: 3.749 ± 0.352
GluVal: 2.327 ± 0.288
GluTrp: 0.452 ± 0.133
GluTyr: 2.004 ± 0.202
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.584 ± 0.254
PheCys: 0.808 ± 0.144
PheAsp: 4.234 ± 0.399
PheGlu: 2.747 ± 0.282
PhePhe: 2.101 ± 0.268
PheGly: 1.519 ± 0.201
PheHis: 0.743 ± 0.167
PheIle: 3.652 ± 0.372
PheLys: 4.072 ± 0.444
PheLeu: 4.072 ± 0.488
PheMet: 0.937 ± 0.17
PheAsn: 4.299 ± 0.437
PhePro: 1.099 ± 0.186
PheGln: 0.937 ± 0.188
PheArg: 1.454 ± 0.221
PheSer: 2.715 ± 0.325
PheThr: 2.165 ± 0.269
PheVal: 4.234 ± 0.416
PheTrp: 0.323 ± 0.105
PheTyr: 2.973 ± 0.323
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.648 ± 0.244
GlyCys: 0.42 ± 0.156
GlyAsp: 2.65 ± 0.316
GlyGlu: 1.648 ± 0.274
GlyPhe: 1.616 ± 0.234
GlyGly: 1.81 ± 0.285
GlyHis: 0.549 ± 0.141
GlyIle: 1.778 ± 0.254
GlyLys: 2.101 ± 0.338
GlyLeu: 2.683 ± 0.308
GlyMet: 0.743 ± 0.15
GlyAsn: 1.972 ± 0.274
GlyPro: 0.549 ± 0.168
GlyGln: 0.873 ± 0.179
GlyArg: 1.099 ± 0.188
GlySer: 1.681 ± 0.232
GlyThr: 2.036 ± 0.296
GlyVal: 2.618 ± 0.342
GlyTrp: 0.162 ± 0.067
GlyTyr: 1.584 ± 0.224
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.067 ± 0.139
HisCys: 0.42 ± 0.124
HisAsp: 1.228 ± 0.249
HisGlu: 1.002 ± 0.181
HisPhe: 1.261 ± 0.186
HisGly: 0.776 ± 0.158
HisHis: 0.582 ± 0.131
HisIle: 1.325 ± 0.201
HisLys: 1.778 ± 0.219
HisLeu: 2.004 ± 0.272
HisMet: 0.42 ± 0.126
HisAsn: 2.133 ± 0.273
HisPro: 0.776 ± 0.2
HisGln: 0.517 ± 0.14
HisArg: 0.614 ± 0.14
HisSer: 1.164 ± 0.152
HisThr: 0.97 ± 0.168
HisVal: 1.745 ± 0.253
HisTrp: 0.065 ± 0.046
HisTyr: 1.131 ± 0.18
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.586 ± 0.269
IleCys: 1.745 ± 0.265
IleAsp: 6.238 ± 0.531
IleGlu: 4.686 ± 0.333
IlePhe: 3.006 ± 0.342
IleGly: 2.069 ± 0.294
IleHis: 1.196 ± 0.205
IleIle: 5.85 ± 0.442
IleLys: 6.852 ± 0.515
IleLeu: 7.046 ± 0.488
IleMet: 2.133 ± 0.308
IleAsn: 8.791 ± 0.588
IlePro: 2.78 ± 0.283
IleGln: 2.101 ± 0.235
IleArg: 2.327 ± 0.283
IleSer: 4.331 ± 0.384
IleThr: 4.654 ± 0.367
IleVal: 6.044 ± 0.503
IleTrp: 0.549 ± 0.141
IleTyr: 3.911 ± 0.427
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.101 ± 0.233
LysCys: 2.489 ± 0.29
LysAsp: 3.555 ± 0.405
LysGlu: 4.751 ± 0.442
LysPhe: 3.491 ± 0.321
LysGly: 1.745 ± 0.261
LysHis: 2.456 ± 0.262
LysIle: 7.111 ± 0.619
LysLys: 6.012 ± 0.619
LysLeu: 8.403 ± 0.675
LysMet: 1.875 ± 0.26
LysAsn: 6.949 ± 0.488
LysPro: 1.551 ± 0.235
LysGln: 3.167 ± 0.341
LysArg: 4.493 ± 0.494
LysSer: 5.01 ± 0.427
LysThr: 4.622 ± 0.349
LysVal: 3.135 ± 0.3
LysTrp: 0.97 ± 0.181
LysTyr: 3.458 ± 0.409
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.2 ± 0.387
LeuCys: 2.877 ± 0.267
LeuAsp: 4.977 ± 0.437
LeuGlu: 4.88 ± 0.504
LeuPhe: 4.977 ± 0.371
LeuGly: 2.489 ± 0.342
LeuHis: 2.521 ± 0.259
LeuIle: 7.951 ± 0.521
LeuLys: 8.565 ± 0.664
LeuLeu: 8.727 ± 0.594
LeuMet: 2.909 ± 0.279
LeuAsn: 9.373 ± 0.68
LeuPro: 2.359 ± 0.346
LeuGln: 4.169 ± 0.479
LeuArg: 3.911 ± 0.349
LeuSer: 5.915 ± 0.495
LeuThr: 3.975 ± 0.34
LeuVal: 4.719 ± 0.341
LeuTrp: 1.067 ± 0.193
LeuTyr: 5.171 ± 0.404
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.776 ± 0.128
MetCys: 0.808 ± 0.167
MetAsp: 1.357 ± 0.213
MetGlu: 1.681 ± 0.214
MetPhe: 1.713 ± 0.227
MetGly: 0.808 ± 0.151
MetHis: 0.517 ± 0.13
MetIle: 1.713 ± 0.205
MetLys: 1.357 ± 0.206
MetLeu: 3.038 ± 0.282
MetMet: 0.614 ± 0.128
MetAsn: 1.584 ± 0.178
MetPro: 0.582 ± 0.153
MetGln: 1.067 ± 0.191
MetArg: 1.002 ± 0.171
MetSer: 2.295 ± 0.282
MetThr: 1.422 ± 0.183
MetVal: 1.551 ± 0.227
MetTrp: 0.129 ± 0.066
MetTyr: 1.519 ± 0.189
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.973 ± 0.343
AsnCys: 2.165 ± 0.24
AsnAsp: 6.432 ± 0.506
AsnGlu: 5.624 ± 0.516
AsnPhe: 4.202 ± 0.351
AsnGly: 2.909 ± 0.38
AsnHis: 1.099 ± 0.203
AsnIle: 7.789 ± 0.78
AsnLys: 7.628 ± 0.605
AsnLeu: 8.112 ± 0.551
AsnMet: 2.456 ± 0.251
AsnAsn: 10.795 ± 0.703
AsnPro: 2.262 ± 0.315
AsnGln: 2.973 ± 0.307
AsnArg: 3.62 ± 0.318
AsnSer: 5.591 ± 0.413
AsnThr: 5.042 ± 0.433
AsnVal: 7.369 ± 0.487
AsnTrp: 0.679 ± 0.15
AsnTyr: 4.783 ± 0.405
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.164 ± 0.228
ProCys: 0.937 ± 0.2
ProAsp: 1.39 ± 0.194
ProGlu: 1.39 ± 0.218
ProPhe: 1.487 ± 0.227
ProGly: 0.646 ± 0.172
ProHis: 0.614 ± 0.139
ProIle: 2.618 ± 0.322
ProLys: 1.584 ± 0.224
ProLeu: 3.006 ± 0.39
ProMet: 0.614 ± 0.156
ProAsn: 2.715 ± 0.271
ProPro: 1.842 ± 0.896
ProGln: 0.614 ± 0.169
ProArg: 1.002 ± 0.211
ProSer: 2.327 ± 0.293
ProThr: 2.359 ± 0.807
ProVal: 2.262 ± 0.29
ProTrp: 0.291 ± 0.097
ProTyr: 1.454 ± 0.216
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.067 ± 0.203
GlnCys: 1.034 ± 0.168
GlnAsp: 1.487 ± 0.238
GlnGlu: 1.681 ± 0.298
GlnPhe: 1.39 ± 0.247
GlnGly: 0.646 ± 0.171
GlnHis: 0.937 ± 0.137
GlnIle: 3.361 ± 0.306
GlnLys: 2.553 ± 0.284
GlnLeu: 3.685 ± 0.331
GlnMet: 0.905 ± 0.157
GlnAsn: 2.812 ± 0.309
GlnPro: 1.067 ± 0.17
GlnGln: 1.81 ± 0.26
GlnArg: 1.325 ± 0.202
GlnSer: 2.327 ± 0.34
GlnThr: 2.262 ± 0.251
GlnVal: 1.39 ± 0.204
GlnTrp: 0.356 ± 0.116
GlnTyr: 2.262 ± 0.305
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.519 ± 0.207
ArgCys: 0.937 ± 0.169
ArgAsp: 2.715 ± 0.33
ArgGlu: 1.454 ± 0.192
ArgPhe: 1.745 ± 0.247
ArgGly: 1.39 ± 0.203
ArgHis: 1.261 ± 0.21
ArgIle: 3.329 ± 0.281
ArgLys: 2.812 ± 0.422
ArgLeu: 4.46 ± 0.36
ArgMet: 1.261 ± 0.22
ArgAsn: 2.941 ± 0.286
ArgPro: 1.228 ± 0.213
ArgGln: 1.519 ± 0.201
ArgArg: 2.133 ± 0.364
ArgSer: 2.23 ± 0.461
ArgThr: 1.487 ± 0.194
ArgVal: 2.747 ± 0.321
ArgTrp: 0.323 ± 0.099
ArgTyr: 2.004 ± 0.264
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.101 ± 0.294
SerCys: 1.164 ± 0.234
SerAsp: 4.266 ± 0.464
SerGlu: 3.232 ± 0.452
SerPhe: 3.491 ± 0.379
SerGly: 2.165 ± 0.268
SerHis: 1.034 ± 0.183
SerIle: 4.654 ± 0.403
SerLys: 4.396 ± 0.424
SerLeu: 6.303 ± 0.478
SerMet: 1.39 ± 0.196
SerAsn: 5.365 ± 0.327
SerPro: 2.069 ± 0.284
SerGln: 1.681 ± 0.256
SerArg: 2.489 ± 0.426
SerSer: 4.654 ± 0.357
SerThr: 3.2 ± 0.348
SerVal: 4.525 ± 0.429
SerTrp: 0.549 ± 0.112
SerTyr: 3.103 ± 0.285
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.327 ± 0.349
ThrCys: 1.196 ± 0.205
ThrAsp: 3.006 ± 0.29
ThrGlu: 2.392 ± 0.293
ThrPhe: 2.65 ± 0.391
ThrGly: 1.745 ± 0.279
ThrHis: 1.034 ± 0.222
ThrIle: 4.59 ± 0.515
ThrLys: 3.394 ± 0.27
ThrLeu: 5.85 ± 0.417
ThrMet: 1.067 ± 0.172
ThrAsn: 5.139 ± 0.493
ThrPro: 3.426 ± 0.764
ThrGln: 2.392 ± 0.219
ThrArg: 2.069 ± 0.261
ThrSer: 2.909 ± 0.257
ThrThr: 4.493 ± 0.582
ThrVal: 2.877 ± 0.292
ThrTrp: 0.549 ± 0.137
ThrTyr: 2.327 ± 0.237
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.877 ± 0.284
ValCys: 2.133 ± 0.249
ValAsp: 4.363 ± 0.425
ValGlu: 3.394 ± 0.361
ValPhe: 2.941 ± 0.323
ValGly: 2.069 ± 0.281
ValHis: 1.519 ± 0.247
ValIle: 5.236 ± 0.435
ValLys: 4.88 ± 0.43
ValLeu: 6.303 ± 0.415
ValMet: 1.584 ± 0.222
ValAsn: 5.85 ± 0.467
ValPro: 1.972 ± 0.332
ValGln: 2.198 ± 0.257
ValArg: 2.941 ± 0.276
ValSer: 3.749 ± 0.364
ValThr: 2.909 ± 0.29
ValVal: 4.88 ± 0.427
ValTrp: 0.485 ± 0.147
ValTyr: 3.652 ± 0.405
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.42 ± 0.105
TrpCys: 0.291 ± 0.091
TrpAsp: 0.356 ± 0.115
TrpGlu: 0.388 ± 0.114
TrpPhe: 0.452 ± 0.121
TrpGly: 0.194 ± 0.073
TrpHis: 0.291 ± 0.092
TrpIle: 0.323 ± 0.099
TrpLys: 0.485 ± 0.123
TrpLeu: 0.743 ± 0.154
TrpMet: 0.097 ± 0.056
TrpAsn: 0.776 ± 0.185
TrpPro: 0.259 ± 0.083
TrpGln: 0.582 ± 0.126
TrpArg: 0.452 ± 0.145
TrpSer: 0.485 ± 0.131
TrpThr: 0.517 ± 0.112
TrpVal: 0.452 ± 0.121
TrpTrp: 0.162 ± 0.066
TrpTyr: 0.679 ± 0.127
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.972 ± 0.219
TyrCys: 1.745 ± 0.257
TyrAsp: 3.426 ± 0.308
TyrGlu: 2.941 ± 0.419
TyrPhe: 2.521 ± 0.331
TyrGly: 1.584 ± 0.296
TyrHis: 0.97 ± 0.181
TyrIle: 3.426 ± 0.32
TyrLys: 4.719 ± 0.386
TyrLeu: 4.105 ± 0.311
TyrMet: 1.519 ± 0.241
TyrAsn: 5.171 ± 0.339
TyrPro: 1.357 ± 0.176
TyrGln: 1.293 ± 0.231
TyrArg: 1.907 ± 0.255
TyrSer: 3.07 ± 0.3
TyrThr: 3.038 ± 0.316
TyrVal: 3.943 ± 0.348
TyrTrp: 0.452 ± 0.116
TyrTyr: 3.264 ± 0.363
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 119 proteins (30941 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
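Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. This Python sketch is an assumption about the general procedure, not the database's actual code; the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all illustrative choices.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping dipeptides."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement, so proteins
    (not individual residues) are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKNNLLA", "MNNKNN", "MLLKA", "MAAAK"]  # made-up sequences
    freq = pair_per_mille(toy_proteome, "NN")
    err = bootstrap_error(toy_proteome, "NN")
    print(f"NN: {freq:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.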

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski