Amino acid dipepetide frequency for Botrylloides leachii nidovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.791AlaAla: 3.791 ± 0.589
1.81AlaCys: 1.81 ± 0.231
3.016AlaAsp: 3.016 ± 0.487
3.705AlaGlu: 3.705 ± 0.41
2.413AlaPhe: 2.413 ± 0.31
2.327AlaGly: 2.327 ± 0.533
1.206AlaHis: 1.206 ± 0.289
3.791AlaIle: 3.791 ± 0.188
3.964AlaLys: 3.964 ± 0.718
5.687AlaLeu: 5.687 ± 0.435
1.206AlaMet: 1.206 ± 0.165
2.671AlaAsn: 2.671 ± 0.602
3.188AlaPro: 3.188 ± 0.529
2.068AlaGln: 2.068 ± 0.413
3.447AlaArg: 3.447 ± 0.312
2.844AlaSer: 2.844 ± 0.689
3.705AlaThr: 3.705 ± 0.454
3.878AlaVal: 3.878 ± 0.755
0.517AlaTrp: 0.517 ± 0.156
2.154AlaTyr: 2.154 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
1.034CysAla: 1.034 ± 0.235
0.603CysCys: 0.603 ± 0.729
1.465CysAsp: 1.465 ± 0.331
0.948CysGlu: 0.948 ± 0.411
1.12CysPhe: 1.12 ± 0.722
2.499CysGly: 2.499 ± 0.496
0.431CysHis: 0.431 ± 0.181
1.12CysIle: 1.12 ± 0.383
3.361CysLys: 3.361 ± 0.606
1.723CysLeu: 1.723 ± 0.423
1.034CysMet: 1.034 ± 0.213
1.551CysAsn: 1.551 ± 0.392
1.12CysPro: 1.12 ± 0.13
0.517CysGln: 0.517 ± 0.059
1.12CysArg: 1.12 ± 0.158
1.465CysSer: 1.465 ± 0.292
2.499CysThr: 2.499 ± 0.338
1.637CysVal: 1.637 ± 0.224
0.172CysTrp: 0.172 ± 0.585
1.12CysTyr: 1.12 ± 0.308
0.0CysXaa: 0.0 ± 0.0
Asp
3.016AspAla: 3.016 ± 0.517
1.034AspCys: 1.034 ± 0.167
2.93AspAsp: 2.93 ± 0.721
2.327AspGlu: 2.327 ± 0.286
4.826AspPhe: 4.826 ± 0.713
3.447AspGly: 3.447 ± 1.093
1.034AspHis: 1.034 ± 0.117
3.791AspIle: 3.791 ± 0.396
3.964AspLys: 3.964 ± 0.795
5.17AspLeu: 5.17 ± 0.211
1.034AspMet: 1.034 ± 0.136
3.188AspAsn: 3.188 ± 0.802
2.327AspPro: 2.327 ± 0.973
3.447AspGln: 3.447 ± 0.61
2.671AspArg: 2.671 ± 0.553
2.413AspSer: 2.413 ± 0.522
3.188AspThr: 3.188 ± 0.439
2.757AspVal: 2.757 ± 0.404
0.948AspTrp: 0.948 ± 0.326
2.327AspTyr: 2.327 ± 0.286
0.0AspXaa: 0.0 ± 0.0
Glu
2.585GluAla: 2.585 ± 0.294
1.551GluCys: 1.551 ± 0.389
3.533GluAsp: 3.533 ± 0.931
4.912GluGlu: 4.912 ± 1.009
2.24GluPhe: 2.24 ± 0.643
3.188GluGly: 3.188 ± 0.463
1.379GluHis: 1.379 ± 0.336
4.739GluIle: 4.739 ± 0.228
5.17GluLys: 5.17 ± 0.944
5.17GluLeu: 5.17 ± 0.719
1.982GluMet: 1.982 ± 0.439
2.757GluAsn: 2.757 ± 0.359
3.016GluPro: 3.016 ± 0.402
2.93GluGln: 2.93 ± 0.359
2.327GluArg: 2.327 ± 0.33
3.188GluSer: 3.188 ± 0.239
3.533GluThr: 3.533 ± 0.333
3.102GluVal: 3.102 ± 0.621
0.345GluTrp: 0.345 ± 0.133
3.016GluTyr: 3.016 ± 0.607
0.0GluXaa: 0.0 ± 0.0
Phe
3.533PheAla: 3.533 ± 0.793
1.551PheCys: 1.551 ± 0.255
3.619PheAsp: 3.619 ± 0.529
4.308PheGlu: 4.308 ± 0.597
2.154PhePhe: 2.154 ± 0.591
3.274PheGly: 3.274 ± 0.47
1.034PheHis: 1.034 ± 0.213
3.533PheIle: 3.533 ± 0.608
2.499PheLys: 2.499 ± 0.823
3.016PheLeu: 3.016 ± 0.777
1.293PheMet: 1.293 ± 0.31
3.102PheAsn: 3.102 ± 0.404
1.81PhePro: 1.81 ± 0.533
1.465PheGln: 1.465 ± 0.485
2.068PheArg: 2.068 ± 0.618
3.188PheSer: 3.188 ± 0.284
4.481PheThr: 4.481 ± 0.782
2.93PheVal: 2.93 ± 0.366
1.034PheTrp: 1.034 ± 0.3
2.671PheTyr: 2.671 ± 0.639
0.0PheXaa: 0.0 ± 0.0
Gly
2.413GlyAla: 2.413 ± 0.522
1.465GlyCys: 1.465 ± 0.448
2.068GlyAsp: 2.068 ± 0.371
3.188GlyGlu: 3.188 ± 0.921
4.136GlyPhe: 4.136 ± 0.508
1.379GlyGly: 1.379 ± 0.147
1.379GlyHis: 1.379 ± 0.267
3.533GlyIle: 3.533 ± 0.413
5.773GlyLys: 5.773 ± 1.399
4.912GlyLeu: 4.912 ± 0.429
1.293GlyMet: 1.293 ± 0.34
3.705GlyAsn: 3.705 ± 0.387
2.24GlyPro: 2.24 ± 0.587
0.862GlyGln: 0.862 ± 0.882
2.757GlyArg: 2.757 ± 0.404
3.533GlySer: 3.533 ± 1.072
3.102GlyThr: 3.102 ± 0.447
2.24GlyVal: 2.24 ± 0.696
0.172GlyTrp: 0.172 ± 0.106
2.24GlyTyr: 2.24 ± 0.361
0.0GlyXaa: 0.0 ± 0.0
His
0.776HisAla: 0.776 ± 0.203
0.862HisCys: 0.862 ± 0.141
1.465HisAsp: 1.465 ± 0.221
1.465HisGlu: 1.465 ± 0.577
1.379HisPhe: 1.379 ± 0.245
1.293HisGly: 1.293 ± 0.147
0.948HisHis: 0.948 ± 0.203
1.551HisIle: 1.551 ± 0.389
1.293HisLys: 1.293 ± 0.3
1.465HisLeu: 1.465 ± 0.238
0.431HisMet: 0.431 ± 0.148
0.862HisAsn: 0.862 ± 0.209
1.551HisPro: 1.551 ± 0.318
1.293HisGln: 1.293 ± 0.443
1.206HisArg: 1.206 ± 0.534
1.293HisSer: 1.293 ± 0.147
1.81HisThr: 1.81 ± 0.339
1.896HisVal: 1.896 ± 0.645
0.431HisTrp: 0.431 ± 0.148
1.12HisTyr: 1.12 ± 0.13
0.0HisXaa: 0.0 ± 0.0
Ile
4.998IleAla: 4.998 ± 1.301
1.12IleCys: 1.12 ± 0.465
4.395IleAsp: 4.395 ± 0.73
4.05IleGlu: 4.05 ± 0.576
3.361IlePhe: 3.361 ± 0.479
1.81IleGly: 1.81 ± 0.24
2.154IleHis: 2.154 ± 0.329
2.671IleIle: 2.671 ± 0.3
3.619IleLys: 3.619 ± 0.514
4.308IleLeu: 4.308 ± 0.316
1.293IleMet: 1.293 ± 0.293
3.447IleAsn: 3.447 ± 0.406
3.102IlePro: 3.102 ± 0.528
1.982IleGln: 1.982 ± 0.465
2.499IleArg: 2.499 ± 0.893
5.343IleSer: 5.343 ± 0.563
5.946IleThr: 5.946 ± 0.832
5.429IleVal: 5.429 ± 0.72
0.603IleTrp: 0.603 ± 0.118
3.274IleTyr: 3.274 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
3.705LysAla: 3.705 ± 0.387
2.24LysCys: 2.24 ± 0.26
3.016LysAsp: 3.016 ± 0.5
4.998LysGlu: 4.998 ± 0.924
3.878LysPhe: 3.878 ± 0.724
3.533LysGly: 3.533 ± 1.033
2.327LysHis: 2.327 ± 0.513
5.256LysIle: 5.256 ± 1.141
5.17LysLys: 5.17 ± 0.827
4.739LysLeu: 4.739 ± 0.561
2.24LysMet: 2.24 ± 1.005
4.826LysAsn: 4.826 ± 1.36
3.533LysPro: 3.533 ± 1.591
3.188LysGln: 3.188 ± 0.68
3.447LysArg: 3.447 ± 0.333
4.308LysSer: 4.308 ± 0.264
7.411LysThr: 7.411 ± 1.476
3.274LysVal: 3.274 ± 0.375
0.603LysTrp: 0.603 ± 0.156
2.154LysTyr: 2.154 ± 0.595
0.0LysXaa: 0.0 ± 0.0
Leu
5.946LeuAla: 5.946 ± 0.544
1.982LeuCys: 1.982 ± 0.347
3.619LeuAsp: 3.619 ± 0.769
4.826LeuGlu: 4.826 ± 0.422
3.188LeuPhe: 3.188 ± 0.387
3.447LeuGly: 3.447 ± 0.251
1.293LeuHis: 1.293 ± 0.378
4.308LeuIle: 4.308 ± 0.909
5.515LeuLys: 5.515 ± 0.719
5.256LeuLeu: 5.256 ± 2.315
1.723LeuMet: 1.723 ± 0.423
4.308LeuAsn: 4.308 ± 0.712
3.361LeuPro: 3.361 ± 1.429
2.24LeuGln: 2.24 ± 0.394
3.102LeuArg: 3.102 ± 0.26
7.152LeuSer: 7.152 ± 0.397
7.324LeuThr: 7.324 ± 0.955
4.653LeuVal: 4.653 ± 0.551
0.259LeuTrp: 0.259 ± 0.083
4.05LeuTyr: 4.05 ± 0.433
0.0LeuXaa: 0.0 ± 0.0
Met
1.723MetAla: 1.723 ± 0.455
0.345MetCys: 0.345 ± 0.056
0.862MetAsp: 0.862 ± 0.295
1.982MetGlu: 1.982 ± 0.549
0.517MetPhe: 0.517 ± 0.18
0.517MetGly: 0.517 ± 0.18
1.12MetHis: 1.12 ± 0.13
2.24MetIle: 2.24 ± 0.391
1.293MetLys: 1.293 ± 0.222
2.068MetLeu: 2.068 ± 1.499
0.689MetMet: 0.689 ± 0.263
1.12MetAsn: 1.12 ± 0.383
0.948MetPro: 0.948 ± 0.186
0.603MetGln: 0.603 ± 0.52
0.689MetArg: 0.689 ± 0.265
1.465MetSer: 1.465 ± 0.688
1.982MetThr: 1.982 ± 0.313
1.551MetVal: 1.551 ± 0.497
0.259MetTrp: 0.259 ± 0.09
1.206MetTyr: 1.206 ± 0.16
0.0MetXaa: 0.0 ± 0.0
Asn
2.499AsnAla: 2.499 ± 0.442
1.293AsnCys: 1.293 ± 0.31
2.327AsnAsp: 2.327 ± 0.553
2.757AsnGlu: 2.757 ± 0.405
2.499AsnPhe: 2.499 ± 0.445
3.705AsnGly: 3.705 ± 0.428
0.603AsnHis: 0.603 ± 0.16
3.878AsnIle: 3.878 ± 0.6
3.102AsnLys: 3.102 ± 0.832
3.964AsnLeu: 3.964 ± 1.309
0.862AsnMet: 0.862 ± 0.171
2.24AsnAsn: 2.24 ± 0.366
1.982AsnPro: 1.982 ± 0.247
0.862AsnGln: 0.862 ± 0.528
1.551AsnArg: 1.551 ± 0.309
4.308AsnSer: 4.308 ± 0.682
3.274AsnThr: 3.274 ± 0.474
4.308AsnVal: 4.308 ± 0.165
0.776AsnTrp: 0.776 ± 0.173
2.844AsnTyr: 2.844 ± 0.296
0.0AsnXaa: 0.0 ± 0.0
Pro
1.723ProAla: 1.723 ± 0.55
1.206ProCys: 1.206 ± 0.462
3.188ProAsp: 3.188 ± 0.388
2.757ProGlu: 2.757 ± 0.314
1.896ProPhe: 1.896 ± 0.25
2.154ProGly: 2.154 ± 0.44
0.862ProHis: 0.862 ± 0.161
2.327ProIle: 2.327 ± 0.492
3.878ProLys: 3.878 ± 1.495
4.05ProLeu: 4.05 ± 0.764
0.517ProMet: 0.517 ± 0.436
1.81ProAsn: 1.81 ± 0.276
1.293ProPro: 1.293 ± 0.413
1.551ProGln: 1.551 ± 0.313
1.81ProArg: 1.81 ± 1.178
2.93ProSer: 2.93 ± 1.757
2.413ProThr: 2.413 ± 0.331
2.671ProVal: 2.671 ± 1.017
0.517ProTrp: 0.517 ± 0.059
1.896ProTyr: 1.896 ± 0.255
0.0ProXaa: 0.0 ± 0.0
Gln
2.413GlnAla: 2.413 ± 0.368
0.948GlnCys: 0.948 ± 0.186
1.293GlnAsp: 1.293 ± 0.52
2.154GlnGlu: 2.154 ± 0.443
1.465GlnPhe: 1.465 ± 0.291
2.413GlnGly: 2.413 ± 0.329
1.206GlnHis: 1.206 ± 0.16
2.671GlnIle: 2.671 ± 0.379
3.361GlnLys: 3.361 ± 0.724
2.24GlnLeu: 2.24 ± 0.944
0.948GlnMet: 0.948 ± 0.48
1.551GlnAsn: 1.551 ± 0.239
0.776GlnPro: 0.776 ± 0.491
1.379GlnGln: 1.379 ± 0.203
2.413GlnArg: 2.413 ± 0.333
1.637GlnSer: 1.637 ± 1.061
2.154GlnThr: 2.154 ± 0.919
2.154GlnVal: 2.154 ± 0.353
0.172GlnTrp: 0.172 ± 0.065
1.379GlnTyr: 1.379 ± 0.471
0.0GlnXaa: 0.0 ± 0.0
Arg
2.93ArgAla: 2.93 ± 0.476
1.81ArgCys: 1.81 ± 0.298
3.361ArgAsp: 3.361 ± 0.437
2.24ArgGlu: 2.24 ± 1.5
3.188ArgPhe: 3.188 ± 0.355
1.465ArgGly: 1.465 ± 0.377
1.637ArgHis: 1.637 ± 0.323
2.671ArgIle: 2.671 ± 0.305
4.395ArgLys: 4.395 ± 0.544
2.499ArgLeu: 2.499 ± 0.509
1.12ArgMet: 1.12 ± 0.401
2.24ArgAsn: 2.24 ± 0.936
2.327ArgPro: 2.327 ± 0.825
1.551ArgGln: 1.551 ± 0.412
2.499ArgArg: 2.499 ± 0.572
2.327ArgSer: 2.327 ± 1.218
3.705ArgThr: 3.705 ± 0.764
3.102ArgVal: 3.102 ± 0.496
0.517ArgTrp: 0.517 ± 0.195
0.862ArgTyr: 0.862 ± 0.264
0.0ArgXaa: 0.0 ± 0.0
Ser
3.274SerAla: 3.274 ± 0.88
1.81SerCys: 1.81 ± 0.211
5.687SerAsp: 5.687 ± 0.75
3.791SerGlu: 3.791 ± 0.83
3.274SerPhe: 3.274 ± 0.379
5.429SerGly: 5.429 ± 1.08
1.379SerHis: 1.379 ± 0.55
4.05SerIle: 4.05 ± 0.657
4.222SerLys: 4.222 ± 0.676
4.739SerLeu: 4.739 ± 0.736
0.689SerMet: 0.689 ± 0.605
2.154SerAsn: 2.154 ± 0.398
2.327SerPro: 2.327 ± 0.472
3.016SerGln: 3.016 ± 1.427
3.188SerArg: 3.188 ± 0.66
4.395SerSer: 4.395 ± 1.352
4.653SerThr: 4.653 ± 1.893
5.687SerVal: 5.687 ± 0.678
1.034SerTrp: 1.034 ± 0.167
1.379SerTyr: 1.379 ± 0.656
0.0SerXaa: 0.0 ± 0.0
Thr
4.826ThrAla: 4.826 ± 0.571
2.327ThrCys: 2.327 ± 0.329
3.619ThrAsp: 3.619 ± 0.556
3.533ThrGlu: 3.533 ± 0.414
3.964ThrPhe: 3.964 ± 0.561
4.739ThrGly: 4.739 ± 1.099
1.551ThrHis: 1.551 ± 0.178
5.86ThrIle: 5.86 ± 0.79
5.256ThrLys: 5.256 ± 0.874
5.946ThrLeu: 5.946 ± 0.719
1.12ThrMet: 1.12 ± 0.179
2.844ThrAsn: 2.844 ± 0.876
2.844ThrPro: 2.844 ± 0.861
2.068ThrGln: 2.068 ± 0.235
3.274ThrArg: 3.274 ± 0.391
5.343ThrSer: 5.343 ± 0.847
5.515ThrThr: 5.515 ± 1.162
4.567ThrVal: 4.567 ± 0.582
1.637ThrTrp: 1.637 ± 0.292
3.361ThrTyr: 3.361 ± 0.385
0.0ThrXaa: 0.0 ± 0.0
Val
3.791ValAla: 3.791 ± 0.654
1.81ValCys: 1.81 ± 0.385
2.499ValAsp: 2.499 ± 0.614
3.791ValGlu: 3.791 ± 0.503
4.395ValPhe: 4.395 ± 0.485
3.361ValGly: 3.361 ± 0.385
1.465ValHis: 1.465 ± 0.209
4.481ValIle: 4.481 ± 0.725
4.395ValLys: 4.395 ± 0.829
5.17ValLeu: 5.17 ± 0.625
1.465ValMet: 1.465 ± 0.183
2.585ValAsn: 2.585 ± 0.359
2.24ValPro: 2.24 ± 1.017
2.24ValGln: 2.24 ± 0.29
3.274ValArg: 3.274 ± 0.393
5.343ValSer: 5.343 ± 0.4
3.964ValThr: 3.964 ± 1.102
2.93ValVal: 2.93 ± 0.415
0.517ValTrp: 0.517 ± 0.059
3.533ValTyr: 3.533 ± 0.414
0.0ValXaa: 0.0 ± 0.0
Trp
0.689TrpAla: 0.689 ± 0.111
0.086TrpCys: 0.086 ± 0.053
0.862TrpAsp: 0.862 ± 0.141
0.862TrpGlu: 0.862 ± 0.295
0.689TrpPhe: 0.689 ± 0.111
0.345TrpGly: 0.345 ± 0.505
0.259TrpHis: 0.259 ± 0.083
0.345TrpIle: 0.345 ± 0.056
0.948TrpLys: 0.948 ± 0.186
1.723TrpLeu: 1.723 ± 0.208
0.259TrpMet: 0.259 ± 0.09
0.345TrpAsn: 0.345 ± 0.203
0.172TrpPro: 0.172 ± 0.253
0.517TrpGln: 0.517 ± 0.059
1.12TrpArg: 1.12 ± 0.169
0.431TrpSer: 0.431 ± 0.531
0.776TrpThr: 0.776 ± 0.173
0.603TrpVal: 0.603 ± 0.118
0.0TrpTrp: 0.0 ± 0.0
0.431TrpTyr: 0.431 ± 0.078
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.723TyrAla: 1.723 ± 0.404
0.689TyrCys: 0.689 ± 0.236
3.016TyrAsp: 3.016 ± 0.329
2.068TyrGlu: 2.068 ± 0.596
1.982TyrPhe: 1.982 ± 0.322
2.068TyrGly: 2.068 ± 0.425
1.12TyrHis: 1.12 ± 0.247
2.585TyrIle: 2.585 ± 0.354
2.671TyrLys: 2.671 ± 0.348
3.705TyrLeu: 3.705 ± 0.847
1.982TyrMet: 1.982 ± 0.538
2.327TyrAsn: 2.327 ± 0.272
1.293TyrPro: 1.293 ± 0.267
0.948TyrGln: 0.948 ± 0.732
1.982TyrArg: 1.982 ± 0.451
3.274TyrSer: 3.274 ± 0.523
2.757TyrThr: 2.757 ± 0.718
3.878TyrVal: 3.878 ± 0.437
0.948TyrTrp: 0.948 ± 0.233
1.896TyrTyr: 1.896 ± 0.282
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (11606 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski