Amino acid dipepetide frequency for Bellinger River virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.18AlaAla: 5.18 ± 0.423
0.538AlaCys: 0.538 ± 0.295
1.816AlaAsp: 1.816 ± 0.245
3.498AlaGlu: 3.498 ± 0.678
3.498AlaPhe: 3.498 ± 0.326
2.893AlaGly: 2.893 ± 0.423
1.413AlaHis: 1.413 ± 0.126
5.92AlaIle: 5.92 ± 0.861
3.162AlaLys: 3.162 ± 0.471
5.718AlaLeu: 5.718 ± 0.412
1.682AlaMet: 1.682 ± 0.712
4.036AlaAsn: 4.036 ± 0.301
2.085AlaPro: 2.085 ± 0.479
1.614AlaGln: 1.614 ± 0.456
2.085AlaArg: 2.085 ± 0.374
5.18AlaSer: 5.18 ± 0.495
8.274AlaThr: 8.274 ± 0.756
4.978AlaVal: 4.978 ± 0.492
0.673AlaTrp: 0.673 ± 0.427
2.018AlaTyr: 2.018 ± 0.27
0.0AlaXaa: 0.0 ± 0.0
Cys
1.144CysAla: 1.144 ± 0.507
0.74CysCys: 0.74 ± 0.157
1.48CysAsp: 1.48 ± 0.245
0.807CysGlu: 0.807 ± 0.185
1.413CysPhe: 1.413 ± 0.187
1.144CysGly: 1.144 ± 0.204
2.085CysHis: 2.085 ± 0.226
0.942CysIle: 0.942 ± 0.506
2.691CysLys: 2.691 ± 0.296
1.749CysLeu: 1.749 ± 0.515
0.336CysMet: 0.336 ± 0.094
1.614CysAsn: 1.614 ± 0.147
0.74CysPro: 0.74 ± 0.487
1.413CysGln: 1.413 ± 0.161
1.144CysArg: 1.144 ± 0.201
1.749CysSer: 1.749 ± 0.227
2.758CysThr: 2.758 ± 0.52
0.605CysVal: 0.605 ± 0.284
0.067CysTrp: 0.067 ± 0.043
0.942CysTyr: 0.942 ± 0.179
0.0CysXaa: 0.0 ± 0.0
Asp
1.614AspAla: 1.614 ± 0.394
0.942AspCys: 0.942 ± 0.299
2.489AspAsp: 2.489 ± 0.394
2.018AspGlu: 2.018 ± 0.582
3.094AspPhe: 3.094 ± 0.451
1.749AspGly: 1.749 ± 0.332
1.547AspHis: 1.547 ± 0.171
3.498AspIle: 3.498 ± 0.603
2.422AspLys: 2.422 ± 0.349
2.489AspLeu: 2.489 ± 0.352
1.278AspMet: 1.278 ± 0.237
4.305AspAsn: 4.305 ± 0.686
1.076AspPro: 1.076 ± 0.361
1.951AspGln: 1.951 ± 0.45
0.942AspArg: 0.942 ± 0.332
5.785AspSer: 5.785 ± 0.267
5.314AspThr: 5.314 ± 0.826
1.614AspVal: 1.614 ± 0.293
1.009AspTrp: 1.009 ± 0.14
3.431AspTyr: 3.431 ± 0.338
0.0AspXaa: 0.0 ± 0.0
Glu
2.422GluAla: 2.422 ± 0.463
1.009GluCys: 1.009 ± 0.289
1.48GluAsp: 1.48 ± 0.261
2.085GluGlu: 2.085 ± 0.403
1.614GluPhe: 1.614 ± 0.465
1.211GluGly: 1.211 ± 0.321
2.018GluHis: 2.018 ± 0.296
2.893GluIle: 2.893 ± 0.357
1.816GluLys: 1.816 ± 0.335
5.92GluLeu: 5.92 ± 0.991
0.74GluMet: 0.74 ± 0.207
1.883GluAsn: 1.883 ± 0.362
1.883GluPro: 1.883 ± 0.27
5.247GluGln: 5.247 ± 0.753
1.682GluArg: 1.682 ± 0.255
4.238GluSer: 4.238 ± 0.478
3.296GluThr: 3.296 ± 0.45
1.951GluVal: 1.951 ± 0.318
0.404GluTrp: 0.404 ± 0.123
2.018GluTyr: 2.018 ± 0.238
0.0GluXaa: 0.0 ± 0.0
Phe
3.094PheAla: 3.094 ± 0.41
1.413PheCys: 1.413 ± 0.299
2.623PheAsp: 2.623 ± 0.24
2.354PheGlu: 2.354 ± 0.333
1.345PhePhe: 1.345 ± 0.66
1.816PheGly: 1.816 ± 0.341
1.278PheHis: 1.278 ± 0.201
4.171PheIle: 4.171 ± 0.506
2.422PheLys: 2.422 ± 0.47
3.498PheLeu: 3.498 ± 0.455
0.605PheMet: 0.605 ± 0.098
3.632PheAsn: 3.632 ± 0.563
1.009PhePro: 1.009 ± 0.245
1.413PheGln: 1.413 ± 0.513
1.749PheArg: 1.749 ± 0.27
3.162PheSer: 3.162 ± 0.289
3.7PheThr: 3.7 ± 0.358
2.489PheVal: 2.489 ± 0.374
0.74PheTrp: 0.74 ± 0.202
1.144PheTyr: 1.144 ± 0.393
0.0PheXaa: 0.0 ± 0.0
Gly
1.547GlyAla: 1.547 ± 0.35
0.874GlyCys: 0.874 ± 0.19
1.547GlyAsp: 1.547 ± 0.299
1.345GlyGlu: 1.345 ± 0.293
1.345GlyPhe: 1.345 ± 0.252
0.874GlyGly: 0.874 ± 0.344
1.278GlyHis: 1.278 ± 0.214
3.162GlyIle: 3.162 ± 0.237
2.825GlyLys: 2.825 ± 0.471
2.287GlyLeu: 2.287 ± 0.452
0.673GlyMet: 0.673 ± 0.132
2.22GlyAsn: 2.22 ± 0.316
1.009GlyPro: 1.009 ± 0.168
1.413GlyGln: 1.413 ± 0.42
1.009GlyArg: 1.009 ± 0.247
3.094GlySer: 3.094 ± 0.328
3.296GlyThr: 3.296 ± 0.19
2.018GlyVal: 2.018 ± 0.734
0.269GlyTrp: 0.269 ± 0.24
0.673GlyTyr: 0.673 ± 0.239
0.0GlyXaa: 0.0 ± 0.0
His
2.354HisAla: 2.354 ± 0.56
1.211HisCys: 1.211 ± 0.193
1.816HisAsp: 1.816 ± 0.409
2.422HisGlu: 2.422 ± 0.344
1.816HisPhe: 1.816 ± 0.397
2.018HisGly: 2.018 ± 0.355
2.354HisHis: 2.354 ± 0.674
2.623HisIle: 2.623 ± 0.582
1.614HisLys: 1.614 ± 0.36
2.758HisLeu: 2.758 ± 0.438
0.336HisMet: 0.336 ± 0.21
3.229HisAsn: 3.229 ± 0.509
0.471HisPro: 0.471 ± 0.18
1.883HisGln: 1.883 ± 0.256
1.076HisArg: 1.076 ± 0.274
2.22HisSer: 2.22 ± 0.322
1.951HisThr: 1.951 ± 0.206
1.278HisVal: 1.278 ± 0.136
0.269HisTrp: 0.269 ± 0.135
1.345HisTyr: 1.345 ± 0.255
0.0HisXaa: 0.0 ± 0.0
Ile
3.498IleAla: 3.498 ± 0.738
1.951IleCys: 1.951 ± 0.304
4.103IleAsp: 4.103 ± 0.609
3.969IleGlu: 3.969 ± 0.675
3.7IlePhe: 3.7 ± 0.514
2.422IleGly: 2.422 ± 0.293
2.556IleHis: 2.556 ± 0.362
5.92IleIle: 5.92 ± 0.871
5.045IleLys: 5.045 ± 0.579
6.458IleLeu: 6.458 ± 0.612
0.942IleMet: 0.942 ± 0.208
7.198IleAsn: 7.198 ± 0.958
1.951IlePro: 1.951 ± 0.435
2.691IleGln: 2.691 ± 0.225
3.431IleArg: 3.431 ± 0.204
5.852IleSer: 5.852 ± 1.54
5.381IleThr: 5.381 ± 0.457
3.632IleVal: 3.632 ± 0.287
0.605IleTrp: 0.605 ± 0.17
1.749IleTyr: 1.749 ± 0.324
0.0IleXaa: 0.0 ± 0.0
Lys
5.516LysAla: 5.516 ± 0.801
1.816LysCys: 1.816 ± 0.165
2.018LysAsp: 2.018 ± 0.217
1.883LysGlu: 1.883 ± 0.323
2.422LysPhe: 2.422 ± 0.354
0.874LysGly: 0.874 ± 0.175
2.489LysHis: 2.489 ± 0.458
3.027LysIle: 3.027 ± 0.622
2.623LysLys: 2.623 ± 0.363
6.189LysLeu: 6.189 ± 0.639
0.135LysMet: 0.135 ± 0.107
4.036LysAsn: 4.036 ± 0.501
4.709LysPro: 4.709 ± 0.742
4.843LysGln: 4.843 ± 0.537
1.278LysArg: 1.278 ± 0.203
3.094LysSer: 3.094 ± 0.177
5.65LysThr: 5.65 ± 0.54
3.902LysVal: 3.902 ± 0.61
0.269LysTrp: 0.269 ± 0.084
2.153LysTyr: 2.153 ± 0.338
0.0LysXaa: 0.0 ± 0.0
Leu
6.256LeuAla: 6.256 ± 0.789
1.951LeuCys: 1.951 ± 0.284
4.103LeuAsp: 4.103 ± 0.379
2.825LeuGlu: 2.825 ± 0.805
3.565LeuPhe: 3.565 ± 0.423
2.354LeuGly: 2.354 ± 0.571
2.22LeuHis: 2.22 ± 0.464
5.516LeuIle: 5.516 ± 0.595
4.103LeuLys: 4.103 ± 0.535
4.843LeuLeu: 4.843 ± 0.717
1.413LeuMet: 1.413 ± 0.351
5.718LeuAsn: 5.718 ± 0.438
4.44LeuPro: 4.44 ± 0.63
4.507LeuGln: 4.507 ± 0.395
3.969LeuArg: 3.969 ± 0.535
5.785LeuSer: 5.785 ± 0.376
7.534LeuThr: 7.534 ± 0.652
5.18LeuVal: 5.18 ± 0.363
0.404LeuTrp: 0.404 ± 0.123
3.834LeuTyr: 3.834 ± 0.185
0.0LeuXaa: 0.0 ± 0.0
Met
0.807MetAla: 0.807 ± 0.43
0.135MetCys: 0.135 ± 0.04
0.202MetAsp: 0.202 ± 0.112
0.269MetGlu: 0.269 ± 0.135
0.874MetPhe: 0.874 ± 0.16
0.0MetGly: 0.0 ± 0.0
0.135MetHis: 0.135 ± 0.087
0.673MetIle: 0.673 ± 0.163
0.605MetLys: 0.605 ± 0.147
2.489MetLeu: 2.489 ± 0.355
0.135MetMet: 0.135 ± 0.04
0.471MetAsn: 0.471 ± 0.161
0.807MetPro: 0.807 ± 0.551
1.345MetGln: 1.345 ± 0.358
0.404MetArg: 0.404 ± 0.145
0.605MetSer: 0.605 ± 0.385
1.951MetThr: 1.951 ± 0.357
0.673MetVal: 0.673 ± 0.167
0.067MetTrp: 0.067 ± 0.043
0.269MetTyr: 0.269 ± 0.099
0.0MetXaa: 0.0 ± 0.0
Asn
4.238AsnAla: 4.238 ± 0.422
2.085AsnCys: 2.085 ± 0.34
5.112AsnAsp: 5.112 ± 0.782
3.632AsnGlu: 3.632 ± 0.337
3.969AsnPhe: 3.969 ± 0.376
2.287AsnGly: 2.287 ± 0.246
3.431AsnHis: 3.431 ± 0.454
6.592AsnIle: 6.592 ± 0.455
4.507AsnLys: 4.507 ± 0.333
3.632AsnLeu: 3.632 ± 0.516
0.874AsnMet: 0.874 ± 0.206
6.458AsnAsn: 6.458 ± 0.376
1.48AsnPro: 1.48 ± 0.567
3.027AsnGln: 3.027 ± 0.165
3.162AsnArg: 3.162 ± 0.306
5.516AsnSer: 5.516 ± 0.686
8.005AsnThr: 8.005 ± 0.919
3.431AsnVal: 3.431 ± 0.316
1.076AsnTrp: 1.076 ± 0.35
2.825AsnTyr: 2.825 ± 0.39
0.0AsnXaa: 0.0 ± 0.0
Pro
3.7ProAla: 3.7 ± 0.53
1.547ProCys: 1.547 ± 0.617
2.287ProAsp: 2.287 ± 0.464
1.883ProGlu: 1.883 ± 0.287
0.673ProPhe: 0.673 ± 0.238
1.278ProGly: 1.278 ± 0.174
0.942ProHis: 0.942 ± 0.172
3.094ProIle: 3.094 ± 0.523
2.96ProLys: 2.96 ± 0.639
2.758ProLeu: 2.758 ± 0.462
0.336ProMet: 0.336 ± 0.428
1.682ProAsn: 1.682 ± 0.774
1.009ProPro: 1.009 ± 0.199
1.48ProGln: 1.48 ± 0.284
1.211ProArg: 1.211 ± 0.461
2.018ProSer: 2.018 ± 0.296
4.574ProThr: 4.574 ± 0.651
2.556ProVal: 2.556 ± 0.6
0.404ProTrp: 0.404 ± 0.076
1.48ProTyr: 1.48 ± 0.244
0.0ProXaa: 0.0 ± 0.0
Gln
4.574GlnAla: 4.574 ± 0.537
0.74GlnCys: 0.74 ± 0.231
2.691GlnAsp: 2.691 ± 0.298
2.287GlnGlu: 2.287 ± 0.238
1.48GlnPhe: 1.48 ± 0.177
1.413GlnGly: 1.413 ± 0.326
1.278GlnHis: 1.278 ± 0.315
3.296GlnIle: 3.296 ± 0.369
1.682GlnLys: 1.682 ± 0.683
7.601GlnLeu: 7.601 ± 0.625
0.538GlnMet: 0.538 ± 0.156
2.018GlnAsn: 2.018 ± 0.345
2.153GlnPro: 2.153 ± 0.411
4.171GlnGln: 4.171 ± 0.413
1.682GlnArg: 1.682 ± 0.373
2.96GlnSer: 2.96 ± 0.283
3.834GlnThr: 3.834 ± 0.627
2.018GlnVal: 2.018 ± 0.572
0.336GlnTrp: 0.336 ± 0.143
2.287GlnTyr: 2.287 ± 0.333
0.0GlnXaa: 0.0 ± 0.0
Arg
2.085ArgAla: 2.085 ± 0.46
1.009ArgCys: 1.009 ± 0.196
0.673ArgAsp: 0.673 ± 0.175
2.153ArgGlu: 2.153 ± 0.271
2.153ArgPhe: 2.153 ± 0.135
1.076ArgGly: 1.076 ± 0.196
1.211ArgHis: 1.211 ± 0.262
1.547ArgIle: 1.547 ± 0.287
2.893ArgLys: 2.893 ± 0.617
3.632ArgLeu: 3.632 ± 0.399
0.202ArgMet: 0.202 ± 0.208
3.902ArgAsn: 3.902 ± 0.504
1.547ArgPro: 1.547 ± 0.587
1.749ArgGln: 1.749 ± 0.586
1.345ArgArg: 1.345 ± 1.196
2.153ArgSer: 2.153 ± 0.622
3.431ArgThr: 3.431 ± 0.51
1.144ArgVal: 1.144 ± 0.62
0.135ArgTrp: 0.135 ± 0.087
1.345ArgTyr: 1.345 ± 0.249
0.0ArgXaa: 0.0 ± 0.0
Ser
4.238SerAla: 4.238 ± 0.346
1.682SerCys: 1.682 ± 0.834
4.978SerAsp: 4.978 ± 0.469
3.431SerGlu: 3.431 ± 0.273
2.422SerPhe: 2.422 ± 0.488
2.287SerGly: 2.287 ± 0.304
2.153SerHis: 2.153 ± 0.292
5.18SerIle: 5.18 ± 1.142
4.709SerLys: 4.709 ± 0.296
4.843SerLeu: 4.843 ± 0.334
0.404SerMet: 0.404 ± 0.154
8.61SerAsn: 8.61 ± 0.761
2.691SerPro: 2.691 ± 0.301
3.902SerGln: 3.902 ± 0.475
2.153SerArg: 2.153 ± 0.358
8.543SerSer: 8.543 ± 0.692
10.897SerThr: 10.897 ± 0.801
3.094SerVal: 3.094 ± 0.471
0.74SerTrp: 0.74 ± 0.161
2.893SerTyr: 2.893 ± 0.224
0.0SerXaa: 0.0 ± 0.0
Thr
8.341ThrAla: 8.341 ± 0.457
2.623ThrCys: 2.623 ± 0.435
4.709ThrAsp: 4.709 ± 0.584
4.911ThrGlu: 4.911 ± 0.456
2.623ThrPhe: 2.623 ± 0.722
3.027ThrGly: 3.027 ± 0.374
2.287ThrHis: 2.287 ± 0.313
8.543ThrIle: 8.543 ± 0.176
6.054ThrLys: 6.054 ± 0.72
5.92ThrLeu: 5.92 ± 0.503
0.605ThrMet: 0.605 ± 0.171
7.13ThrAsn: 7.13 ± 0.36
4.776ThrPro: 4.776 ± 0.507
2.489ThrGln: 2.489 ± 0.167
3.296ThrArg: 3.296 ± 0.296
11.166ThrSer: 11.166 ± 1.149
13.588ThrThr: 13.588 ± 1.32
6.861ThrVal: 6.861 ± 0.65
1.009ThrTrp: 1.009 ± 0.454
2.489ThrTyr: 2.489 ± 0.378
0.0ThrXaa: 0.0 ± 0.0
Val
2.691ValAla: 2.691 ± 0.717
1.413ValCys: 1.413 ± 0.16
2.085ValAsp: 2.085 ± 0.192
1.883ValGlu: 1.883 ± 0.478
2.085ValPhe: 2.085 ± 0.373
1.48ValGly: 1.48 ± 0.349
1.951ValHis: 1.951 ± 0.295
4.171ValIle: 4.171 ± 0.462
3.767ValLys: 3.767 ± 0.783
4.036ValLeu: 4.036 ± 0.481
0.471ValMet: 0.471 ± 0.135
3.834ValAsn: 3.834 ± 0.735
2.825ValPro: 2.825 ± 0.378
2.556ValGln: 2.556 ± 0.7
2.153ValArg: 2.153 ± 0.284
3.767ValSer: 3.767 ± 0.41
5.449ValThr: 5.449 ± 0.351
1.951ValVal: 1.951 ± 0.411
0.135ValTrp: 0.135 ± 0.193
2.623ValTyr: 2.623 ± 0.297
0.0ValXaa: 0.0 ± 0.0
Trp
0.202TrpAla: 0.202 ± 0.13
0.538TrpCys: 0.538 ± 0.153
0.874TrpAsp: 0.874 ± 0.173
0.336TrpGlu: 0.336 ± 0.169
1.278TrpPhe: 1.278 ± 0.172
0.404TrpGly: 0.404 ± 0.145
0.538TrpHis: 0.538 ± 0.171
0.269TrpIle: 0.269 ± 0.166
0.404TrpLys: 0.404 ± 0.183
0.74TrpLeu: 0.74 ± 0.121
0.067TrpMet: 0.067 ± 0.159
0.135TrpAsn: 0.135 ± 0.215
0.067TrpPro: 0.067 ± 0.201
0.404TrpGln: 0.404 ± 0.123
0.538TrpArg: 0.538 ± 0.2
0.605TrpSer: 0.605 ± 0.126
0.605TrpThr: 0.605 ± 0.5
0.471TrpVal: 0.471 ± 0.13
0.135TrpTrp: 0.135 ± 0.04
0.269TrpTyr: 0.269 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.758TyrAla: 2.758 ± 0.279
1.345TyrCys: 1.345 ± 0.375
1.48TyrAsp: 1.48 ± 0.186
1.816TyrGlu: 1.816 ± 0.335
2.287TyrPhe: 2.287 ± 0.233
2.153TyrGly: 2.153 ± 0.283
1.883TyrHis: 1.883 ± 0.255
2.018TyrIle: 2.018 ± 0.219
2.623TyrLys: 2.623 ± 0.628
2.96TyrLeu: 2.96 ± 0.258
1.076TyrMet: 1.076 ± 0.617
3.229TyrAsn: 3.229 ± 0.788
1.144TyrPro: 1.144 ± 0.272
0.942TyrGln: 0.942 ± 0.418
1.076TyrArg: 1.076 ± 0.235
2.354TyrSer: 2.354 ± 0.514
2.96TyrThr: 2.96 ± 0.166
1.547TyrVal: 1.547 ± 0.355
0.135TyrTrp: 0.135 ± 0.04
1.009TyrTyr: 1.009 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (14867 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski