Amino acid dipeptide frequency for Wuhan Japanese halfbeak arterivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
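A minimal sketch of such a calculation is given here. It assumes that dipeptides are overlapping adjacent pairs counted within each protein and that frequencies are normalised by the total number of pairs; the database's exact normalisation is not stated on this page and may differ. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and one-letter residue codes are used instead of the three-letter codes in the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs inside one protein; a pair never spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MALLS", "LLSA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because every pair is counted once, the returned frequencies sum to 1000 ‰.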

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
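The arithmetic behind the expected value can be sketched as follows. The comparison rule (strictly above or below the uniform expectation) is an assumption; the page does not say which threshold the colouring actually uses. The function names are hypothetical, and the two example values are taken from the table (LeuSer and AlaHis).

```python
STANDARD_PAIRS = 20 * 20   # 400 dipeptides from the 20 standard amino acids
WITH_XAA_PAIRS = 21 * 21   # 441 if Xaa is counted as a 21st residue


def uniform_expectation_per_mille(n_pairs=STANDARD_PAIRS):
    """Frequency of each dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / n_pairs


def colour(observed_per_mille, expected_per_mille=None):
    """Assumed rule: 'red' above the expectation, 'blue' below, 'neutral' if equal."""
    if expected_per_mille is None:
        expected_per_mille = uniform_expectation_per_mille()
    if observed_per_mille > expected_per_mille:
        return "red"
    if observed_per_mille < expected_per_mille:
        return "blue"
    return "neutral"


print(uniform_expectation_per_mille())   # 2.5 per mille, i.e. 0.25%
print(colour(12.148))                    # LeuSer value from the table
print(colour(0.368))                     # AlaHis value from the table
```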

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
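As a small illustration of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value / 10.0


ala_ala = 6.442  # AlaAla, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```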

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.442 ± 2.693
AlaCys: 2.025 ± 1.003
AlaAsp: 1.288 ± 0.354
AlaGlu: 2.025 ± 0.538
AlaPhe: 4.233 ± 1.227
AlaGly: 3.497 ± 0.889
AlaHis: 0.368 ± 0.245
AlaIle: 3.497 ± 0.802
AlaLys: 3.681 ± 0.607
AlaLeu: 9.019 ± 2.137
AlaMet: 1.657 ± 0.459
AlaAsn: 1.472 ± 0.358
AlaPro: 4.786 ± 1.646
AlaGln: 2.577 ± 0.81
AlaArg: 2.577 ± 0.581
AlaSer: 4.602 ± 0.547
AlaThr: 4.602 ± 0.663
AlaVal: 4.602 ± 1.269
AlaTrp: 0.552 ± 0.545
AlaTyr: 2.945 ± 0.662
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.209 ± 0.62
CysCys: 1.288 ± 0.883
CysAsp: 1.657 ± 0.372
CysGlu: 0.736 ± 0.489
CysPhe: 0.184 ± 0.152
CysGly: 3.129 ± 0.45
CysHis: 0.736 ± 0.176
CysIle: 1.288 ± 0.375
CysLys: 1.841 ± 0.293
CysLeu: 3.313 ± 0.719
CysMet: 0.736 ± 0.259
CysAsn: 0.552 ± 0.216
CysPro: 2.209 ± 0.992
CysGln: 1.288 ± 0.29
CysArg: 0.552 ± 0.15
CysSer: 1.841 ± 0.555
CysThr: 2.577 ± 0.946
CysVal: 2.761 ± 0.75
CysTrp: 1.104 ± 0.882
CysTyr: 2.393 ± 1.146
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.761 ± 0.923
AspCys: 0.92 ± 0.214
AspAsp: 4.049 ± 0.498
AspGlu: 1.104 ± 0.431
AspPhe: 4.233 ± 0.706
AspGly: 3.497 ± 0.565
AspHis: 2.025 ± 0.436
AspIle: 2.393 ± 0.703
AspLys: 0.552 ± 0.367
AspLeu: 4.233 ± 0.533
AspMet: 1.104 ± 0.653
AspAsn: 1.104 ± 0.428
AspPro: 2.761 ± 0.727
AspGln: 0.736 ± 0.176
AspArg: 2.393 ± 0.634
AspSer: 4.417 ± 1.058
AspThr: 2.761 ± 0.428
AspVal: 3.497 ± 1.325
AspTrp: 1.288 ± 0.375
AspTyr: 0.736 ± 0.259
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.313 ± 1.04
GluCys: 0.552 ± 0.216
GluAsp: 1.841 ± 0.777
GluGlu: 0.92 ± 0.214
GluPhe: 1.472 ± 0.762
GluGly: 3.313 ± 0.744
GluHis: 2.393 ± 0.687
GluIle: 2.209 ± 0.6
GluLys: 1.841 ± 0.359
GluLeu: 2.761 ± 0.878
GluMet: 0.736 ± 0.903
GluAsn: 0.552 ± 0.216
GluPro: 2.209 ± 0.889
GluGln: 0.736 ± 0.176
GluArg: 2.393 ± 0.571
GluSer: 2.025 ± 0.456
GluThr: 0.92 ± 0.376
GluVal: 4.417 ± 1.004
GluTrp: 0.736 ± 0.176
GluTyr: 1.288 ± 0.617
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.681 ± 1.093
PheCys: 2.209 ± 1.087
PheAsp: 2.761 ± 0.334
PheGlu: 1.288 ± 0.375
PhePhe: 3.129 ± 2.302
PheGly: 4.049 ± 0.567
PheHis: 0.184 ± 0.122
PheIle: 1.288 ± 0.405
PheLys: 1.657 ± 0.339
PheLeu: 3.681 ± 0.807
PheMet: 1.104 ± 0.431
PheAsn: 2.393 ± 0.631
PhePro: 2.761 ± 1.215
PheGln: 1.472 ± 0.518
PheArg: 1.841 ± 0.703
PheSer: 5.154 ± 1.186
PheThr: 3.865 ± 0.575
PheVal: 3.129 ± 1.752
PheTrp: 0.736 ± 0.808
PheTyr: 1.472 ± 0.358
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.786 ± 1.208
GlyCys: 2.577 ± 0.486
GlyAsp: 3.865 ± 1.411
GlyGlu: 3.129 ± 0.715
GlyPhe: 4.602 ± 1.457
GlyGly: 4.049 ± 4.614
GlyHis: 1.841 ± 1.002
GlyIle: 2.393 ± 0.631
GlyLys: 3.313 ± 3.507
GlyLeu: 4.417 ± 0.854
GlyMet: 1.288 ± 0.29
GlyAsn: 2.025 ± 2.54
GlyPro: 4.417 ± 0.398
GlyGln: 1.472 ± 0.707
GlyArg: 3.313 ± 0.964
GlySer: 6.442 ± 0.859
GlyThr: 4.049 ± 0.734
GlyVal: 8.283 ± 1.899
GlyTrp: 0.736 ± 0.176
GlyTyr: 4.417 ± 1.307
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.104 ± 0.496
HisCys: 0.368 ± 0.501
HisAsp: 0.736 ± 0.176
HisGlu: 0.736 ± 0.259
HisPhe: 0.736 ± 1.532
HisGly: 2.025 ± 0.346
HisHis: 1.472 ± 0.738
HisIle: 0.736 ± 0.259
HisLys: 0.368 ± 0.088
HisLeu: 1.288 ± 0.29
HisMet: 0.368 ± 0.088
HisAsn: 0.736 ± 0.176
HisPro: 1.472 ± 0.353
HisGln: 0.92 ± 0.214
HisArg: 1.104 ± 0.265
HisSer: 2.393 ± 0.581
HisThr: 2.393 ± 0.375
HisVal: 2.393 ± 0.375
HisTrp: 0.552 ± 0.455
HisTyr: 0.92 ± 0.407
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.577 ± 1.482
IleCys: 1.288 ± 0.354
IleAsp: 2.025 ± 0.506
IleGlu: 1.472 ± 0.738
IlePhe: 2.025 ± 0.872
IleGly: 3.313 ± 0.597
IleHis: 0.92 ± 0.214
IleIle: 2.393 ± 0.893
IleLys: 2.945 ± 0.537
IleLeu: 1.841 ± 0.293
IleMet: 0.552 ± 0.15
IleAsn: 1.288 ± 0.512
IlePro: 4.049 ± 0.913
IleGln: 1.657 ± 0.45
IleArg: 0.92 ± 0.612
IleSer: 4.233 ± 0.977
IleThr: 2.577 ± 0.655
IleVal: 2.761 ± 0.878
IleTrp: 1.288 ± 0.732
IleTyr: 0.736 ± 0.489
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.129 ± 1.394
LysCys: 1.657 ± 0.635
LysAsp: 2.025 ± 1.825
LysGlu: 1.472 ± 1.421
LysPhe: 2.577 ± 1.018
LysGly: 2.577 ± 0.749
LysHis: 1.104 ± 0.35
LysIle: 1.657 ± 0.339
LysLys: 1.841 ± 1.648
LysLeu: 6.074 ± 1.043
LysMet: 0.552 ± 0.15
LysAsn: 1.841 ± 0.829
LysPro: 3.129 ± 0.946
LysGln: 2.209 ± 0.638
LysArg: 2.945 ± 0.662
LysSer: 4.049 ± 0.734
LysThr: 3.129 ± 1.546
LysVal: 1.841 ± 0.427
LysTrp: 0.184 ± 0.122
LysTyr: 0.92 ± 0.804
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.178 ± 2.27
LeuCys: 3.497 ± 1.725
LeuAsp: 2.393 ± 0.571
LeuGlu: 3.497 ± 0.728
LeuPhe: 3.865 ± 0.956
LeuGly: 6.626 ± 0.706
LeuHis: 2.025 ± 0.721
LeuIle: 4.417 ± 0.994
LeuLys: 4.233 ± 0.962
LeuLeu: 10.307 ± 5.332
LeuMet: 1.841 ± 0.739
LeuAsn: 2.577 ± 0.519
LeuPro: 8.099 ± 0.987
LeuGln: 3.129 ± 0.849
LeuArg: 3.497 ± 0.586
LeuSer: 12.148 ± 1.322
LeuThr: 7.915 ± 1.336
LeuVal: 7.546 ± 0.782
LeuTrp: 1.104 ± 0.265
LeuTyr: 2.025 ± 0.744
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.472 ± 0.353
MetCys: 0.92 ± 0.512
MetAsp: 1.472 ± 1.062
MetGlu: 0.736 ± 0.362
MetPhe: 0.92 ± 0.376
MetGly: 0.92 ± 0.214
MetHis: 0.0 ± 0.0
MetIle: 0.736 ± 0.362
MetLys: 1.104 ± 0.831
MetLeu: 2.577 ± 0.818
MetMet: 0.552 ± 0.439
MetAsn: 0.184 ± 0.122
MetPro: 0.736 ± 0.362
MetGln: 0.184 ± 0.122
MetArg: 0.92 ± 0.214
MetSer: 1.472 ± 0.518
MetThr: 1.472 ± 0.31
MetVal: 0.92 ± 0.293
MetTrp: 0.368 ± 0.088
MetTyr: 0.552 ± 0.439
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.209 ± 0.6
AsnCys: 0.736 ± 0.489
AsnAsp: 1.104 ± 0.3
AsnGlu: 0.736 ± 0.176
AsnPhe: 1.104 ± 0.265
AsnGly: 2.761 ± 2.642
AsnHis: 0.552 ± 0.216
AsnIle: 1.288 ± 0.375
AsnLys: 0.736 ± 0.625
AsnLeu: 2.577 ± 0.519
AsnMet: 0.736 ± 0.176
AsnAsn: 1.104 ± 2.015
AsnPro: 2.945 ± 0.523
AsnGln: 1.104 ± 0.265
AsnArg: 0.92 ± 0.214
AsnSer: 1.657 ± 2.007
AsnThr: 3.497 ± 0.802
AsnVal: 3.313 ± 0.744
AsnTrp: 1.104 ± 0.662
AsnTyr: 0.92 ± 0.214
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.865 ± 0.616
ProCys: 0.736 ± 0.176
ProAsp: 3.129 ± 0.628
ProGlu: 2.393 ± 0.542
ProPhe: 2.945 ± 0.687
ProGly: 4.602 ± 0.437
ProHis: 0.92 ± 0.376
ProIle: 2.209 ± 0.502
ProLys: 4.786 ± 0.845
ProLeu: 6.626 ± 1.029
ProMet: 0.736 ± 0.259
ProAsn: 1.472 ± 0.31
ProPro: 6.81 ± 2.838
ProGln: 1.657 ± 0.946
ProArg: 2.761 ± 1.56
ProSer: 5.522 ± 1.76
ProThr: 7.731 ± 1.852
ProVal: 7.178 ± 1.163
ProTrp: 1.104 ± 1.08
ProTyr: 2.025 ± 0.632
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.393 ± 0.604
GlnCys: 1.288 ± 0.405
GlnAsp: 1.104 ± 0.3
GlnGlu: 2.209 ± 0.6
GlnPhe: 0.92 ± 0.214
GlnGly: 2.209 ± 0.992
GlnHis: 0.736 ± 0.176
GlnIle: 1.657 ± 0.339
GlnLys: 1.104 ± 0.771
GlnLeu: 4.233 ± 1.321
GlnMet: 0.368 ± 0.303
GlnAsn: 0.736 ± 0.259
GlnPro: 0.92 ± 0.376
GlnGln: 1.104 ± 0.3
GlnArg: 0.184 ± 0.122
GlnSer: 4.049 ± 1.247
GlnThr: 1.657 ± 0.779
GlnVal: 1.288 ± 0.617
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.104 ± 0.265
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.393 ± 0.604
ArgCys: 1.472 ± 0.394
ArgAsp: 0.92 ± 0.376
ArgGlu: 1.841 ± 0.427
ArgPhe: 2.761 ± 0.641
ArgGly: 2.577 ± 1.592
ArgHis: 1.104 ± 0.3
ArgIle: 0.92 ± 0.388
ArgLys: 1.657 ± 0.459
ArgLeu: 5.338 ± 1.359
ArgMet: 0.552 ± 0.434
ArgAsn: 2.025 ± 0.724
ArgPro: 3.497 ± 1.185
ArgGln: 1.104 ± 0.265
ArgArg: 3.681 ± 1.563
ArgSer: 3.313 ± 1.069
ArgThr: 3.129 ± 0.478
ArgVal: 4.049 ± 1.013
ArgTrp: 0.92 ± 0.214
ArgTyr: 1.472 ± 0.31
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.522 ± 1.661
SerCys: 3.865 ± 0.645
SerAsp: 6.994 ± 0.882
SerGlu: 4.233 ± 1.422
SerPhe: 2.025 ± 0.721
SerGly: 8.099 ± 2.045
SerHis: 2.025 ± 0.456
SerIle: 2.761 ± 0.641
SerLys: 4.417 ± 1.199
SerLeu: 9.387 ± 1.233
SerMet: 1.104 ± 0.678
SerAsn: 2.761 ± 0.641
SerPro: 4.233 ± 1.302
SerGln: 2.393 ± 1.496
SerArg: 3.865 ± 0.871
SerSer: 10.307 ± 1.903
SerThr: 5.89 ± 1.181
SerVal: 7.546 ± 1.106
SerTrp: 1.472 ± 0.858
SerTyr: 3.129 ± 0.478
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.049 ± 0.989
ThrCys: 2.025 ± 0.538
ThrAsp: 3.497 ± 0.643
ThrGlu: 2.577 ± 0.707
ThrPhe: 3.497 ± 0.488
ThrGly: 6.258 ± 0.788
ThrHis: 1.104 ± 0.265
ThrIle: 3.313 ± 0.541
ThrLys: 3.313 ± 1.18
ThrLeu: 8.835 ± 1.815
ThrMet: 1.841 ± 0.293
ThrAsn: 3.129 ± 0.804
ThrPro: 4.97 ± 1.629
ThrGln: 1.288 ± 0.29
ThrArg: 4.049 ± 1.013
ThrSer: 7.546 ± 2.13
ThrThr: 6.626 ± 0.556
ThrVal: 5.89 ± 1.003
ThrTrp: 0.736 ± 0.405
ThrTyr: 1.104 ± 0.3
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.338 ± 0.322
ValCys: 3.313 ± 1.069
ValAsp: 3.681 ± 1.107
ValGlu: 4.233 ± 0.882
ValPhe: 3.681 ± 0.854
ValGly: 4.049 ± 1.251
ValHis: 1.472 ± 0.685
ValIle: 2.577 ± 0.707
ValLys: 3.129 ± 0.739
ValLeu: 7.731 ± 1.232
ValMet: 1.104 ± 0.3
ValAsn: 3.497 ± 0.892
ValPro: 6.074 ± 0.96
ValGln: 2.761 ± 0.717
ValArg: 3.865 ± 0.516
ValSer: 7.546 ± 1.435
ValThr: 6.81 ± 1.026
ValVal: 8.651 ± 0.807
ValTrp: 0.736 ± 0.489
ValTyr: 2.945 ± 0.723
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.552 ± 0.439
TrpCys: 0.368 ± 0.088
TrpAsp: 0.552 ± 0.455
TrpGlu: 0.368 ± 0.088
TrpPhe: 1.288 ± 0.743
TrpGly: 1.104 ± 0.265
TrpHis: 0.368 ± 0.303
TrpIle: 0.92 ± 0.728
TrpLys: 0.736 ± 0.362
TrpLeu: 1.104 ± 1.077
TrpMet: 0.552 ± 0.216
TrpAsn: 0.552 ± 0.367
TrpPro: 0.92 ± 0.293
TrpGln: 0.368 ± 0.501
TrpArg: 1.657 ± 0.339
TrpSer: 0.92 ± 0.776
TrpThr: 0.92 ± 0.293
TrpVal: 1.288 ± 0.375
TrpTrp: 0.368 ± 0.914
TrpTyr: 0.736 ± 0.176
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.472 ± 0.417
TyrCys: 1.288 ± 0.29
TyrAsp: 1.472 ± 0.518
TyrGlu: 0.92 ± 0.376
TyrPhe: 1.657 ± 1.382
TyrGly: 2.761 ± 0.801
TyrHis: 1.472 ± 0.707
TyrIle: 2.209 ± 0.343
TyrLys: 1.657 ± 0.948
TyrLeu: 2.577 ± 0.334
TyrMet: 0.552 ± 0.439
TyrAsn: 1.104 ± 0.639
TyrPro: 2.209 ± 0.992
TyrGln: 1.104 ± 0.496
TyrArg: 1.288 ± 0.29
TyrSer: 2.577 ± 0.904
TyrThr: 3.129 ± 0.894
TyrVal: 1.841 ± 0.441
TyrTrp: 0.552 ± 0.367
TyrTyr: 1.288 ± 0.854
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (5434 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
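One way a protein-level bootstrap of this kind could look is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported. Treating the reported error as the sample standard deviation is an assumption, as are the function names, the seed, and the toy sequences (one-letter codes); the database's actual procedure is not described on this page.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(sequences):
    """Per mille frequency of each overlapping adjacent residue pair."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of one pair's frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequencies(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
proteome = ["MALLSLLA", "MKVLSA", "MLLLSTT", "MASTLL", "MSSLLK"]
print(round(bootstrap_error(proteome, "LL"), 3))
```

A pair that never occurs has a frequency of 0 in every replicate, so its error is also 0, which matches the `0.0 ± 0.0` entries for the Xaa dipeptides.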

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski