Amino acid dipeptide frequency for Arteriviridae sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue, in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
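The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: the function name `dipeptide_permille`, the toy sequences, the choice to count overlapping pairs within each protein only, and the normalization by the total number of counted pairs are all assumptions, and the database may normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumptions (not taken from the source page):
    - any letter outside the 20 standard ones is mapped to 'X' (Xaa),
    - overlapping pairs are counted within each protein, never across
      protein boundaries,
    - frequencies are normalized by the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input for illustration only, not real Arteriviridae sequences.
freqs = dipeptide_permille(["MALVAL", "AAL"])

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25%.
expected_uniform = 1000.0 / 400
print(freqs["AL"], expected_uniform)
```

A value above `expected_uniform` corresponds to a dipeptide that would be marked as overrepresented, a value below it to an underrepresented one.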

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows the frequency ± error in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.099 ± 0.619 | 3.645 ± 1.024 | 2.686 ± 0.718 | 2.878 ± 0.608 | 5.18 ± 1.537 | 6.715 ± 1.025 | 2.494 ± 0.901 | 4.605 ± 1.414 | 4.988 ± 0.797 | 6.907 ± 0.996 | 1.535 ± 0.56 | 2.111 ± 0.757 | 4.221 ± 0.721 | 2.878 ± 0.805 | 3.454 ± 0.506 | 4.605 ± 0.856 | 4.988 ± 0.713 | 8.25 ± 0.962 | 0.959 ± 0.602 | 1.919 ± 1.018 | 0.0 ± 0.0 |
| Cys | 2.686 ± 0.801 | 1.343 ± 0.613 | 1.727 ± 0.48 | 1.343 ± 0.249 | 1.919 ± 0.6 | 2.111 ± 0.503 | 1.151 ± 0.263 | 0.959 ± 0.31 | 1.343 ± 0.296 | 3.837 ± 0.741 | 0.576 ± 0.319 | 1.151 ± 0.376 | 2.111 ± 0.374 | 0.384 ± 0.419 | 3.454 ± 0.847 | 2.878 ± 0.888 | 2.686 ± 0.368 | 1.343 ± 0.547 | 0.959 ± 0.48 | 1.343 ± 0.452 | 0.0 ± 0.0 |
| Asp | 1.919 ± 0.559 | 0.959 ± 0.31 | 2.878 ± 0.678 | 1.919 ± 0.518 | 2.686 ± 0.67 | 3.454 ± 0.799 | 1.151 ± 0.556 | 2.111 ± 0.949 | 2.494 ± 0.531 | 6.715 ± 1.927 | 0.959 ± 0.29 | 0.576 ± 0.277 | 4.029 ± 0.971 | 1.151 ± 0.694 | 1.727 ± 0.728 | 2.878 ± 0.726 | 3.837 ± 0.457 | 4.029 ± 0.927 | 1.343 ± 0.545 | 0.959 ± 0.449 | 0.0 ± 0.0 |
| Glu | 3.454 ± 1.151 | 1.343 ± 0.3 | 2.686 ± 0.425 | 1.919 ± 0.854 | 1.343 ± 0.478 | 2.686 ± 0.669 | 1.343 ± 0.296 | 1.343 ± 0.695 | 2.111 ± 0.5 | 3.454 ± 0.419 | 0.192 ± 0.124 | 1.151 ± 0.376 | 2.878 ± 1.21 | 2.494 ± 0.587 | 1.535 ± 0.667 | 2.111 ± 0.496 | 2.302 ± 1.055 | 3.454 ± 1.36 | 0.767 ± 0.415 | 1.151 ± 0.958 | 0.0 ± 0.0 |
| Phe | 2.686 ± 0.805 | 2.111 ± 1.249 | 1.343 ± 0.513 | 2.878 ± 0.678 | 1.727 ± 0.599 | 3.262 ± 0.832 | 0.959 ± 0.56 | 1.535 ± 0.939 | 1.919 ± 0.704 | 4.988 ± 0.883 | 0.767 ± 0.352 | 0.959 ± 0.61 | 2.878 ± 0.595 | 0.767 ± 0.224 | 1.535 ± 0.345 | 3.454 ± 0.811 | 3.262 ± 0.966 | 4.029 ± 1.12 | 0.384 ± 0.27 | 1.151 ± 0.515 | 0.0 ± 0.0 |
| Gly | 5.948 ± 0.785 | 2.494 ± 0.536 | 4.221 ± 1.087 | 2.111 ± 0.439 | 4.221 ± 0.501 | 4.413 ± 1.139 | 2.111 ± 0.602 | 3.262 ± 0.847 | 4.605 ± 0.93 | 6.332 ± 1.612 | 0.576 ± 0.188 | 2.686 ± 0.458 | 2.878 ± 0.886 | 1.727 ± 0.359 | 4.413 ± 0.979 | 5.372 ± 0.673 | 4.605 ± 0.58 | 4.797 ± 0.538 | 1.151 ± 0.347 | 2.302 ± 0.398 | 0.0 ± 0.0 |
| His | 1.535 ± 0.63 | 1.343 ± 0.288 | 0.767 ± 0.224 | 0.767 ± 0.408 | 0.959 ± 0.741 | 2.878 ± 0.602 | 1.151 ± 0.373 | 1.727 ± 0.537 | 1.151 ± 0.376 | 2.494 ± 0.54 | 1.151 ± 0.497 | 0.767 ± 0.351 | 2.111 ± 0.433 | 1.343 ± 0.494 | 1.151 ± 1.014 | 0.959 ± 0.581 | 2.111 ± 0.521 | 2.878 ± 0.881 | 0.959 ± 0.449 | 0.959 ± 0.678 | 0.0 ± 0.0 |
| Ile | 4.029 ± 0.818 | 1.151 ± 0.248 | 2.302 ± 0.628 | 3.07 ± 0.961 | 1.343 ± 0.397 | 2.878 ± 0.619 | 1.727 ± 0.363 | 2.111 ± 0.377 | 0.576 ± 0.246 | 4.988 ± 1.261 | 0.767 ± 0.537 | 0.767 ± 0.5 | 2.111 ± 0.45 | 1.919 ± 0.531 | 2.494 ± 0.75 | 1.151 ± 0.746 | 3.262 ± 0.724 | 3.837 ± 1.036 | 0.576 ± 0.314 | 1.919 ± 0.729 | 0.0 ± 0.0 |
| Lys | 1.919 ± 0.383 | 1.151 ± 0.347 | 2.302 ± 0.964 | 3.262 ± 0.551 | 1.919 ± 0.679 | 3.645 ± 0.572 | 1.535 ± 0.406 | 1.727 ± 0.541 | 3.837 ± 0.857 | 3.07 ± 0.798 | 1.727 ± 0.591 | 1.919 ± 0.533 | 1.919 ± 0.629 | 2.878 ± 0.542 | 1.151 ± 0.376 | 2.494 ± 0.9 | 2.302 ± 0.876 | 4.029 ± 1.07 | 0.767 ± 0.379 | 2.302 ± 0.743 | 0.0 ± 0.0 |
| Leu | 11.32 ± 2.023 | 4.413 ± 1.189 | 6.14 ± 1.789 | 3.645 ± 0.749 | 3.645 ± 2.269 | 6.523 ± 1.16 | 1.343 ± 0.642 | 3.837 ± 0.821 | 3.07 ± 0.421 | 9.785 ± 2.736 | 2.111 ± 0.684 | 2.686 ± 0.382 | 6.332 ± 1.145 | 3.454 ± 0.725 | 5.372 ± 1.055 | 6.907 ± 0.94 | 9.21 ± 0.788 | 4.988 ± 1.561 | 1.535 ± 0.663 | 1.343 ± 1.142 | 0.0 ± 0.0 |
| Met | 2.111 ± 0.835 | 0.576 ± 0.305 | 1.151 ± 0.248 | 0.767 ± 0.286 | 0.384 ± 0.249 | 2.111 ± 0.895 | 0.767 ± 0.379 | 1.343 ± 0.425 | 0.959 ± 0.514 | 1.151 ± 0.369 | 0.384 ± 0.249 | 1.151 ± 0.429 | 0.959 ± 0.5 | 0.384 ± 0.249 | 0.384 ± 0.249 | 1.343 ± 0.57 | 1.535 ± 0.732 | 2.111 ± 0.87 | 0.767 ± 0.286 | 0.576 ± 0.314 | 0.0 ± 0.0 |
| Asn | 1.151 ± 0.359 | 1.343 ± 0.494 | 0.576 ± 0.188 | 0.959 ± 0.476 | 0.959 ± 0.397 | 1.919 ± 0.496 | 0.959 ± 0.546 | 0.767 ± 0.286 | 1.343 ± 0.502 | 2.686 ± 1.022 | 0.576 ± 0.203 | 0.767 ± 0.568 | 1.535 ± 0.505 | 1.535 ± 0.931 | 2.111 ± 0.546 | 1.919 ± 0.537 | 1.727 ± 0.436 | 3.262 ± 0.548 | 0.384 ± 0.247 | 1.151 ± 0.261 | 0.0 ± 0.0 |
| Pro | 4.605 ± 0.767 | 2.111 ± 0.563 | 3.645 ± 0.929 | 3.07 ± 1.139 | 2.494 ± 0.953 | 4.797 ± 1.507 | 2.111 ± 0.514 | 3.262 ± 0.548 | 3.262 ± 0.713 | 6.332 ± 1.341 | 0.576 ± 0.26 | 1.919 ± 0.483 | 6.523 ± 1.856 | 2.111 ± 0.543 | 2.494 ± 0.68 | 4.988 ± 1.042 | 4.029 ± 1.173 | 6.332 ± 0.676 | 1.151 ± 0.475 | 1.919 ± 0.929 | 0.0 ± 0.0 |
| Gln | 3.262 ± 0.568 | 0.959 ± 0.689 | 2.302 ± 0.869 | 0.767 ± 0.401 | 1.151 ± 0.402 | 2.111 ± 0.452 | 2.111 ± 0.424 | 1.535 ± 0.43 | 1.343 ± 0.425 | 3.837 ± 0.971 | 0.959 ± 0.264 | 0.959 ± 0.3 | 2.494 ± 0.495 | 1.727 ± 1.104 | 2.302 ± 0.659 | 1.535 ± 0.491 | 2.494 ± 0.981 | 3.07 ± 0.947 | 0.959 ± 0.357 | 1.343 ± 0.427 | 0.0 ± 0.0 |
| Arg | 4.988 ± 0.849 | 1.535 ± 0.456 | 2.111 ± 0.407 | 1.535 ± 0.492 | 2.111 ± 0.87 | 3.837 ± 0.541 | 0.959 ± 0.506 | 2.111 ± 0.444 | 2.302 ± 0.932 | 4.797 ± 0.866 | 1.727 ± 1.12 | 1.535 ± 0.438 | 4.029 ± 0.804 | 2.494 ± 0.547 | 3.645 ± 1.506 | 2.686 ± 0.554 | 2.686 ± 0.587 | 3.262 ± 0.468 | 1.151 ± 0.402 | 1.151 ± 0.532 | 0.0 ± 0.0 |
| Ser | 4.988 ± 1.309 | 1.151 ± 0.537 | 3.07 ± 0.66 | 2.111 ± 0.779 | 1.919 ± 0.838 | 5.756 ± 1.206 | 1.151 ± 0.604 | 1.919 ± 0.726 | 3.262 ± 0.781 | 6.332 ± 1.654 | 1.919 ± 0.717 | 1.727 ± 0.466 | 4.797 ± 0.83 | 3.07 ± 0.607 | 2.302 ± 0.6 | 5.18 ± 1.39 | 4.413 ± 0.857 | 5.948 ± 0.73 | 1.919 ± 0.682 | 1.535 ± 0.745 | 0.0 ± 0.0 |
| Thr | 6.332 ± 1.057 | 2.878 ± 0.887 | 1.727 ± 0.794 | 1.343 ± 0.452 | 1.919 ± 0.761 | 3.837 ± 0.771 | 2.494 ± 0.432 | 2.494 ± 0.878 | 2.878 ± 0.471 | 4.029 ± 0.849 | 1.535 ± 0.383 | 1.535 ± 0.415 | 9.21 ± 1.825 | 2.494 ± 1.026 | 4.413 ± 0.75 | 5.564 ± 0.81 | 3.645 ± 0.777 | 5.372 ± 0.782 | 0.767 ± 0.261 | 2.302 ± 0.633 | 0.0 ± 0.0 |
| Val | 8.25 ± 1.091 | 3.454 ± 1.181 | 4.029 ± 1.041 | 3.645 ± 0.328 | 4.413 ± 1.321 | 5.18 ± 1.199 | 1.727 ± 0.66 | 3.07 ± 0.473 | 3.07 ± 1.034 | 10.169 ± 1.829 | 1.727 ± 0.352 | 1.919 ± 0.64 | 5.18 ± 1.003 | 2.302 ± 0.462 | 3.837 ± 1.249 | 4.797 ± 0.445 | 4.797 ± 0.568 | 8.826 ± 3.269 | 1.151 ± 0.376 | 2.686 ± 0.504 | 0.0 ± 0.0 |
| Trp | 1.343 ± 0.495 | 0.384 ± 0.143 | 0.576 ± 0.188 | 0.576 ± 0.188 | 0.767 ± 0.585 | 0.767 ± 0.456 | 0.767 ± 0.365 | 0.959 ± 0.475 | 0.576 ± 0.461 | 2.878 ± 1.157 | 0.384 ± 0.14 | 0.192 ± 0.124 | 0.959 ± 0.284 | 0.767 ± 0.285 | 1.151 ± 0.381 | 1.343 ± 0.586 | 0.767 ± 0.498 | 2.302 ± 0.756 | 0.384 ± 0.323 | 0.384 ± 0.143 | 0.0 ± 0.0 |
| Tyr | 2.494 ± 0.987 | 0.959 ± 0.254 | 1.535 ± 0.762 | 1.151 ± 0.628 | 1.535 ± 0.394 | 1.343 ± 0.402 | 1.151 ± 0.683 | 2.302 ± 0.47 | 0.767 ± 0.285 | 2.878 ± 1.83 | 0.576 ± 0.426 | 1.151 ± 0.446 | 0.767 ± 0.224 | 1.343 ± 0.496 | 1.727 ± 0.586 | 2.302 ± 0.386 | 2.111 ± 0.464 | 2.111 ± 0.6 | 0.192 ± 0.327 | 0.384 ± 0.383 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 10 proteins (5213 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
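A protein-level bootstrap of this kind could look roughly like the sketch below. The page states only that the error comes from bootstrapping (x100) at the protein level; resampling whole proteins with replacement, recomputing the per mille frequency for each replicate, and reporting the sample standard deviation are assumptions made here for illustration, as are the names `pair_counts` and `bootstrap_error` and the toy sequences.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumed procedure: draw len(proteins) proteins with replacement,
    recompute the frequency on that sample, repeat n_boot times, and
    return the sample standard deviation of the replicate frequencies.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input for illustration only, not real Arteriviridae sequences.
toy = ["MALVAL", "AAL", "GGGG"]
print(bootstrap_error(toy, "AL"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is one plausible reading of "at the protein level".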

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski