Amino acid dipeptide frequency for Sanxia Water Strider Virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red, and underrepresented ones are marked in blue in the table.
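The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function names are hypothetical, dipeptides are counted as overlapping adjacent pairs within each protein, any letter outside the 20 standard one-letter codes is mapped to X (Xaa), and frequencies are normalised by the total number of pairs. The normalisation used for the published values may differ slightly.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption)
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(sequences):
    """Return the frequency (per mille) of all 441 dipeptides.

    Pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map every non-standard letter to X before pairing.
        residues = [c if c in STANDARD else "X" for c in seq.upper()]
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"
```

For example, `classify(14.3)` returns "over" and `classify(0.255)` returns "under", which corresponds to the red and blue marking described above if the uniform 2.5‰ value is taken as the reference.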

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
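As a worked example of the unit conversion, take the LeuLeu value of 14.3‰ from the table and compare it with the uniform expectation of 1/400. The ratio in the last line is only an illustration of how far a value lies from that simple reference.

```latex
\begin{align*}
f_{\mathrm{LeuLeu}} &= 14.3\,\text{\textperthousand} = 14.3 \times 10^{-3} = 0.0143 = 1.43\,\% \\
f_{\mathrm{uniform}} &= \frac{1}{400} = 0.0025 = 2.5\,\text{\textperthousand} = 0.25\,\% \\
\frac{f_{\mathrm{LeuLeu}}}{f_{\mathrm{uniform}}} &= \frac{14.3}{2.5} = 5.72
\end{align*}
```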

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.32 ± 2.052 | 0.766 ± 0.571 | 2.298 ± 0.386 | 3.32 ± 0.927 | 1.021 ± 0.287 | 2.043 ± 1.136 | 1.021 ± 0.417 | 2.298 ± 0.546 | 1.788 ± 1.037 | 4.341 ± 0.465 | 1.277 ± 0.482 | 1.788 ± 1.114 | 2.043 ± 0.346 | 2.554 ± 1.038 | 2.809 ± 1.508 | 5.107 ± 0.471 | 3.32 ± 0.996 | 1.788 ± 0.64 | 0.511 ± 0.48 | 1.532 ± 0.731 | 0.0 ± 0.0 |
| Cys | 0.255 ± 0.386 | 0.255 ± 0.152 | 1.021 ± 0.608 | 0.0 ± 0.0 | 1.788 ± 0.446 | 1.532 ± 0.665 | 0.0 ± 0.0 | 1.532 ± 0.436 | 1.021 ± 0.608 | 1.788 ± 1.037 | 0.255 ± 0.152 | 0.511 ± 0.244 | 0.766 ± 0.699 | 0.766 ± 0.456 | 1.277 ± 0.474 | 1.021 ± 0.731 | 0.511 ± 0.244 | 0.511 ± 0.244 | 0.255 ± 0.152 | 0.511 ± 0.304 | 0.0 ± 0.0 |
| Asp | 2.043 ± 0.853 | 0.766 ± 0.456 | 1.277 ± 0.524 | 1.532 ± 0.718 | 2.809 ± 0.781 | 1.788 ± 0.725 | 2.043 ± 0.54 | 1.788 ± 0.261 | 1.277 ± 0.662 | 7.406 ± 1.534 | 1.021 ± 0.729 | 3.064 ± 1.076 | 4.852 ± 0.842 | 2.298 ± 0.569 | 0.766 ± 0.346 | 3.83 ± 1.06 | 1.788 ± 0.261 | 1.532 ± 0.408 | 1.021 ± 0.359 | 1.788 ± 0.725 | 0.0 ± 0.0 |
| Glu | 1.532 ± 0.345 | 1.788 ± 0.456 | 2.809 ± 0.917 | 3.575 ± 0.83 | 2.554 ± 0.667 | 4.341 ± 1.478 | 0.511 ± 0.304 | 3.064 ± 0.89 | 4.341 ± 0.641 | 7.661 ± 1.489 | 1.788 ± 0.751 | 2.809 ± 0.596 | 1.021 ± 0.731 | 1.532 ± 0.296 | 2.043 ± 0.731 | 3.575 ± 1.155 | 3.064 ± 0.723 | 3.32 ± 1.836 | 0.766 ± 0.738 | 2.298 ± 0.652 | 0.0 ± 0.0 |
| Phe | 0.766 ± 0.534 | 0.766 ± 0.267 | 2.043 ± 0.718 | 2.809 ± 1.009 | 2.554 ± 0.863 | 2.554 ± 0.898 | 0.511 ± 0.244 | 2.298 ± 0.761 | 3.32 ± 1.008 | 5.618 ± 1.02 | 1.021 ± 0.617 | 2.554 ± 0.567 | 3.32 ± 0.776 | 1.277 ± 0.524 | 2.043 ± 0.346 | 6.384 ± 1.363 | 3.32 ± 0.692 | 2.043 ± 0.601 | 0.511 ± 0.244 | 1.532 ± 0.462 | 0.0 ± 0.0 |
| Gly | 2.809 ± 1.369 | 0.255 ± 0.402 | 2.043 ± 0.583 | 2.809 ± 0.741 | 2.809 ± 0.621 | 1.532 ± 0.345 | 2.043 ± 0.951 | 2.809 ± 1.012 | 1.788 ± 0.51 | 4.852 ± 1.521 | 1.021 ± 0.699 | 1.788 ± 0.261 | 1.277 ± 0.279 | 1.021 ± 0.699 | 3.064 ± 1.231 | 4.597 ± 0.937 | 2.298 ± 1.078 | 2.554 ± 0.379 | 1.277 ± 0.482 | 2.043 ± 0.718 | 0.0 ± 0.0 |
| His | 0.511 ± 0.244 | 0.766 ± 0.405 | 1.277 ± 0.279 | 0.511 ± 0.244 | 1.788 ± 0.76 | 1.021 ± 0.359 | 0.511 ± 0.35 | 1.788 ± 0.538 | 0.766 ± 0.894 | 4.341 ± 1.399 | 0.766 ± 0.346 | 2.554 ± 0.544 | 2.298 ± 1.053 | 0.766 ± 0.405 | 1.021 ± 0.359 | 4.341 ± 0.203 | 1.277 ± 0.575 | 1.021 ± 0.62 | 0.511 ± 0.35 | 1.021 ± 0.489 | 0.0 ± 0.0 |
| Ile | 4.341 ± 1.343 | 1.532 ± 0.345 | 4.086 ± 0.858 | 3.32 ± 0.596 | 3.32 ± 0.744 | 3.575 ± 1.47 | 2.298 ± 0.757 | 5.618 ± 1.644 | 4.852 ± 1.205 | 7.15 ± 0.328 | 1.277 ± 0.547 | 3.575 ± 1.09 | 4.086 ± 1.065 | 3.32 ± 0.475 | 1.532 ± 0.403 | 7.15 ± 1.087 | 5.107 ± 0.983 | 2.298 ± 0.574 | 1.277 ± 0.706 | 1.532 ± 0.618 | 0.0 ± 0.0 |
| Lys | 2.298 ± 1.377 | 0.255 ± 0.152 | 1.788 ± 0.454 | 4.597 ± 1.664 | 2.809 ± 0.997 | 2.554 ± 0.979 | 1.277 ± 0.528 | 5.107 ± 1.037 | 4.341 ± 0.85 | 4.852 ± 1.852 | 1.788 ± 0.369 | 2.554 ± 0.798 | 2.554 ± 1.048 | 1.021 ± 0.731 | 3.575 ± 1.034 | 5.363 ± 1.502 | 5.107 ± 0.589 | 3.83 ± 1.06 | 1.532 ± 0.462 | 2.298 ± 0.751 | 0.0 ± 0.0 |
| Leu | 4.852 ± 0.943 | 1.788 ± 0.76 | 5.873 ± 0.651 | 7.916 ± 1.079 | 4.852 ± 0.82 | 4.852 ± 0.798 | 4.597 ± 0.933 | 7.916 ± 0.806 | 8.938 ± 2.809 | 14.3 ± 2.806 | 4.852 ± 0.749 | 5.618 ± 0.905 | 4.597 ± 0.679 | 5.618 ± 1.227 | 5.618 ± 0.456 | 13.023 ± 1.297 | 7.15 ± 1.038 | 2.809 ± 1.372 | 1.532 ± 0.718 | 4.086 ± 0.732 | 0.0 ± 0.0 |
| Met | 1.532 ± 0.296 | 0.0 ± 0.0 | 1.532 ± 1.675 | 0.766 ± 0.511 | 1.788 ± 0.781 | 0.511 ± 0.304 | 0.255 ± 0.307 | 2.043 ± 0.925 | 2.554 ± 1.038 | 3.064 ± 0.418 | 0.255 ± 0.402 | 2.809 ± 0.672 | 0.0 ± 0.0 | 1.532 ± 0.441 | 1.788 ± 0.76 | 2.043 ± 1.512 | 2.298 ± 0.652 | 0.766 ± 0.456 | 0.255 ± 0.152 | 0.255 ± 0.152 | 0.0 ± 0.0 |
| Asn | 1.788 ± 0.528 | 0.766 ± 0.841 | 0.766 ± 0.366 | 1.788 ± 0.786 | 2.809 ± 0.306 | 1.277 ± 0.73 | 2.554 ± 0.736 | 4.086 ± 1.392 | 2.298 ± 0.796 | 8.172 ± 0.785 | 1.532 ± 0.983 | 1.532 ± 1.107 | 3.575 ± 0.63 | 2.809 ± 1.067 | 1.788 ± 0.382 | 3.83 ± 1.135 | 2.809 ± 0.306 | 2.043 ± 0.853 | 0.511 ± 0.304 | 2.809 ± 0.759 | 0.0 ± 0.0 |
| Pro | 1.788 ± 0.512 | 0.766 ± 0.534 | 4.086 ± 1.438 | 5.107 ± 1.017 | 2.043 ± 0.906 | 1.788 ± 0.454 | 2.809 ± 0.847 | 4.341 ± 1.478 | 2.298 ± 1.32 | 5.873 ± 0.741 | 0.511 ± 0.304 | 2.809 ± 0.546 | 2.298 ± 1.085 | 1.021 ± 0.667 | 1.021 ± 0.417 | 5.618 ± 1.022 | 3.32 ± 1.387 | 3.064 ± 0.757 | 0.511 ± 0.615 | 1.277 ± 0.528 | 0.0 ± 0.0 |
| Gln | 1.021 ± 0.655 | 0.0 ± 0.0 | 2.043 ± 0.811 | 1.788 ± 0.784 | 1.788 ± 0.781 | 1.788 ± 0.382 | 0.766 ± 0.456 | 2.554 ± 0.71 | 1.277 ± 0.759 | 4.597 ± 1.521 | 1.277 ± 0.918 | 1.788 ± 0.512 | 3.32 ± 2.021 | 1.788 ± 0.638 | 2.043 ± 0.623 | 3.064 ± 0.481 | 2.298 ± 1.404 | 2.809 ± 0.661 | 0.0 ± 0.0 | 1.788 ± 0.261 | 0.0 ± 0.0 |
| Arg | 2.043 ± 0.595 | 1.021 ± 0.426 | 1.532 ± 0.618 | 3.064 ± 1.004 | 1.532 ± 0.856 | 1.788 ± 1.164 | 0.766 ± 0.456 | 2.298 ± 0.761 | 2.554 ± 1.938 | 6.129 ± 1.831 | 1.021 ± 0.359 | 1.277 ± 0.53 | 1.788 ± 0.418 | 1.021 ± 0.446 | 1.532 ± 0.733 | 4.086 ± 1.807 | 3.064 ± 0.682 | 3.32 ± 0.857 | 0.766 ± 0.359 | 1.277 ± 0.58 | 0.0 ± 0.0 |
| Ser | 4.852 ± 2.246 | 2.298 ± 0.315 | 3.064 ± 0.833 | 4.086 ± 0.858 | 4.086 ± 1.136 | 4.852 ± 0.897 | 2.298 ± 0.315 | 8.172 ± 1.69 | 6.129 ± 1.707 | 13.534 ± 1.173 | 1.788 ± 0.587 | 3.575 ± 0.664 | 4.086 ± 0.3 | 5.107 ± 0.627 | 4.086 ± 0.445 | 11.491 ± 1.391 | 5.363 ± 1.507 | 4.341 ± 1.108 | 2.043 ± 0.637 | 4.597 ± 1.986 | 0.0 ± 0.0 |
| Thr | 4.341 ± 1.403 | 0.255 ± 0.402 | 2.043 ± 0.427 | 3.064 ± 0.888 | 1.532 ± 0.436 | 2.554 ± 0.706 | 2.043 ± 0.874 | 6.639 ± 1.832 | 2.554 ± 0.862 | 6.639 ± 0.637 | 1.532 ± 0.345 | 3.83 ± 0.721 | 5.107 ± 0.778 | 2.043 ± 1.067 | 2.298 ± 1.347 | 5.873 ± 1.487 | 4.852 ± 1.585 | 3.575 ± 1.382 | 1.277 ± 0.396 | 1.788 ± 0.546 | 0.0 ± 0.0 |
| Val | 1.532 ± 1.049 | 0.766 ± 0.405 | 3.32 ± 0.237 | 1.788 ± 1.025 | 2.298 ± 1.067 | 2.298 ± 1.036 | 0.511 ± 0.668 | 3.064 ± 0.78 | 3.83 ± 1.129 | 4.597 ± 1.12 | 1.277 ± 0.396 | 2.554 ± 0.784 | 2.809 ± 0.654 | 0.511 ± 0.654 | 1.788 ± 0.551 | 5.363 ± 1.19 | 3.064 ± 0.895 | 2.554 ± 0.596 | 1.277 ± 0.772 | 1.021 ± 0.446 | 0.0 ± 0.0 |
| Trp | 0.766 ± 0.456 | 0.255 ± 0.307 | 0.255 ± 0.152 | 0.766 ± 0.359 | 0.766 ± 0.405 | 1.021 ± 0.489 | 0.511 ± 0.35 | 0.766 ± 0.267 | 1.532 ± 0.653 | 1.788 ± 1.24 | 0.255 ± 0.152 | 0.766 ± 0.738 | 0.511 ± 0.333 | 0.255 ± 0.152 | 0.766 ± 0.456 | 2.554 ± 0.565 | 1.532 ± 0.656 | 0.255 ± 0.386 | 0.0 ± 0.0 | 0.511 ± 0.35 | 0.0 ± 0.0 |
| Tyr | 2.554 ± 0.593 | 0.511 ± 0.304 | 1.532 ± 1.184 | 1.788 ± 0.995 | 2.298 ± 0.652 | 1.021 ± 0.287 | 1.532 ± 0.911 | 3.064 ± 1.5 | 2.043 ± 0.448 | 4.086 ± 1.19 | 1.277 ± 0.662 | 1.788 ± 0.508 | 2.043 ± 1.115 | 1.532 ± 0.57 | 1.021 ± 0.437 | 1.788 ± 1.014 | 2.298 ± 0.798 | 1.788 ± 0.355 | 0.0 ± 0.0 | 0.766 ± 0.267 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 5 proteins (3917 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
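Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. The sketch below illustrates the idea for a single dipeptide. It makes several assumptions that the page does not confirm: the function names are hypothetical, pairs are counted as overlapping within a protein, and the reported error is taken to be the sample standard deviation across the resampled estimates.

```python
import random
import statistics


def frequency_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        # Overlapping adjacent pairs; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the estimate when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(sample, dipeptide))
    # Assumption: the error is the sample standard deviation.
    return statistics.stdev(estimates)
```

With only 5 proteins in this proteome, each resample can differ considerably from the original set, which may explain why some errors in the table are large relative to the values themselves.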

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski