Amino acid dipepetide frequency for Guangdong greater green snake arterivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.728AlaAla: 5.728 ± 1.537
4.513AlaCys: 4.513 ± 1.157
2.951AlaAsp: 2.951 ± 0.848
2.257AlaGlu: 2.257 ± 0.535
2.777AlaPhe: 2.777 ± 0.952
3.124AlaGly: 3.124 ± 1.096
1.736AlaHis: 1.736 ± 2.358
3.819AlaIle: 3.819 ± 0.602
4.166AlaLys: 4.166 ± 1.731
6.422AlaLeu: 6.422 ± 0.939
1.736AlaMet: 1.736 ± 0.807
2.777AlaAsn: 2.777 ± 1.162
2.777AlaPro: 2.777 ± 1.168
2.43AlaGln: 2.43 ± 0.538
2.43AlaArg: 2.43 ± 1.168
5.728AlaSer: 5.728 ± 0.547
3.298AlaThr: 3.298 ± 1.355
4.687AlaVal: 4.687 ± 1.699
1.389AlaTrp: 1.389 ± 0.456
3.819AlaTyr: 3.819 ± 0.752
0.0AlaXaa: 0.0 ± 0.0
Cys
1.909CysAla: 1.909 ± 1.436
1.041CysCys: 1.041 ± 0.51
2.083CysAsp: 2.083 ± 0.83
0.868CysGlu: 0.868 ± 0.574
2.257CysPhe: 2.257 ± 1.02
1.909CysGly: 1.909 ± 1.281
1.041CysHis: 1.041 ± 0.664
1.215CysIle: 1.215 ± 1.06
1.736CysLys: 1.736 ± 0.956
2.43CysLeu: 2.43 ± 0.615
0.694CysMet: 0.694 ± 0.631
0.694CysAsn: 0.694 ± 0.363
2.43CysPro: 2.43 ± 0.796
0.347CysGln: 0.347 ± 0.765
1.215CysArg: 1.215 ± 0.398
4.34CysSer: 4.34 ± 1.509
3.124CysThr: 3.124 ± 0.621
2.43CysVal: 2.43 ± 0.755
0.521CysTrp: 0.521 ± 0.272
0.694CysTyr: 0.694 ± 0.363
0.0CysXaa: 0.0 ± 0.0
Asp
3.645AspAla: 3.645 ± 0.572
1.389AspCys: 1.389 ± 0.55
2.951AspAsp: 2.951 ± 1.033
2.951AspGlu: 2.951 ± 1.033
1.909AspPhe: 1.909 ± 0.67
2.43AspGly: 2.43 ± 0.471
1.041AspHis: 1.041 ± 1.402
2.083AspIle: 2.083 ± 0.815
2.43AspLys: 2.43 ± 0.678
6.075AspLeu: 6.075 ± 1.946
0.868AspMet: 0.868 ± 0.267
1.562AspAsn: 1.562 ± 0.672
3.472AspPro: 3.472 ± 1.069
1.909AspGln: 1.909 ± 0.999
1.909AspArg: 1.909 ± 0.565
4.166AspSer: 4.166 ± 2.177
3.992AspThr: 3.992 ± 1.543
3.819AspVal: 3.819 ± 1.072
1.562AspTrp: 1.562 ± 0.612
2.083AspTyr: 2.083 ± 1.189
0.0AspXaa: 0.0 ± 0.0
Glu
2.43GluAla: 2.43 ± 0.471
0.868GluCys: 0.868 ± 0.507
2.257GluAsp: 2.257 ± 0.909
2.604GluGlu: 2.604 ± 0.873
2.43GluPhe: 2.43 ± 0.464
1.562GluGly: 1.562 ± 0.612
1.215GluHis: 1.215 ± 0.398
2.257GluIle: 2.257 ± 0.61
2.951GluLys: 2.951 ± 0.638
2.777GluLeu: 2.777 ± 0.599
0.694GluMet: 0.694 ± 0.229
1.909GluAsn: 1.909 ± 0.461
2.43GluPro: 2.43 ± 0.678
0.868GluGln: 0.868 ± 0.267
2.083GluArg: 2.083 ± 0.83
3.645GluSer: 3.645 ± 0.835
2.083GluThr: 2.083 ± 0.687
3.298GluVal: 3.298 ± 0.914
0.694GluTrp: 0.694 ± 0.631
1.041GluTyr: 1.041 ± 0.448
0.0GluXaa: 0.0 ± 0.0
Phe
3.472PheAla: 3.472 ± 1.003
2.777PheCys: 2.777 ± 0.917
3.298PheAsp: 3.298 ± 1.048
2.604PheGlu: 2.604 ± 0.792
2.257PhePhe: 2.257 ± 0.951
3.645PheGly: 3.645 ± 1.37
1.909PheHis: 1.909 ± 0.461
2.777PheIle: 2.777 ± 0.864
2.604PheLys: 2.604 ± 0.899
4.86PheLeu: 4.86 ± 1.85
1.562PheMet: 1.562 ± 0.361
3.124PheAsn: 3.124 ± 0.725
2.777PhePro: 2.777 ± 0.599
1.909PheGln: 1.909 ± 0.999
1.215PheArg: 1.215 ± 0.635
3.992PheSer: 3.992 ± 0.986
2.257PheThr: 2.257 ± 0.535
3.645PheVal: 3.645 ± 0.769
1.215PheTrp: 1.215 ± 0.398
0.868PheTyr: 0.868 ± 0.454
0.0PheXaa: 0.0 ± 0.0
Gly
2.257GlyAla: 2.257 ± 1.412
1.736GlyCys: 1.736 ± 0.621
3.472GlyAsp: 3.472 ± 0.699
1.736GlyGlu: 1.736 ± 0.907
3.992GlyPhe: 3.992 ± 1.582
3.298GlyGly: 3.298 ± 1.714
0.868GlyHis: 0.868 ± 0.66
2.777GlyIle: 2.777 ± 0.952
2.951GlyLys: 2.951 ± 0.638
5.207GlyLeu: 5.207 ± 1.25
1.041GlyMet: 1.041 ± 0.544
2.951GlyAsn: 2.951 ± 1.493
4.34GlyPro: 4.34 ± 1.963
1.215GlyGln: 1.215 ± 0.398
2.951GlyArg: 2.951 ± 0.66
5.207GlySer: 5.207 ± 2.528
3.645GlyThr: 3.645 ± 1.185
2.257GlyVal: 2.257 ± 0.94
0.868GlyTrp: 0.868 ± 0.454
2.083GlyTyr: 2.083 ± 0.653
0.0GlyXaa: 0.0 ± 0.0
His
2.604HisAla: 2.604 ± 1.101
0.347HisCys: 0.347 ± 0.467
1.562HisAsp: 1.562 ± 1.18
1.215HisGlu: 1.215 ± 0.682
1.909HisPhe: 1.909 ± 0.375
2.43HisGly: 2.43 ± 2.291
1.389HisHis: 1.389 ± 0.456
1.736HisIle: 1.736 ± 0.981
1.909HisLys: 1.909 ± 0.461
3.124HisLeu: 3.124 ± 0.433
0.174HisMet: 0.174 ± 0.486
0.521HisAsn: 0.521 ± 0.69
1.909HisPro: 1.909 ± 0.67
0.694HisGln: 0.694 ± 0.425
1.736HisArg: 1.736 ± 0.383
1.736HisSer: 1.736 ± 1.148
1.215HisThr: 1.215 ± 0.513
1.389HisVal: 1.389 ± 0.614
0.694HisTrp: 0.694 ± 0.363
2.257HisTyr: 2.257 ± 0.716
0.0HisXaa: 0.0 ± 0.0
Ile
3.645IleAla: 3.645 ± 1.886
1.562IleCys: 1.562 ± 0.595
3.645IleAsp: 3.645 ± 0.769
1.562IleGlu: 1.562 ± 0.816
2.257IlePhe: 2.257 ± 0.6
2.604IleGly: 2.604 ± 1.258
0.868IleHis: 0.868 ± 0.432
1.041IleIle: 1.041 ± 0.641
1.909IleLys: 1.909 ± 0.734
4.513IleLeu: 4.513 ± 1.394
0.868IleMet: 0.868 ± 0.434
1.909IleAsn: 1.909 ± 1.098
3.298IlePro: 3.298 ± 1.048
2.083IleGln: 2.083 ± 0.815
1.389IleArg: 1.389 ± 0.476
5.555IleSer: 5.555 ± 0.865
3.992IleThr: 3.992 ± 0.986
3.992IleVal: 3.992 ± 0.794
1.736IleTrp: 1.736 ± 1.014
0.521IleTyr: 0.521 ± 0.224
0.0IleXaa: 0.0 ± 0.0
Lys
2.604LysAla: 2.604 ± 0.825
1.041LysCys: 1.041 ± 0.327
1.909LysAsp: 1.909 ± 0.998
1.909LysGlu: 1.909 ± 0.998
4.166LysPhe: 4.166 ± 1.63
2.257LysGly: 2.257 ± 0.907
2.257LysHis: 2.257 ± 0.723
3.124LysIle: 3.124 ± 1.116
1.909LysLys: 1.909 ± 0.728
6.422LysLeu: 6.422 ± 1.296
1.215LysMet: 1.215 ± 0.444
1.215LysAsn: 1.215 ± 0.698
3.819LysPro: 3.819 ± 0.998
2.43LysGln: 2.43 ± 0.471
2.43LysArg: 2.43 ± 0.935
3.472LysSer: 3.472 ± 1.022
3.472LysThr: 3.472 ± 1.814
3.819LysVal: 3.819 ± 1.4
0.347LysTrp: 0.347 ± 0.181
1.562LysTyr: 1.562 ± 0.411
0.0LysXaa: 0.0 ± 0.0
Leu
7.464LeuAla: 7.464 ± 1.543
2.604LeuCys: 2.604 ± 0.567
3.819LeuAsp: 3.819 ± 0.989
4.34LeuGlu: 4.34 ± 0.784
3.992LeuPhe: 3.992 ± 1.119
3.819LeuGly: 3.819 ± 0.812
3.819LeuHis: 3.819 ± 1.26
4.513LeuIle: 4.513 ± 1.432
5.381LeuLys: 5.381 ± 1.389
9.721LeuLeu: 9.721 ± 3.243
2.257LeuMet: 2.257 ± 1.739
3.645LeuAsn: 3.645 ± 1.012
7.985LeuPro: 7.985 ± 2.854
2.604LeuGln: 2.604 ± 0.668
4.687LeuArg: 4.687 ± 1.304
8.505LeuSer: 8.505 ± 1.614
6.422LeuThr: 6.422 ± 1.049
6.77LeuVal: 6.77 ± 1.219
1.041LeuTrp: 1.041 ± 1.054
3.819LeuTyr: 3.819 ± 1.421
0.0LeuXaa: 0.0 ± 0.0
Met
1.909MetAla: 1.909 ± 0.649
0.521MetCys: 0.521 ± 0.224
0.521MetAsp: 0.521 ± 0.224
1.389MetGlu: 1.389 ± 0.456
1.389MetPhe: 1.389 ± 1.013
1.041MetGly: 1.041 ± 0.664
0.868MetHis: 0.868 ± 0.65
1.562MetIle: 1.562 ± 0.361
0.521MetLys: 0.521 ± 0.224
2.083MetLeu: 2.083 ± 0.438
0.694MetMet: 0.694 ± 0.555
0.174MetAsn: 0.174 ± 0.091
0.347MetPro: 0.347 ± 0.181
0.521MetGln: 0.521 ± 0.224
0.521MetArg: 0.521 ± 0.437
1.736MetSer: 1.736 ± 0.971
2.257MetThr: 2.257 ± 0.742
1.736MetVal: 1.736 ± 0.501
0.347MetTrp: 0.347 ± 0.589
0.347MetTyr: 0.347 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
3.124AsnAla: 3.124 ± 1.19
1.215AsnCys: 1.215 ± 0.635
1.389AsnAsp: 1.389 ± 0.476
1.736AsnGlu: 1.736 ± 0.545
1.909AsnPhe: 1.909 ± 0.59
3.645AsnGly: 3.645 ± 1.12
0.521AsnHis: 0.521 ± 0.224
1.909AsnIle: 1.909 ± 1.093
1.389AsnLys: 1.389 ± 0.55
3.472AsnLeu: 3.472 ± 0.341
0.347AsnMet: 0.347 ± 0.181
2.257AsnAsn: 2.257 ± 1.159
3.645AsnPro: 3.645 ± 0.75
1.041AsnGln: 1.041 ± 0.526
2.257AsnArg: 2.257 ± 1.952
2.777AsnSer: 2.777 ± 1.292
2.604AsnThr: 2.604 ± 0.74
3.124AsnVal: 3.124 ± 0.724
1.736AsnTrp: 1.736 ± 0.383
1.389AsnTyr: 1.389 ± 1.124
0.0AsnXaa: 0.0 ± 0.0
Pro
2.604ProAla: 2.604 ± 0.754
1.562ProCys: 1.562 ± 0.468
3.472ProAsp: 3.472 ± 0.738
2.777ProGlu: 2.777 ± 0.856
2.951ProPhe: 2.951 ± 0.694
4.34ProGly: 4.34 ± 1.476
0.868ProHis: 0.868 ± 0.267
3.298ProIle: 3.298 ± 0.829
6.249ProLys: 6.249 ± 1.773
5.034ProLeu: 5.034 ± 1.878
0.694ProMet: 0.694 ± 0.716
2.777ProAsn: 2.777 ± 0.599
3.645ProPro: 3.645 ± 0.928
2.083ProGln: 2.083 ± 0.598
2.777ProArg: 2.777 ± 1.101
4.513ProSer: 4.513 ± 1.226
3.645ProThr: 3.645 ± 1.37
3.124ProVal: 3.124 ± 1.469
0.694ProTrp: 0.694 ± 0.425
2.777ProTyr: 2.777 ± 0.599
0.0ProXaa: 0.0 ± 0.0
Gln
2.43GlnAla: 2.43 ± 0.782
0.694GlnCys: 0.694 ± 0.52
0.868GlnAsp: 0.868 ± 0.267
1.389GlnGlu: 1.389 ± 0.458
1.736GlnPhe: 1.736 ± 0.939
1.909GlnGly: 1.909 ± 0.582
1.041GlnHis: 1.041 ± 0.361
1.736GlnIle: 1.736 ± 0.559
2.083GlnLys: 2.083 ± 0.914
3.298GlnLeu: 3.298 ± 1.774
0.868GlnMet: 0.868 ± 0.432
0.868GlnAsn: 0.868 ± 0.454
1.562GlnPro: 1.562 ± 0.923
1.215GlnGln: 1.215 ± 0.575
1.389GlnArg: 1.389 ± 2.914
3.472GlnSer: 3.472 ± 1.022
2.777GlnThr: 2.777 ± 0.674
1.562GlnVal: 1.562 ± 0.489
0.868GlnTrp: 0.868 ± 0.454
1.215GlnTyr: 1.215 ± 0.498
0.0GlnXaa: 0.0 ± 0.0
Arg
3.472ArgAla: 3.472 ± 0.911
1.736ArgCys: 1.736 ± 0.482
1.389ArgAsp: 1.389 ± 0.67
1.041ArgGlu: 1.041 ± 0.361
2.083ArgPhe: 2.083 ± 0.815
2.777ArgGly: 2.777 ± 1.989
2.43ArgHis: 2.43 ± 0.547
1.389ArgIle: 1.389 ± 0.476
1.909ArgLys: 1.909 ± 0.728
4.166ArgLeu: 4.166 ± 1.338
0.174ArgMet: 0.174 ± 0.091
2.257ArgAsn: 2.257 ± 1.304
1.041ArgPro: 1.041 ± 0.858
1.562ArgGln: 1.562 ± 2.237
2.083ArgArg: 2.083 ± 1.985
5.034ArgSer: 5.034 ± 2.525
2.777ArgThr: 2.777 ± 1.11
2.257ArgVal: 2.257 ± 2.148
0.694ArgTrp: 0.694 ± 0.363
1.909ArgTyr: 1.909 ± 0.594
0.0ArgXaa: 0.0 ± 0.0
Ser
6.075SerAla: 6.075 ± 1.624
2.604SerCys: 2.604 ± 0.873
5.728SerAsp: 5.728 ± 1.386
3.124SerGlu: 3.124 ± 1.347
6.249SerPhe: 6.249 ± 1.688
4.34SerGly: 4.34 ± 1.7
3.124SerHis: 3.124 ± 1.767
3.298SerIle: 3.298 ± 1.416
4.166SerLys: 4.166 ± 1.428
8.158SerLeu: 8.158 ± 1.548
1.909SerMet: 1.909 ± 1.513
3.298SerAsn: 3.298 ± 0.72
3.298SerPro: 3.298 ± 1.067
2.777SerGln: 2.777 ± 1.258
4.513SerArg: 4.513 ± 2.608
7.464SerSer: 7.464 ± 2.32
6.77SerThr: 6.77 ± 0.808
4.513SerVal: 4.513 ± 1.188
0.868SerTrp: 0.868 ± 0.267
3.472SerTyr: 3.472 ± 1.469
0.0SerXaa: 0.0 ± 0.0
Thr
5.207ThrAla: 5.207 ± 1.062
3.645ThrCys: 3.645 ± 1.397
3.124ThrAsp: 3.124 ± 0.622
1.389ThrGlu: 1.389 ± 0.337
2.604ThrPhe: 2.604 ± 0.739
4.34ThrGly: 4.34 ± 1.181
2.257ThrHis: 2.257 ± 0.742
2.43ThrIle: 2.43 ± 0.518
1.909ThrLys: 1.909 ± 0.998
6.943ThrLeu: 6.943 ± 2.569
0.694ThrMet: 0.694 ± 0.363
3.298ThrAsn: 3.298 ± 0.705
5.381ThrPro: 5.381 ± 1.609
3.298ThrGln: 3.298 ± 0.785
2.777ThrArg: 2.777 ± 1.162
5.381ThrSer: 5.381 ± 1.454
5.902ThrThr: 5.902 ± 0.975
4.687ThrVal: 4.687 ± 1.856
1.215ThrTrp: 1.215 ± 0.444
1.909ThrTyr: 1.909 ± 1.227
0.0ThrXaa: 0.0 ± 0.0
Val
4.166ValAla: 4.166 ± 0.766
0.868ValCys: 0.868 ± 0.469
4.166ValAsp: 4.166 ± 1.016
2.951ValGlu: 2.951 ± 0.705
2.777ValPhe: 2.777 ± 0.521
2.604ValGly: 2.604 ± 0.74
1.909ValHis: 1.909 ± 1.569
5.207ValIle: 5.207 ± 1.273
2.604ValLys: 2.604 ± 0.564
5.381ValLeu: 5.381 ± 0.807
2.43ValMet: 2.43 ± 1.254
3.472ValAsn: 3.472 ± 1.027
2.951ValPro: 2.951 ± 0.672
2.083ValGln: 2.083 ± 1.025
1.736ValArg: 1.736 ± 0.404
4.687ValSer: 4.687 ± 2.599
5.902ValThr: 5.902 ± 1.964
4.86ValVal: 4.86 ± 2.152
1.389ValTrp: 1.389 ± 0.642
2.604ValTyr: 2.604 ± 0.567
0.0ValXaa: 0.0 ± 0.0
Trp
1.736TrpAla: 1.736 ± 0.746
0.868TrpCys: 0.868 ± 0.267
1.562TrpAsp: 1.562 ± 0.816
0.868TrpGlu: 0.868 ± 0.432
1.041TrpPhe: 1.041 ± 0.327
0.694TrpGly: 0.694 ± 0.229
0.521TrpHis: 0.521 ± 0.224
0.694TrpIle: 0.694 ± 0.363
0.694TrpLys: 0.694 ± 0.363
3.124TrpLeu: 3.124 ± 2.299
0.868TrpMet: 0.868 ± 0.454
1.041TrpAsn: 1.041 ± 0.544
0.521TrpPro: 0.521 ± 0.224
0.694TrpGln: 0.694 ± 0.425
1.041TrpArg: 1.041 ± 0.526
1.562TrpSer: 1.562 ± 1.476
0.347TrpThr: 0.347 ± 0.6
0.521TrpVal: 0.521 ± 0.272
0.174TrpTrp: 0.174 ± 0.308
0.174TrpTyr: 0.174 ± 0.091
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.083TyrAla: 2.083 ± 0.953
1.389TyrCys: 1.389 ± 0.458
2.083TyrAsp: 2.083 ± 1.213
1.389TyrGlu: 1.389 ± 0.476
2.083TyrPhe: 2.083 ± 0.687
2.083TyrGly: 2.083 ± 0.527
1.041TyrHis: 1.041 ± 0.448
2.083TyrIle: 2.083 ± 1.213
1.909TyrLys: 1.909 ± 0.67
3.992TyrLeu: 3.992 ± 1.479
0.521TyrMet: 0.521 ± 0.224
1.736TyrAsn: 1.736 ± 0.534
2.257TyrPro: 2.257 ± 0.861
1.215TyrGln: 1.215 ± 0.491
1.041TyrArg: 1.041 ± 0.361
2.777TyrSer: 2.777 ± 0.84
1.909TyrThr: 1.909 ± 0.913
2.257TyrVal: 2.257 ± 1.012
0.694TyrTrp: 0.694 ± 0.229
1.215TyrTyr: 1.215 ± 0.444
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (5762 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski