Amino acid dipepetide frequency for Shuangao chryso-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.059AlaAla: 3.059 ± 0.731
2.039AlaCys: 2.039 ± 0.592
2.549AlaAsp: 2.549 ± 0.464
5.098AlaGlu: 5.098 ± 0.742
2.294AlaPhe: 2.294 ± 0.61
5.863AlaGly: 5.863 ± 0.756
1.784AlaHis: 1.784 ± 0.513
1.02AlaIle: 1.02 ± 0.372
5.098AlaLys: 5.098 ± 0.665
4.588AlaLeu: 4.588 ± 0.761
2.294AlaMet: 2.294 ± 0.431
2.294AlaAsn: 2.294 ± 0.78
3.824AlaPro: 3.824 ± 0.766
1.529AlaGln: 1.529 ± 0.401
4.843AlaArg: 4.843 ± 1.424
5.098AlaSer: 5.098 ± 0.473
1.784AlaThr: 1.784 ± 0.606
4.843AlaVal: 4.843 ± 0.582
0.255AlaTrp: 0.255 ± 0.216
2.804AlaTyr: 2.804 ± 1.356
0.0AlaXaa: 0.0 ± 0.0
Cys
1.02CysAla: 1.02 ± 0.372
0.765CysCys: 0.765 ± 0.202
1.784CysAsp: 1.784 ± 0.308
0.765CysGlu: 0.765 ± 0.42
0.51CysPhe: 0.51 ± 0.444
2.549CysGly: 2.549 ± 0.763
0.51CysHis: 0.51 ± 0.248
0.765CysIle: 0.765 ± 0.42
1.02CysLys: 1.02 ± 0.495
1.275CysLeu: 1.275 ± 0.232
0.765CysMet: 0.765 ± 0.589
0.51CysAsn: 0.51 ± 0.496
1.275CysPro: 1.275 ± 0.439
0.51CysGln: 0.51 ± 0.257
1.275CysArg: 1.275 ± 0.48
2.039CysSer: 2.039 ± 0.47
0.51CysThr: 0.51 ± 0.277
1.275CysVal: 1.275 ± 0.386
0.765CysTrp: 0.765 ± 0.265
1.529CysTyr: 1.529 ± 0.417
0.0CysXaa: 0.0 ± 0.0
Asp
2.294AspAla: 2.294 ± 0.584
2.294AspCys: 2.294 ± 1.012
3.824AspAsp: 3.824 ± 1.19
5.863AspGlu: 5.863 ± 0.499
1.275AspPhe: 1.275 ± 0.827
3.314AspGly: 3.314 ± 0.862
1.275AspHis: 1.275 ± 0.263
3.569AspIle: 3.569 ± 0.348
4.079AspLys: 4.079 ± 0.214
5.863AspLeu: 5.863 ± 0.775
1.275AspMet: 1.275 ± 0.223
2.804AspAsn: 2.804 ± 0.845
3.314AspPro: 3.314 ± 0.656
0.51AspGln: 0.51 ± 0.266
3.059AspArg: 3.059 ± 0.937
3.314AspSer: 3.314 ± 0.751
2.039AspThr: 2.039 ± 1.045
5.863AspVal: 5.863 ± 0.824
0.765AspTrp: 0.765 ± 0.202
2.294AspTyr: 2.294 ± 0.138
0.0AspXaa: 0.0 ± 0.0
Glu
4.588GluAla: 4.588 ± 0.757
1.275GluCys: 1.275 ± 0.508
3.059GluAsp: 3.059 ± 0.618
3.059GluGlu: 3.059 ± 0.677
2.549GluPhe: 2.549 ± 1.162
3.824GluGly: 3.824 ± 1.058
0.0GluHis: 0.0 ± 0.0
3.314GluIle: 3.314 ± 0.718
3.059GluLys: 3.059 ± 1.012
5.863GluLeu: 5.863 ± 0.602
2.039GluMet: 2.039 ± 0.553
1.784GluAsn: 1.784 ± 0.391
1.529GluPro: 1.529 ± 0.31
2.039GluGln: 2.039 ± 0.089
2.549GluArg: 2.549 ± 0.559
4.843GluSer: 4.843 ± 1.753
2.294GluThr: 2.294 ± 0.584
5.608GluVal: 5.608 ± 0.577
1.02GluTrp: 1.02 ± 0.372
3.059GluTyr: 3.059 ± 0.612
0.0GluXaa: 0.0 ± 0.0
Phe
1.784PheAla: 1.784 ± 0.384
0.0PheCys: 0.0 ± 0.0
3.824PheAsp: 3.824 ± 0.321
2.294PheGlu: 2.294 ± 0.482
3.059PhePhe: 3.059 ± 0.541
3.314PheGly: 3.314 ± 0.751
0.0PheHis: 0.0 ± 0.0
0.765PheIle: 0.765 ± 0.4
2.294PheLys: 2.294 ± 0.804
2.294PheLeu: 2.294 ± 0.85
1.02PheMet: 1.02 ± 0.384
1.784PheAsn: 1.784 ± 0.755
1.275PhePro: 1.275 ± 0.232
0.255PheGln: 0.255 ± 0.226
1.275PheArg: 1.275 ± 0.561
4.588PheSer: 4.588 ± 0.661
1.529PheThr: 1.529 ± 0.233
1.275PheVal: 1.275 ± 0.397
0.765PheTrp: 0.765 ± 0.467
0.765PheTyr: 0.765 ± 0.414
0.0PheXaa: 0.0 ± 0.0
Gly
5.353GlyAla: 5.353 ± 1.079
0.51GlyCys: 0.51 ± 0.444
4.843GlyAsp: 4.843 ± 0.248
4.588GlyGlu: 4.588 ± 1.347
3.569GlyPhe: 3.569 ± 0.543
5.608GlyGly: 5.608 ± 1.818
1.529GlyHis: 1.529 ± 0.852
4.588GlyIle: 4.588 ± 0.509
3.824GlyLys: 3.824 ± 0.915
5.863GlyLeu: 5.863 ± 1.393
2.549GlyMet: 2.549 ± 0.386
0.765GlyAsn: 0.765 ± 0.265
2.549GlyPro: 2.549 ± 0.718
2.549GlyGln: 2.549 ± 0.464
1.529GlyArg: 1.529 ± 0.743
5.353GlySer: 5.353 ± 0.793
3.824GlyThr: 3.824 ± 0.767
4.843GlyVal: 4.843 ± 0.622
0.255GlyTrp: 0.255 ± 0.222
3.314GlyTyr: 3.314 ± 0.971
0.0GlyXaa: 0.0 ± 0.0
His
1.02HisAla: 1.02 ± 0.385
0.51HisCys: 0.51 ± 0.248
2.804HisAsp: 2.804 ± 0.897
0.255HisGlu: 0.255 ± 0.222
1.529HisPhe: 1.529 ± 0.239
1.784HisGly: 1.784 ± 0.567
0.765HisHis: 0.765 ± 0.429
1.275HisIle: 1.275 ± 0.664
1.529HisLys: 1.529 ± 0.401
2.294HisLeu: 2.294 ± 0.607
1.02HisMet: 1.02 ± 0.415
1.275HisAsn: 1.275 ± 0.444
0.51HisPro: 0.51 ± 0.286
0.255HisGln: 0.255 ± 0.216
1.02HisArg: 1.02 ± 0.321
1.529HisSer: 1.529 ± 0.401
0.765HisThr: 0.765 ± 0.453
1.529HisVal: 1.529 ± 0.769
0.0HisTrp: 0.0 ± 0.0
0.51HisTyr: 0.51 ± 0.248
0.0HisXaa: 0.0 ± 0.0
Ile
3.059IleAla: 3.059 ± 0.861
0.255IleCys: 0.255 ± 0.248
4.079IleAsp: 4.079 ± 1.546
1.529IleGlu: 1.529 ± 0.58
0.51IlePhe: 0.51 ± 0.277
4.333IleGly: 4.333 ± 0.983
2.039IleHis: 2.039 ± 0.489
2.039IleIle: 2.039 ± 0.663
2.549IleLys: 2.549 ± 0.596
3.314IleLeu: 3.314 ± 0.556
1.529IleMet: 1.529 ± 0.607
3.824IleAsn: 3.824 ± 0.784
2.549IlePro: 2.549 ± 0.206
1.02IleGln: 1.02 ± 0.372
3.824IleArg: 3.824 ± 1.455
5.863IleSer: 5.863 ± 0.738
1.784IleThr: 1.784 ± 0.412
4.333IleVal: 4.333 ± 1.011
0.255IleTrp: 0.255 ± 0.222
2.039IleTyr: 2.039 ± 0.662
0.0IleXaa: 0.0 ± 0.0
Lys
2.804LysAla: 2.804 ± 0.357
1.529LysCys: 1.529 ± 0.598
3.314LysAsp: 3.314 ± 0.556
3.824LysGlu: 3.824 ± 0.835
2.549LysPhe: 2.549 ± 0.487
3.314LysGly: 3.314 ± 1.046
1.275LysHis: 1.275 ± 0.664
3.569LysIle: 3.569 ± 0.421
2.549LysLys: 2.549 ± 1.052
4.843LysLeu: 4.843 ± 0.417
2.549LysMet: 2.549 ± 0.723
2.549LysAsn: 2.549 ± 0.911
2.294LysPro: 2.294 ± 0.735
1.784LysGln: 1.784 ± 0.701
2.039LysArg: 2.039 ± 0.29
3.824LysSer: 3.824 ± 0.459
4.079LysThr: 4.079 ± 0.448
4.843LysVal: 4.843 ± 0.265
0.51LysTrp: 0.51 ± 0.257
4.079LysTyr: 4.079 ± 1.301
0.0LysXaa: 0.0 ± 0.0
Leu
5.608LeuAla: 5.608 ± 0.623
1.275LeuCys: 1.275 ± 0.263
6.373LeuAsp: 6.373 ± 2.475
5.353LeuGlu: 5.353 ± 0.743
2.804LeuPhe: 2.804 ± 0.442
5.608LeuGly: 5.608 ± 1.028
1.784LeuHis: 1.784 ± 0.952
3.569LeuIle: 3.569 ± 0.243
5.098LeuLys: 5.098 ± 0.544
7.647LeuLeu: 7.647 ± 0.551
4.588LeuMet: 4.588 ± 0.24
4.079LeuAsn: 4.079 ± 1.279
2.549LeuPro: 2.549 ± 0.501
2.804LeuGln: 2.804 ± 1.035
5.863LeuArg: 5.863 ± 0.775
7.902LeuSer: 7.902 ± 0.872
2.294LeuThr: 2.294 ± 0.583
8.157LeuVal: 8.157 ± 0.837
1.275LeuTrp: 1.275 ± 0.255
3.569LeuTyr: 3.569 ± 0.482
0.0LeuXaa: 0.0 ± 0.0
Met
2.549MetAla: 2.549 ± 0.911
0.765MetCys: 0.765 ± 0.416
1.529MetAsp: 1.529 ± 0.319
1.784MetGlu: 1.784 ± 0.721
0.255MetPhe: 0.255 ± 0.222
2.549MetGly: 2.549 ± 1.074
0.255MetHis: 0.255 ± 0.226
1.529MetIle: 1.529 ± 0.658
1.784MetLys: 1.784 ± 0.412
3.569MetLeu: 3.569 ± 1.302
0.51MetMet: 0.51 ± 0.249
0.765MetAsn: 0.765 ± 0.42
1.784MetPro: 1.784 ± 0.21
2.039MetGln: 2.039 ± 0.567
2.039MetArg: 2.039 ± 1.244
2.804MetSer: 2.804 ± 1.173
2.294MetThr: 2.294 ± 1.053
2.804MetVal: 2.804 ± 0.508
0.255MetTrp: 0.255 ± 0.248
0.255MetTyr: 0.255 ± 0.216
0.0MetXaa: 0.0 ± 0.0
Asn
2.549AsnAla: 2.549 ± 0.97
1.275AsnCys: 1.275 ± 0.449
1.784AsnAsp: 1.784 ± 0.658
1.275AsnGlu: 1.275 ± 0.627
2.039AsnPhe: 2.039 ± 0.884
1.275AsnGly: 1.275 ± 0.664
0.765AsnHis: 0.765 ± 0.429
2.549AsnIle: 2.549 ± 0.501
1.784AsnLys: 1.784 ± 0.36
2.804AsnLeu: 2.804 ± 0.894
2.294AsnMet: 2.294 ± 0.603
3.059AsnAsn: 3.059 ± 0.661
1.275AsnPro: 1.275 ± 0.482
0.255AsnGln: 0.255 ± 0.248
4.079AsnArg: 4.079 ± 0.305
4.333AsnSer: 4.333 ± 0.58
1.784AsnThr: 1.784 ± 0.606
3.569AsnVal: 3.569 ± 1.107
1.02AsnTrp: 1.02 ± 0.371
2.039AsnTyr: 2.039 ± 0.466
0.0AsnXaa: 0.0 ± 0.0
Pro
3.059ProAla: 3.059 ± 0.555
0.765ProCys: 0.765 ± 0.42
2.549ProAsp: 2.549 ± 0.879
3.059ProGlu: 3.059 ± 0.918
0.765ProPhe: 0.765 ± 0.648
3.059ProGly: 3.059 ± 1.412
1.02ProHis: 1.02 ± 0.389
2.549ProIle: 2.549 ± 0.819
2.294ProLys: 2.294 ± 0.652
3.059ProLeu: 3.059 ± 1.09
0.51ProMet: 0.51 ± 0.257
1.784ProAsn: 1.784 ± 0.698
1.275ProPro: 1.275 ± 0.48
1.529ProGln: 1.529 ± 0.31
2.804ProArg: 2.804 ± 0.828
4.079ProSer: 4.079 ± 1.201
2.804ProThr: 2.804 ± 0.462
4.079ProVal: 4.079 ± 1.846
1.02ProTrp: 1.02 ± 0.321
1.275ProTyr: 1.275 ± 0.232
0.0ProXaa: 0.0 ± 0.0
Gln
2.294GlnAla: 2.294 ± 0.289
0.765GlnCys: 0.765 ± 0.209
1.275GlnAsp: 1.275 ± 0.758
1.784GlnGlu: 1.784 ± 0.894
0.51GlnPhe: 0.51 ± 0.266
2.294GlnGly: 2.294 ± 0.696
0.255GlnHis: 0.255 ± 0.222
1.02GlnIle: 1.02 ± 0.532
1.529GlnLys: 1.529 ± 0.888
2.549GlnLeu: 2.549 ± 0.636
1.02GlnMet: 1.02 ± 0.389
0.255GlnAsn: 0.255 ± 0.226
1.529GlnPro: 1.529 ± 0.289
1.02GlnGln: 1.02 ± 0.371
1.02GlnArg: 1.02 ± 0.321
1.529GlnSer: 1.529 ± 0.224
1.784GlnThr: 1.784 ± 0.772
2.039GlnVal: 2.039 ± 0.457
1.02GlnTrp: 1.02 ± 0.633
1.784GlnTyr: 1.784 ± 0.186
0.0GlnXaa: 0.0 ± 0.0
Arg
2.804ArgAla: 2.804 ± 1.116
1.275ArgCys: 1.275 ± 0.391
2.294ArgAsp: 2.294 ± 0.825
3.059ArgGlu: 3.059 ± 1.133
2.039ArgPhe: 2.039 ± 0.391
3.059ArgGly: 3.059 ± 0.722
1.784ArgHis: 1.784 ± 0.36
2.294ArgIle: 2.294 ± 0.66
3.569ArgLys: 3.569 ± 1.323
4.843ArgLeu: 4.843 ± 1.077
1.529ArgMet: 1.529 ± 0.538
2.294ArgAsn: 2.294 ± 0.583
1.275ArgPro: 1.275 ± 0.255
2.039ArgGln: 2.039 ± 0.391
2.804ArgArg: 2.804 ± 0.609
5.098ArgSer: 5.098 ± 0.974
2.804ArgThr: 2.804 ± 0.772
6.628ArgVal: 6.628 ± 1.046
0.51ArgTrp: 0.51 ± 0.248
2.294ArgTyr: 2.294 ± 0.66
0.0ArgXaa: 0.0 ± 0.0
Ser
4.843SerAla: 4.843 ± 1.414
0.765SerCys: 0.765 ± 0.202
4.333SerAsp: 4.333 ± 0.709
3.824SerGlu: 3.824 ± 0.889
3.824SerPhe: 3.824 ± 1.216
6.118SerGly: 6.118 ± 1.282
2.549SerHis: 2.549 ± 0.826
4.843SerIle: 4.843 ± 1.147
4.588SerLys: 4.588 ± 0.489
8.157SerLeu: 8.157 ± 1.029
1.784SerMet: 1.784 ± 0.36
3.824SerAsn: 3.824 ± 0.799
4.333SerPro: 4.333 ± 0.519
2.039SerGln: 2.039 ± 0.954
4.079SerArg: 4.079 ± 1.198
6.628SerSer: 6.628 ± 1.317
6.882SerThr: 6.882 ± 1.488
6.882SerVal: 6.882 ± 0.241
1.529SerTrp: 1.529 ± 0.233
3.314SerTyr: 3.314 ± 0.746
0.0SerXaa: 0.0 ± 0.0
Thr
3.059ThrAla: 3.059 ± 0.916
1.529ThrCys: 1.529 ± 0.628
0.765ThrAsp: 0.765 ± 0.467
2.804ThrGlu: 2.804 ± 0.944
1.529ThrPhe: 1.529 ± 0.588
2.804ThrGly: 2.804 ± 0.765
1.275ThrHis: 1.275 ± 0.758
3.569ThrIle: 3.569 ± 1.219
4.079ThrLys: 4.079 ± 1.201
4.843ThrLeu: 4.843 ± 0.959
1.02ThrMet: 1.02 ± 0.044
1.784ThrAsn: 1.784 ± 0.631
2.549ThrPro: 2.549 ± 1.145
1.02ThrGln: 1.02 ± 0.39
2.804ThrArg: 2.804 ± 0.504
4.588ThrSer: 4.588 ± 1.015
4.079ThrThr: 4.079 ± 1.663
4.588ThrVal: 4.588 ± 0.364
1.02ThrTrp: 1.02 ± 0.044
2.549ThrTyr: 2.549 ± 0.487
0.0ThrXaa: 0.0 ± 0.0
Val
5.098ValAla: 5.098 ± 1.069
1.784ValCys: 1.784 ± 0.606
5.863ValAsp: 5.863 ± 0.532
4.588ValGlu: 4.588 ± 1.243
1.529ValPhe: 1.529 ± 0.233
4.333ValGly: 4.333 ± 1.18
1.529ValHis: 1.529 ± 0.509
5.098ValIle: 5.098 ± 1.212
5.098ValLys: 5.098 ± 0.8
7.647ValLeu: 7.647 ± 0.639
2.294ValMet: 2.294 ± 0.786
3.314ValAsn: 3.314 ± 0.31
5.098ValPro: 5.098 ± 1.019
2.804ValGln: 2.804 ± 0.819
4.333ValArg: 4.333 ± 0.394
7.392ValSer: 7.392 ± 1.628
5.863ValThr: 5.863 ± 0.836
7.392ValVal: 7.392 ± 1.111
1.02ValTrp: 1.02 ± 0.372
3.059ValTyr: 3.059 ± 1.101
0.0ValXaa: 0.0 ± 0.0
Trp
1.275TrpAla: 1.275 ± 0.846
0.765TrpCys: 0.765 ± 0.4
0.0TrpAsp: 0.0 ± 0.0
1.02TrpGlu: 1.02 ± 0.432
0.255TrpPhe: 0.255 ± 0.222
0.51TrpGly: 0.51 ± 0.248
0.51TrpHis: 0.51 ± 0.277
1.02TrpIle: 1.02 ± 0.331
0.0TrpLys: 0.0 ± 0.0
1.784TrpLeu: 1.784 ± 0.241
0.255TrpMet: 0.255 ± 0.248
0.51TrpAsn: 0.51 ± 0.266
0.51TrpPro: 0.51 ± 0.286
0.255TrpGln: 0.255 ± 0.216
1.02TrpArg: 1.02 ± 0.372
0.765TrpSer: 0.765 ± 0.486
0.765TrpThr: 0.765 ± 0.202
1.275TrpVal: 1.275 ± 0.663
0.255TrpTrp: 0.255 ± 0.222
0.765TrpTyr: 0.765 ± 0.414
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.843TyrAla: 4.843 ± 0.554
1.275TyrCys: 1.275 ± 0.482
2.039TyrAsp: 2.039 ± 0.805
1.529TyrGlu: 1.529 ± 0.598
0.765TyrPhe: 0.765 ± 0.429
2.294TyrGly: 2.294 ± 0.591
1.275TyrHis: 1.275 ± 0.613
1.784TyrIle: 1.784 ± 0.494
2.549TyrLys: 2.549 ± 0.317
5.098TyrLeu: 5.098 ± 1.118
1.02TyrMet: 1.02 ± 0.044
2.549TyrAsn: 2.549 ± 0.717
2.039TyrPro: 2.039 ± 0.688
1.02TyrGln: 1.02 ± 0.331
2.039TyrArg: 2.039 ± 0.898
3.569TyrSer: 3.569 ± 0.783
2.294TyrThr: 2.294 ± 0.623
3.314TyrVal: 3.314 ± 0.648
0.0TyrTrp: 0.0 ± 0.0
2.039TyrTyr: 2.039 ± 0.489
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3924 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski