Amino acid dipepetide frequency for Ying Kou virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.038AlaAla: 3.038 ± 1.4
0.338AlaCys: 0.338 ± 0.773
1.35AlaAsp: 1.35 ± 0.414
2.701AlaGlu: 2.701 ± 0.55
1.013AlaPhe: 1.013 ± 0.474
2.701AlaGly: 2.701 ± 0.939
1.688AlaHis: 1.688 ± 0.36
1.688AlaIle: 1.688 ± 1.681
3.038AlaLys: 3.038 ± 2.142
7.765AlaLeu: 7.765 ± 3.952
2.026AlaMet: 2.026 ± 0.949
2.026AlaAsn: 2.026 ± 0.63
2.701AlaPro: 2.701 ± 2.28
2.026AlaGln: 2.026 ± 0.37
2.026AlaArg: 2.026 ± 1.428
4.389AlaSer: 4.389 ± 1.131
2.701AlaThr: 2.701 ± 0.402
5.402AlaVal: 5.402 ± 1.746
0.0AlaTrp: 0.0 ± 0.0
2.026AlaTyr: 2.026 ± 0.37
0.0AlaXaa: 0.0 ± 0.0
Cys
1.013CysAla: 1.013 ± 0.474
0.338CysCys: 0.338 ± 0.158
1.013CysAsp: 1.013 ± 0.474
1.013CysGlu: 1.013 ± 0.513
1.688CysPhe: 1.688 ± 0.36
0.338CysGly: 0.338 ± 0.158
1.013CysHis: 1.013 ± 0.513
0.675CysIle: 0.675 ± 0.316
1.688CysLys: 1.688 ± 0.791
2.026CysLeu: 2.026 ± 0.37
1.35CysMet: 1.35 ± 0.414
1.35CysAsn: 1.35 ± 0.632
1.013CysPro: 1.013 ± 0.474
0.0CysGln: 0.0 ± 0.0
1.013CysArg: 1.013 ± 0.513
3.038CysSer: 3.038 ± 0.759
0.338CysThr: 0.338 ± 0.158
2.026CysVal: 2.026 ± 0.37
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.038AspAla: 3.038 ± 1.423
2.363AspCys: 2.363 ± 0.442
3.714AspAsp: 3.714 ± 1.739
3.038AspGlu: 3.038 ± 0.679
4.051AspPhe: 4.051 ± 0.539
3.038AspGly: 3.038 ± 1.54
2.363AspHis: 2.363 ± 0.442
1.688AspIle: 1.688 ± 0.791
2.026AspLys: 2.026 ± 1.027
4.389AspLeu: 4.389 ± 1.582
2.026AspMet: 2.026 ± 0.949
2.026AspAsn: 2.026 ± 0.949
2.363AspPro: 2.363 ± 0.442
1.013AspGln: 1.013 ± 0.474
2.363AspArg: 2.363 ± 1.107
5.739AspSer: 5.739 ± 0.733
3.714AspThr: 3.714 ± 0.963
7.09AspVal: 7.09 ± 0.4
0.338AspTrp: 0.338 ± 0.773
1.013AspTyr: 1.013 ± 0.474
0.0AspXaa: 0.0 ± 0.0
Glu
1.35GluAla: 1.35 ± 0.632
2.701GluCys: 2.701 ± 0.828
4.051GluAsp: 4.051 ± 0.741
3.038GluGlu: 3.038 ± 1.423
4.727GluPhe: 4.727 ± 1.114
1.688GluGly: 1.688 ± 0.791
1.35GluHis: 1.35 ± 0.414
5.064GluIle: 5.064 ± 0.879
5.739GluLys: 5.739 ± 0.364
6.077GluLeu: 6.077 ± 1.111
1.688GluMet: 1.688 ± 0.791
1.688GluAsn: 1.688 ± 0.791
1.688GluPro: 1.688 ± 0.36
0.675GluGln: 0.675 ± 0.316
2.026GluArg: 2.026 ± 0.37
4.727GluSer: 4.727 ± 1.414
2.363GluThr: 2.363 ± 0.442
3.376GluVal: 3.376 ± 1.581
0.675GluTrp: 0.675 ± 0.637
2.701GluTyr: 2.701 ± 1.059
0.0GluXaa: 0.0 ± 0.0
Phe
2.363PheAla: 2.363 ± 0.505
1.35PheCys: 1.35 ± 0.414
4.727PheAsp: 4.727 ± 0.111
5.402PheGlu: 5.402 ± 1.656
5.739PhePhe: 5.739 ± 2.425
4.727PheGly: 4.727 ± 0.111
1.35PheHis: 1.35 ± 0.632
2.026PheIle: 2.026 ± 0.37
1.688PheLys: 1.688 ± 0.36
5.739PheLeu: 5.739 ± 0.364
0.675PheMet: 0.675 ± 0.441
2.363PheAsn: 2.363 ± 1.107
2.363PhePro: 2.363 ± 1.782
1.688PheGln: 1.688 ± 0.36
4.051PheArg: 4.051 ± 0.427
6.415PheSer: 6.415 ± 0.682
5.064PheThr: 5.064 ± 1.795
4.727PheVal: 4.727 ± 1.025
1.013PheTrp: 1.013 ± 0.714
2.026PheTyr: 2.026 ± 1.911
0.0PheXaa: 0.0 ± 0.0
Gly
2.363GlyAla: 2.363 ± 1.782
2.026GlyCys: 2.026 ± 0.949
5.064GlyAsp: 5.064 ± 1.568
3.376GlyGlu: 3.376 ± 1.43
1.688GlyPhe: 1.688 ± 0.713
2.363GlyGly: 2.363 ± 1.107
1.013GlyHis: 1.013 ± 0.714
2.701GlyIle: 2.701 ± 0.55
2.701GlyLys: 2.701 ± 1.059
3.038GlyLeu: 3.038 ± 1.245
1.013GlyMet: 1.013 ± 0.527
2.026GlyAsn: 2.026 ± 0.949
1.013GlyPro: 1.013 ± 0.513
2.363GlyGln: 2.363 ± 0.442
2.026GlyArg: 2.026 ± 1.027
1.688GlySer: 1.688 ± 0.791
0.675GlyThr: 0.675 ± 0.316
4.389GlyVal: 4.389 ± 0.669
0.338GlyTrp: 0.338 ± 0.158
2.026GlyTyr: 2.026 ± 0.37
0.0GlyXaa: 0.0 ± 0.0
His
1.013HisAla: 1.013 ± 0.474
1.013HisCys: 1.013 ± 0.513
1.35HisAsp: 1.35 ± 0.632
0.675HisGlu: 0.675 ± 0.316
2.026HisPhe: 2.026 ± 0.37
0.338HisGly: 0.338 ± 0.773
0.0HisHis: 0.0 ± 0.0
2.026HisIle: 2.026 ± 1.027
0.338HisLys: 0.338 ± 0.158
2.363HisLeu: 2.363 ± 1.107
1.35HisMet: 1.35 ± 0.696
1.35HisAsn: 1.35 ± 0.632
2.026HisPro: 2.026 ± 0.37
0.675HisGln: 0.675 ± 0.765
1.35HisArg: 1.35 ± 0.632
2.026HisSer: 2.026 ± 0.37
1.35HisThr: 1.35 ± 0.696
2.363HisVal: 2.363 ± 0.841
0.338HisTrp: 0.338 ± 0.158
1.013HisTyr: 1.013 ± 0.513
0.0HisXaa: 0.0 ± 0.0
Ile
2.026IleAla: 2.026 ± 0.37
1.35IleCys: 1.35 ± 0.632
4.727IleAsp: 4.727 ± 1.414
3.376IleGlu: 3.376 ± 0.719
2.363IlePhe: 2.363 ± 0.841
2.701IleGly: 2.701 ± 0.828
0.675IleHis: 0.675 ± 0.637
2.026IleIle: 2.026 ± 2.251
2.363IleLys: 2.363 ± 0.505
3.714IleLeu: 3.714 ± 2.909
0.338IleMet: 0.338 ± 0.158
3.038IleAsn: 3.038 ± 0.759
3.376IlePro: 3.376 ± 0.719
1.35IleGln: 1.35 ± 0.696
2.363IleArg: 2.363 ± 1.107
5.064IleSer: 5.064 ± 1.079
2.026IleThr: 2.026 ± 0.37
5.064IleVal: 5.064 ± 1.743
0.0IleTrp: 0.0 ± 0.0
2.363IleTyr: 2.363 ± 2.996
0.0IleXaa: 0.0 ± 0.0
Lys
1.688LysAla: 1.688 ± 0.768
0.675LysCys: 0.675 ± 0.316
2.026LysAsp: 2.026 ± 0.37
3.376LysGlu: 3.376 ± 0.818
3.714LysPhe: 3.714 ± 0.428
1.013LysGly: 1.013 ± 0.474
1.35LysHis: 1.35 ± 0.632
4.389LysIle: 4.389 ± 1.943
3.714LysLys: 3.714 ± 2.004
6.415LysLeu: 6.415 ± 1.562
2.026LysMet: 2.026 ± 0.949
3.376LysAsn: 3.376 ± 0.743
4.051LysPro: 4.051 ± 0.539
0.338LysGln: 0.338 ± 0.158
3.038LysArg: 3.038 ± 1.423
2.026LysSer: 2.026 ± 0.763
3.376LysThr: 3.376 ± 0.355
3.376LysVal: 3.376 ± 1.63
0.338LysTrp: 0.338 ± 0.843
2.701LysTyr: 2.701 ± 0.828
0.0LysXaa: 0.0 ± 0.0
Leu
7.765LeuAla: 7.765 ± 4.807
2.026LeuCys: 2.026 ± 1.027
2.701LeuAsp: 2.701 ± 0.939
6.415LeuGlu: 6.415 ± 1.495
3.376LeuPhe: 3.376 ± 2.292
2.701LeuGly: 2.701 ± 0.402
2.026LeuHis: 2.026 ± 0.763
5.402LeuIle: 5.402 ± 1.652
4.389LeuLys: 4.389 ± 0.8
9.115LeuLeu: 9.115 ± 2.03
2.026LeuMet: 2.026 ± 0.37
4.389LeuAsn: 4.389 ± 1.599
2.363LeuPro: 2.363 ± 1.782
2.026LeuGln: 2.026 ± 0.63
5.402LeuArg: 5.402 ± 0.805
10.804LeuSer: 10.804 ± 2.209
4.727LeuThr: 4.727 ± 1.726
9.115LeuVal: 9.115 ± 2.809
0.675LeuTrp: 0.675 ± 0.765
3.714LeuTyr: 3.714 ± 0.428
0.0LeuXaa: 0.0 ± 0.0
Met
0.675MetAla: 0.675 ± 0.316
1.013MetCys: 1.013 ± 0.474
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
2.026MetPhe: 2.026 ± 0.949
0.338MetGly: 0.338 ± 0.158
0.675MetHis: 0.675 ± 0.316
1.35MetIle: 1.35 ± 0.414
1.35MetLys: 1.35 ± 0.632
2.701MetLeu: 2.701 ± 0.402
0.0MetMet: 0.0 ± 0.0
1.688MetAsn: 1.688 ± 0.791
1.013MetPro: 1.013 ± 0.474
0.0MetGln: 0.0 ± 0.0
1.688MetArg: 1.688 ± 0.791
3.714MetSer: 3.714 ± 1.102
1.688MetThr: 1.688 ± 0.791
1.013MetVal: 1.013 ± 0.513
0.0MetTrp: 0.0 ± 0.0
0.675MetTyr: 0.675 ± 0.316
0.0MetXaa: 0.0 ± 0.0
Asn
1.013AsnAla: 1.013 ± 0.474
1.013AsnCys: 1.013 ± 0.513
2.701AsnAsp: 2.701 ± 1.059
0.675AsnGlu: 0.675 ± 0.316
3.714AsnPhe: 3.714 ± 2.17
4.389AsnGly: 4.389 ± 2.056
1.35AsnHis: 1.35 ± 0.632
1.688AsnIle: 1.688 ± 0.791
2.026AsnLys: 2.026 ± 1.375
4.051AsnLeu: 4.051 ± 1.897
1.35AsnMet: 1.35 ± 0.337
2.701AsnAsn: 2.701 ± 0.55
2.363AsnPro: 2.363 ± 1.107
2.026AsnGln: 2.026 ± 1.428
4.051AsnArg: 4.051 ± 1.897
7.765AsnSer: 7.765 ± 2.155
2.363AsnThr: 2.363 ± 0.442
3.714AsnVal: 3.714 ± 0.963
0.338AsnTrp: 0.338 ± 0.773
2.363AsnTyr: 2.363 ± 1.107
0.0AsnXaa: 0.0 ± 0.0
Pro
2.026ProAla: 2.026 ± 1.559
0.675ProCys: 0.675 ± 0.316
1.35ProAsp: 1.35 ± 0.414
3.376ProGlu: 3.376 ± 0.355
3.376ProPhe: 3.376 ± 1.43
2.363ProGly: 2.363 ± 1.107
1.013ProHis: 1.013 ± 0.474
2.026ProIle: 2.026 ± 2.816
1.35ProLys: 1.35 ± 0.414
3.376ProLeu: 3.376 ± 1.165
0.338ProMet: 0.338 ± 0.158
2.363ProAsn: 2.363 ± 0.442
1.688ProPro: 1.688 ± 1.681
1.688ProGln: 1.688 ± 0.768
3.038ProArg: 3.038 ± 0.901
2.701ProSer: 2.701 ± 1.392
3.376ProThr: 3.376 ± 1.43
7.427ProVal: 7.427 ± 0.483
0.338ProTrp: 0.338 ± 0.158
1.35ProTyr: 1.35 ± 1.274
0.0ProXaa: 0.0 ± 0.0
Gln
2.026GlnAla: 2.026 ± 1.428
0.0GlnCys: 0.0 ± 0.0
1.013GlnAsp: 1.013 ± 0.474
1.688GlnGlu: 1.688 ± 0.36
1.688GlnPhe: 1.688 ± 0.36
1.013GlnGly: 1.013 ± 0.513
0.338GlnHis: 0.338 ± 0.158
0.675GlnIle: 0.675 ± 0.316
2.026GlnLys: 2.026 ± 0.949
2.701GlnLeu: 2.701 ± 0.55
0.0GlnMet: 0.0 ± 0.0
2.363GlnAsn: 2.363 ± 1.445
1.688GlnPro: 1.688 ± 0.768
1.013GlnGln: 1.013 ± 0.474
1.688GlnArg: 1.688 ± 0.791
2.363GlnSer: 2.363 ± 0.442
2.026GlnThr: 2.026 ± 0.37
1.688GlnVal: 1.688 ± 0.768
0.0GlnTrp: 0.0 ± 0.0
1.35GlnTyr: 1.35 ± 0.632
0.0GlnXaa: 0.0 ± 0.0
Arg
3.038ArgAla: 3.038 ± 1.052
0.338ArgCys: 0.338 ± 0.158
5.064ArgAsp: 5.064 ± 0.05
3.038ArgGlu: 3.038 ± 0.679
2.363ArgPhe: 2.363 ± 1.107
1.688ArgGly: 1.688 ± 0.36
1.688ArgHis: 1.688 ± 0.791
3.376ArgIle: 3.376 ± 0.355
3.376ArgLys: 3.376 ± 1.581
7.427ArgLeu: 7.427 ± 1.381
1.013ArgMet: 1.013 ± 0.474
5.739ArgAsn: 5.739 ± 1.877
2.363ArgPro: 2.363 ± 0.919
1.35ArgGln: 1.35 ± 0.632
3.038ArgArg: 3.038 ± 0.679
3.714ArgSer: 3.714 ± 1.33
3.376ArgThr: 3.376 ± 0.818
4.727ArgVal: 4.727 ± 1.813
0.338ArgTrp: 0.338 ± 0.158
1.688ArgTyr: 1.688 ± 0.791
0.0ArgXaa: 0.0 ± 0.0
Ser
5.402SerAla: 5.402 ± 1.117
0.0SerCys: 0.0 ± 0.0
5.739SerAsp: 5.739 ± 1.256
5.402SerGlu: 5.402 ± 0.206
7.427SerPhe: 7.427 ± 2.66
5.739SerGly: 5.739 ± 1.733
2.701SerHis: 2.701 ± 0.55
4.727SerIle: 4.727 ± 1.025
5.739SerLys: 5.739 ± 1.077
6.415SerLeu: 6.415 ± 0.682
1.35SerMet: 1.35 ± 0.632
4.389SerAsn: 4.389 ± 1.169
3.038SerPro: 3.038 ± 1.245
3.714SerGln: 3.714 ± 0.963
6.077SerArg: 6.077 ± 2.103
6.077SerSer: 6.077 ± 2.486
6.752SerThr: 6.752 ± 1.634
5.402SerVal: 5.402 ± 0.805
0.675SerTrp: 0.675 ± 0.316
4.727SerTyr: 4.727 ± 0.883
0.0SerXaa: 0.0 ± 0.0
Thr
3.714ThrAla: 3.714 ± 0.585
1.013ThrCys: 1.013 ± 0.474
1.688ThrAsp: 1.688 ± 0.791
3.376ThrGlu: 3.376 ± 1.581
5.064ThrPhe: 5.064 ± 0.879
1.35ThrGly: 1.35 ± 0.414
1.35ThrHis: 1.35 ± 0.913
3.038ThrIle: 3.038 ± 1.423
3.376ThrLys: 3.376 ± 1.581
5.739ThrLeu: 5.739 ± 1.185
1.013ThrMet: 1.013 ± 0.474
2.026ThrAsn: 2.026 ± 2.251
3.038ThrPro: 3.038 ± 1.54
1.688ThrGln: 1.688 ± 0.36
3.376ThrArg: 3.376 ± 0.719
3.376ThrSer: 3.376 ± 0.719
3.038ThrThr: 3.038 ± 0.759
4.051ThrVal: 4.051 ± 1.324
0.0ThrTrp: 0.0 ± 0.0
3.714ThrTyr: 3.714 ± 0.585
0.0ThrXaa: 0.0 ± 0.0
Val
5.064ValAla: 5.064 ± 4.665
2.026ValCys: 2.026 ± 0.949
5.064ValAsp: 5.064 ± 0.985
4.727ValGlu: 4.727 ± 1.414
5.739ValPhe: 5.739 ± 2.026
2.701ValGly: 2.701 ± 1.059
2.026ValHis: 2.026 ± 0.63
4.051ValIle: 4.051 ± 0.741
4.389ValLys: 4.389 ± 1.131
3.714ValLeu: 3.714 ± 2.885
1.013ValMet: 1.013 ± 0.714
5.064ValAsn: 5.064 ± 1.871
6.752ValPro: 6.752 ± 1.634
2.363ValGln: 2.363 ± 0.442
7.09ValArg: 7.09 ± 0.996
11.141ValSer: 11.141 ± 0.474
3.714ValThr: 3.714 ± 2.17
9.791ValVal: 9.791 ± 1.573
0.0ValTrp: 0.0 ± 0.0
2.701ValTyr: 2.701 ± 1.265
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.675TrpAsp: 0.675 ± 0.316
0.338TrpGlu: 0.338 ± 0.158
0.675TrpPhe: 0.675 ± 0.637
0.0TrpGly: 0.0 ± 0.0
0.338TrpHis: 0.338 ± 0.158
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.013TrpLeu: 1.013 ± 1.408
0.0TrpMet: 0.0 ± 0.0
0.338TrpAsn: 0.338 ± 0.158
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.338TrpArg: 0.338 ± 0.158
0.0TrpSer: 0.0 ± 0.0
0.338TrpThr: 0.338 ± 0.158
1.013TrpVal: 1.013 ± 2.53
0.0TrpTrp: 0.0 ± 0.0
0.338TrpTyr: 0.338 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.026TyrAla: 2.026 ± 0.63
0.675TyrCys: 0.675 ± 0.316
3.376TyrAsp: 3.376 ± 0.818
2.363TyrGlu: 2.363 ± 1.217
3.376TyrPhe: 3.376 ± 1.426
2.701TyrGly: 2.701 ± 0.828
0.675TyrHis: 0.675 ± 0.316
1.688TyrIle: 1.688 ± 0.36
1.688TyrLys: 1.688 ± 0.791
3.376TyrLeu: 3.376 ± 1.43
0.675TyrMet: 0.675 ± 0.637
1.35TyrAsn: 1.35 ± 0.414
0.338TyrPro: 0.338 ± 0.158
1.35TyrGln: 1.35 ± 0.414
2.701TyrArg: 2.701 ± 0.55
4.727TyrSer: 4.727 ± 1.114
2.026TyrThr: 2.026 ± 1.027
3.376TyrVal: 3.376 ± 0.719
0.0TyrTrp: 0.0 ± 0.0
0.675TyrTyr: 0.675 ± 0.316
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2963 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski