Amino acid dipeptide frequency for Wenling picorna-like virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and uniformly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red in the table, and all underrepresented ones in blue.
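The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the toy sequences, the use of one-letter codes (with `X` standing in for Xaa), and the choice to count overlapping pairs within each protein only are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: dipeptides are overlapping adjacent pairs counted inside each
    protein; pairs spanning two different proteins are not counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Hypothetical toy input, not the virus proteome.
toy = ["MKVLAAGSS", "MAASSV"]
freqs = dipeptide_permille(toy)

uniform = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely
over = sorted(dp for dp, f in freqs.items() if f > uniform)
under = sorted(dp for dp, f in freqs.items() if f < uniform)
print(len(over), "overrepresented,", len(under), "underrepresented")
```

Comparing each value with the uniform expectation of 2.5‰ gives the same over/under split that the red and blue marking expresses in the table.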

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
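As a short worked conversion, the snippet below turns one table value into a fraction and compares it with the uniform expectation of 1/400. The helper name `permille_to_fraction` is hypothetical; the ValSer value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value into a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


val_ser = permille_to_fraction(10.208)  # ValSer entry from the table
expected = 1 / 400                      # uniform expectation, 0.25% = 2.5 per mille
enrichment = val_ser / expected         # how many times more frequent than uniform

print(f"ValSer fraction: {val_ser:.6f}, enrichment over uniform: {enrichment:.2f}x")
```

Under these assumptions ValSer is roughly four times more frequent than a uniform distribution over the 400 standard dipeptides would predict.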

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.656 ± 0.229
AlaCys: 1.823 ± 0.303
AlaAsp: 5.468 ± 1.537
AlaGlu: 2.187 ± 0.113
AlaPhe: 2.552 ± 0.703
AlaGly: 4.01 ± 0.417
AlaHis: 1.823 ± 0.303
AlaIle: 1.823 ± 0.323
AlaLys: 2.917 ± 0.36
AlaLeu: 5.104 ± 1.406
AlaMet: 2.552 ± 1.329
AlaAsn: 5.468 ± 0.969
AlaPro: 3.646 ± 0.02
AlaGln: 1.823 ± 0.323
AlaArg: 2.552 ± 0.703
AlaSer: 8.385 ± 5.029
AlaThr: 3.646 ± 0.02
AlaVal: 4.375 ± 1.026
AlaTrp: 0.365 ± 0.437
AlaTyr: 2.187 ± 0.513
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.552 ± 0.076
CysCys: 1.094 ± 0.57
CysAsp: 0.729 ± 0.38
CysGlu: 1.823 ± 0.949
CysPhe: 0.365 ± 0.19
CysGly: 4.01 ± 1.462
CysHis: 1.094 ± 0.057
CysIle: 0.729 ± 0.38
CysLys: 1.094 ± 0.57
CysLeu: 0.0 ± 0.0
CysMet: 0.365 ± 0.19
CysAsn: 0.729 ± 0.873
CysPro: 0.0 ± 0.0
CysGln: 0.729 ± 0.38
CysArg: 0.0 ± 0.0
CysSer: 2.187 ± 1.993
CysThr: 2.187 ± 0.113
CysVal: 1.823 ± 0.949
CysTrp: 0.365 ± 0.19
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.01 ± 0.21
AspCys: 0.729 ± 0.38
AspAsp: 3.646 ± 0.646
AspGlu: 5.468 ± 2.222
AspPhe: 4.375 ± 1.652
AspGly: 2.917 ± 0.893
AspHis: 1.458 ± 0.133
AspIle: 2.917 ± 0.266
AspLys: 1.823 ± 0.949
AspLeu: 4.01 ± 1.67
AspMet: 1.094 ± 0.367
AspAsn: 4.01 ± 0.417
AspPro: 2.187 ± 0.113
AspGln: 3.281 ± 0.456
AspArg: 1.094 ± 0.057
AspSer: 2.552 ± 0.076
AspThr: 4.01 ± 1.043
AspVal: 5.104 ± 1.406
AspTrp: 0.0 ± 0.0
AspTyr: 4.01 ± 0.21
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.281 ± 0.456
GluCys: 2.552 ± 0.076
GluAsp: 1.823 ± 0.949
GluGlu: 4.739 ± 0.037
GluPhe: 3.646 ± 0.607
GluGly: 2.917 ± 0.266
GluHis: 0.729 ± 0.247
GluIle: 2.917 ± 0.893
GluLys: 3.646 ± 0.02
GluLeu: 7.291 ± 1.918
GluMet: 2.187 ± 0.442
GluAsn: 2.917 ± 0.266
GluPro: 1.823 ± 0.323
GluGln: 1.094 ± 0.57
GluArg: 0.729 ± 0.38
GluSer: 3.281 ± 0.456
GluThr: 4.739 ± 0.663
GluVal: 5.833 ± 1.159
GluTrp: 0.365 ± 0.19
GluTyr: 3.281 ± 1.083
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.739 ± 0.663
PheCys: 0.729 ± 0.38
PheAsp: 1.458 ± 0.493
PheGlu: 2.917 ± 0.36
PhePhe: 1.094 ± 0.057
PheGly: 2.917 ± 0.266
PheHis: 0.365 ± 0.437
PheIle: 2.187 ± 0.513
PheLys: 2.552 ± 0.076
PheLeu: 2.917 ± 0.987
PheMet: 1.823 ± 0.323
PheAsn: 2.187 ± 0.113
PhePro: 1.823 ± 0.303
PheGln: 1.458 ± 1.12
PheArg: 2.917 ± 0.893
PheSer: 6.198 ± 0.53
PheThr: 3.281 ± 0.797
PheVal: 3.281 ± 0.456
PheTrp: 0.729 ± 0.38
PheTyr: 1.458 ± 0.76
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.552 ± 0.076
GlyCys: 0.729 ± 0.38
GlyAsp: 3.281 ± 1.709
GlyGlu: 3.281 ± 0.17
GlyPhe: 1.458 ± 1.746
GlyGly: 0.365 ± 0.437
GlyHis: 0.729 ± 0.247
GlyIle: 3.281 ± 1.709
GlyLys: 4.01 ± 0.21
GlyLeu: 2.917 ± 0.266
GlyMet: 2.187 ± 0.513
GlyAsn: 2.187 ± 0.113
GlyPro: 1.458 ± 1.12
GlyGln: 1.823 ± 0.323
GlyArg: 1.094 ± 0.057
GlySer: 3.281 ± 0.17
GlyThr: 1.094 ± 0.683
GlyVal: 2.917 ± 0.987
GlyTrp: 0.365 ± 0.437
GlyTyr: 3.281 ± 0.17
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.729 ± 0.247
HisCys: 0.729 ± 0.38
HisAsp: 1.823 ± 0.323
HisGlu: 2.187 ± 0.74
HisPhe: 1.458 ± 0.133
HisGly: 1.094 ± 0.057
HisHis: 1.458 ± 0.76
HisIle: 1.094 ± 0.57
HisLys: 2.187 ± 0.113
HisLeu: 1.094 ± 0.57
HisMet: 1.458 ± 0.76
HisAsn: 1.458 ± 0.133
HisPro: 1.823 ± 1.556
HisGln: 1.094 ± 0.57
HisArg: 1.823 ± 0.303
HisSer: 2.917 ± 0.266
HisThr: 0.729 ± 0.247
HisVal: 1.094 ± 0.57
HisTrp: 0.365 ± 0.437
HisTyr: 0.365 ± 0.19
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.01 ± 0.21
IleCys: 1.094 ± 0.057
IleAsp: 3.646 ± 0.646
IleGlu: 4.739 ± 0.037
IlePhe: 2.917 ± 0.266
IleGly: 1.823 ± 0.323
IleHis: 0.0 ± 0.0
IleIle: 2.552 ± 1.329
IleLys: 2.917 ± 0.36
IleLeu: 2.552 ± 1.176
IleMet: 1.094 ± 0.57
IleAsn: 3.281 ± 1.083
IlePro: 3.646 ± 0.607
IleGln: 2.552 ± 0.55
IleArg: 5.104 ± 0.474
IleSer: 1.823 ± 0.949
IleThr: 2.187 ± 0.513
IleVal: 4.739 ± 1.842
IleTrp: 0.729 ± 0.38
IleTyr: 1.823 ± 0.303
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.646 ± 0.646
LysCys: 1.094 ± 0.57
LysAsp: 4.375 ± 2.279
LysGlu: 3.646 ± 1.899
LysPhe: 1.094 ± 0.057
LysGly: 1.458 ± 0.76
LysHis: 3.281 ± 0.17
LysIle: 2.187 ± 0.74
LysLys: 5.468 ± 2.848
LysLeu: 4.375 ± 0.399
LysMet: 1.094 ± 0.57
LysAsn: 2.552 ± 0.703
LysPro: 2.917 ± 2.239
LysGln: 3.281 ± 0.456
LysArg: 2.917 ± 0.987
LysSer: 2.917 ± 0.893
LysThr: 3.646 ± 0.646
LysVal: 5.104 ± 0.474
LysTrp: 0.365 ± 0.437
LysTyr: 2.917 ± 0.36
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.104 ± 1.406
LeuCys: 1.458 ± 0.133
LeuAsp: 4.01 ± 1.043
LeuGlu: 4.739 ± 1.216
LeuPhe: 3.646 ± 0.646
LeuGly: 2.552 ± 1.176
LeuHis: 2.187 ± 0.113
LeuIle: 4.375 ± 1.026
LeuLys: 4.375 ± 1.652
LeuLeu: 4.01 ± 0.21
LeuMet: 2.552 ± 0.076
LeuAsn: 3.646 ± 1.233
LeuPro: 2.917 ± 0.36
LeuGln: 2.917 ± 0.36
LeuArg: 4.01 ± 1.043
LeuSer: 4.739 ± 0.589
LeuThr: 5.104 ± 0.779
LeuVal: 5.104 ± 0.153
LeuTrp: 0.365 ± 0.437
LeuTyr: 2.552 ± 1.176
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.281 ± 1.083
MetCys: 1.823 ± 0.949
MetAsp: 2.552 ± 0.076
MetGlu: 2.187 ± 1.139
MetPhe: 1.823 ± 0.949
MetGly: 1.094 ± 0.57
MetHis: 1.458 ± 0.76
MetIle: 1.458 ± 0.133
MetLys: 1.823 ± 0.323
MetLeu: 1.094 ± 0.057
MetMet: 0.0 ± 0.0
MetAsn: 1.823 ± 0.323
MetPro: 0.729 ± 0.247
MetGln: 1.094 ± 0.057
MetArg: 0.365 ± 0.19
MetSer: 2.552 ± 0.076
MetThr: 1.823 ± 0.93
MetVal: 1.823 ± 0.303
MetTrp: 0.0 ± 0.0
MetTyr: 2.917 ± 0.36
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.104 ± 0.474
AsnCys: 1.458 ± 0.133
AsnAsp: 0.729 ± 0.38
AsnGlu: 2.917 ± 0.36
AsnPhe: 3.281 ± 0.456
AsnGly: 1.458 ± 0.493
AsnHis: 1.823 ± 0.949
AsnIle: 3.646 ± 0.02
AsnLys: 1.823 ± 0.93
AsnLeu: 4.739 ± 1.29
AsnMet: 2.917 ± 0.36
AsnAsn: 2.552 ± 0.076
AsnPro: 2.917 ± 0.266
AsnGln: 2.187 ± 0.113
AsnArg: 2.552 ± 1.329
AsnSer: 2.187 ± 0.113
AsnThr: 2.917 ± 1.613
AsnVal: 4.01 ± 0.21
AsnTrp: 1.094 ± 0.57
AsnTyr: 2.917 ± 0.36
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.458 ± 0.133
ProCys: 1.094 ± 0.57
ProAsp: 2.917 ± 0.266
ProGlu: 3.281 ± 0.456
ProPhe: 3.646 ± 1.86
ProGly: 2.187 ± 2.619
ProHis: 1.094 ± 1.31
ProIle: 1.458 ± 0.133
ProLys: 2.187 ± 1.366
ProLeu: 5.468 ± 0.969
ProMet: 1.458 ± 0.133
ProAsn: 2.187 ± 1.366
ProPro: 2.552 ± 0.55
ProGln: 2.187 ± 0.113
ProArg: 2.187 ± 0.513
ProSer: 2.917 ± 1.613
ProThr: 2.187 ± 1.366
ProVal: 3.646 ± 0.646
ProTrp: 0.729 ± 0.247
ProTyr: 2.187 ± 0.74
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.458 ± 0.493
GlnCys: 1.094 ± 0.057
GlnAsp: 1.823 ± 0.303
GlnGlu: 2.187 ± 1.139
GlnPhe: 2.552 ± 1.803
GlnGly: 1.094 ± 0.57
GlnHis: 0.729 ± 0.38
GlnIle: 2.187 ± 1.366
GlnLys: 2.552 ± 1.329
GlnLeu: 2.917 ± 0.266
GlnMet: 1.458 ± 0.493
GlnAsn: 2.552 ± 0.076
GlnPro: 1.823 ± 0.303
GlnGln: 1.458 ± 0.493
GlnArg: 1.094 ± 0.683
GlnSer: 4.375 ± 0.227
GlnThr: 2.187 ± 0.513
GlnVal: 2.552 ± 0.55
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.729 ± 0.38
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.917 ± 0.266
ArgCys: 0.0 ± 0.0
ArgAsp: 4.739 ± 0.589
ArgGlu: 1.823 ± 0.323
ArgPhe: 1.458 ± 0.76
ArgGly: 1.458 ± 0.133
ArgHis: 1.823 ± 0.323
ArgIle: 4.375 ± 0.853
ArgLys: 3.646 ± 0.646
ArgLeu: 2.917 ± 0.36
ArgMet: 1.458 ± 0.133
ArgAsn: 1.458 ± 0.133
ArgPro: 3.281 ± 1.083
ArgGln: 0.729 ± 0.247
ArgArg: 4.375 ± 2.279
ArgSer: 2.552 ± 1.329
ArgThr: 1.823 ± 0.323
ArgVal: 4.739 ± 1.29
ArgTrp: 1.458 ± 0.133
ArgTyr: 1.458 ± 0.133
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.562 ± 0.967
SerCys: 1.094 ± 0.683
SerAsp: 2.917 ± 0.266
SerGlu: 2.917 ± 0.266
SerPhe: 4.739 ± 0.663
SerGly: 3.281 ± 0.456
SerHis: 1.094 ± 0.57
SerIle: 3.646 ± 0.646
SerLys: 4.739 ± 0.663
SerLeu: 6.198 ± 1.349
SerMet: 2.187 ± 0.113
SerAsn: 4.739 ± 2.543
SerPro: 2.917 ± 0.987
SerGln: 3.646 ± 1.86
SerArg: 4.01 ± 0.21
SerSer: 8.75 ± 2.333
SerThr: 7.291 ± 1.84
SerVal: 6.927 ± 1.403
SerTrp: 0.729 ± 0.873
SerTyr: 3.646 ± 1.233
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.646 ± 0.02
ThrCys: 2.187 ± 0.113
ThrAsp: 3.281 ± 0.797
ThrGlu: 3.646 ± 0.02
ThrPhe: 2.917 ± 0.987
ThrGly: 1.094 ± 0.057
ThrHis: 1.458 ± 0.76
ThrIle: 5.104 ± 1.726
ThrLys: 3.646 ± 0.646
ThrLeu: 2.917 ± 1.519
ThrMet: 2.187 ± 0.113
ThrAsn: 1.823 ± 0.323
ThrPro: 3.646 ± 1.86
ThrGln: 2.187 ± 0.74
ThrArg: 2.552 ± 0.703
ThrSer: 4.739 ± 1.29
ThrThr: 3.646 ± 0.607
ThrVal: 5.833 ± 2.599
ThrTrp: 0.365 ± 0.19
ThrTyr: 2.187 ± 0.74
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.739 ± 2.543
ValCys: 0.729 ± 0.247
ValAsp: 6.198 ± 1.349
ValGlu: 2.187 ± 0.113
ValPhe: 1.823 ± 0.303
ValGly: 3.281 ± 1.423
ValHis: 2.187 ± 0.513
ValIle: 5.104 ± 1.406
ValLys: 5.468 ± 2.222
ValLeu: 5.833 ± 0.72
ValMet: 2.187 ± 0.513
ValAsn: 3.646 ± 0.02
ValPro: 5.104 ± 1.1
ValGln: 2.187 ± 1.139
ValArg: 4.375 ± 1.026
ValSer: 10.208 ± 2.2
ValThr: 3.281 ± 0.797
ValVal: 4.375 ± 0.853
ValTrp: 0.729 ± 0.38
ValTyr: 4.375 ± 0.227
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.365 ± 0.19
TrpGlu: 0.365 ± 0.19
TrpPhe: 0.365 ± 0.19
TrpGly: 0.729 ± 0.38
TrpHis: 0.365 ± 0.437
TrpIle: 0.0 ± 0.0
TrpLys: 0.729 ± 0.247
TrpLeu: 0.0 ± 0.0
TrpMet: 0.365 ± 0.19
TrpAsn: 1.823 ± 0.323
TrpPro: 0.365 ± 0.437
TrpGln: 0.0 ± 0.0
TrpArg: 0.729 ± 0.247
TrpSer: 1.458 ± 0.493
TrpThr: 0.729 ± 0.38
TrpVal: 1.458 ± 1.12
TrpTrp: 0.365 ± 0.437
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.917 ± 0.36
TyrCys: 0.365 ± 0.19
TyrAsp: 3.281 ± 0.456
TyrGlu: 2.552 ± 0.076
TyrPhe: 1.823 ± 0.323
TyrGly: 2.552 ± 0.55
TyrHis: 1.823 ± 0.303
TyrIle: 2.187 ± 0.113
TyrLys: 0.729 ± 0.247
TyrLeu: 4.01 ± 1.67
TyrMet: 0.729 ± 0.247
TyrAsn: 2.187 ± 0.513
TyrPro: 1.458 ± 0.76
TyrGln: 1.094 ± 0.683
TyrArg: 4.01 ± 1.462
TyrSer: 4.01 ± 1.043
TyrThr: 2.552 ± 0.076
TyrVal: 3.281 ± 0.797
TyrTrp: 0.729 ± 0.247
TyrTyr: 2.187 ± 0.513
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2744 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
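A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the code behind the values in the table: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all hypothetical choices.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide over overlapping within-protein pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate.

    Assumption: the error is taken as the standard deviation across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not the virus sequences.
toy_proteome = ["MKVLAAGSS", "MAASSV"]
err_aa = bootstrap_error(toy_proteome, "AA")
err_ww = bootstrap_error(toy_proteome, "WW")
print(f"AA error: {err_aa:.3f} per mille, WW error: {err_ww:.3f} per mille")
```

Because whole proteins are resampled, a proteome with only a few proteins yields very few distinct replicates, so the resulting error estimates should be read with caution.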

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski