Amino acid dipepetide frequency for Hubei toti-like virus 12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.358AlaAla: 8.358 ± 0.654
1.194AlaCys: 1.194 ± 0.023
4.179AlaAsp: 4.179 ± 0.491
4.776AlaGlu: 4.776 ± 2.362
0.597AlaPhe: 0.597 ± 0.421
4.776AlaGly: 4.776 ± 0.725
2.985AlaHis: 2.985 ± 0.351
4.776AlaIle: 4.776 ± 1.544
4.776AlaLys: 4.776 ± 1.731
6.567AlaLeu: 6.567 ± 0.28
1.791AlaMet: 1.791 ± 0.374
2.388AlaAsn: 2.388 ± 0.772
4.179AlaPro: 4.179 ± 1.31
4.776AlaGln: 4.776 ± 0.094
7.164AlaArg: 7.164 ± 2.597
5.373AlaSer: 5.373 ± 0.515
4.776AlaThr: 4.776 ± 1.544
5.373AlaVal: 5.373 ± 1.941
3.582AlaTrp: 3.582 ± 0.748
1.791AlaTyr: 1.791 ± 0.374
0.0AlaXaa: 0.0 ± 0.0
Cys
2.388CysAla: 2.388 ± 0.772
1.194CysCys: 1.194 ± 0.842
1.194CysAsp: 1.194 ± 0.795
1.791CysGlu: 1.791 ± 0.445
0.597CysPhe: 0.597 ± 0.421
1.791CysGly: 1.791 ± 1.263
1.194CysHis: 1.194 ± 0.795
1.194CysIle: 1.194 ± 0.023
0.597CysLys: 0.597 ± 0.421
0.597CysLeu: 0.597 ± 0.398
0.0CysMet: 0.0 ± 0.0
1.194CysAsn: 1.194 ± 0.023
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.597CysArg: 0.597 ± 0.421
2.388CysSer: 2.388 ± 0.047
0.597CysThr: 0.597 ± 0.421
0.597CysVal: 0.597 ± 0.421
0.0CysTrp: 0.0 ± 0.0
0.597CysTyr: 0.597 ± 0.421
0.0CysXaa: 0.0 ± 0.0
Asp
3.582AspAla: 3.582 ± 0.889
0.0AspCys: 0.0 ± 0.0
2.388AspAsp: 2.388 ± 0.866
1.791AspGlu: 1.791 ± 0.374
1.791AspPhe: 1.791 ± 1.193
2.388AspGly: 2.388 ± 0.047
0.597AspHis: 0.597 ± 0.421
2.388AspIle: 2.388 ± 0.866
1.194AspLys: 1.194 ± 0.795
4.179AspLeu: 4.179 ± 0.327
2.985AspMet: 2.985 ± 1.287
0.0AspAsn: 0.0 ± 0.0
5.373AspPro: 5.373 ± 0.304
2.388AspGln: 2.388 ± 0.772
4.179AspArg: 4.179 ± 1.31
1.194AspSer: 1.194 ± 0.795
1.791AspThr: 1.791 ± 0.445
3.582AspVal: 3.582 ± 1.567
4.776AspTrp: 4.776 ± 1.544
0.597AspTyr: 0.597 ± 0.398
0.0AspXaa: 0.0 ± 0.0
Glu
2.985GluAla: 2.985 ± 1.169
0.597GluCys: 0.597 ± 0.421
1.791GluAsp: 1.791 ± 1.263
3.582GluGlu: 3.582 ± 0.889
1.791GluPhe: 1.791 ± 1.263
2.388GluGly: 2.388 ± 0.866
1.791GluHis: 1.791 ± 0.445
1.194GluIle: 1.194 ± 0.023
2.388GluLys: 2.388 ± 0.772
3.582GluLeu: 3.582 ± 0.07
2.985GluMet: 2.985 ± 0.265
1.194GluAsn: 1.194 ± 0.023
1.791GluPro: 1.791 ± 0.445
1.194GluGln: 1.194 ± 0.795
3.582GluArg: 3.582 ± 0.889
5.373GluSer: 5.373 ± 2.76
2.985GluThr: 2.985 ± 1.169
3.582GluVal: 3.582 ± 0.748
2.985GluTrp: 2.985 ± 1.287
1.791GluTyr: 1.791 ± 0.445
0.0GluXaa: 0.0 ± 0.0
Phe
1.194PheAla: 1.194 ± 0.023
0.0PheCys: 0.0 ± 0.0
1.194PheAsp: 1.194 ± 0.023
1.791PheGlu: 1.791 ± 0.445
0.597PhePhe: 0.597 ± 0.421
2.985PheGly: 2.985 ± 1.287
0.597PheHis: 0.597 ± 0.421
1.194PheIle: 1.194 ± 0.023
2.388PheLys: 2.388 ± 0.866
2.985PheLeu: 2.985 ± 1.169
0.597PheMet: 0.597 ± 0.421
2.388PheAsn: 2.388 ± 0.047
0.597PhePro: 0.597 ± 0.398
0.597PheGln: 0.597 ± 0.421
0.597PheArg: 0.597 ± 0.421
1.791PheSer: 1.791 ± 0.445
1.194PheThr: 1.194 ± 0.795
2.985PheVal: 2.985 ± 1.287
0.0PheTrp: 0.0 ± 0.0
3.582PheTyr: 3.582 ± 0.889
0.0PheXaa: 0.0 ± 0.0
Gly
4.776GlyAla: 4.776 ± 1.544
2.388GlyCys: 2.388 ± 0.047
2.985GlyAsp: 2.985 ± 0.468
3.582GlyGlu: 3.582 ± 0.07
2.388GlyPhe: 2.388 ± 0.047
4.179GlyGly: 4.179 ± 1.31
0.0GlyHis: 0.0 ± 0.0
2.985GlyIle: 2.985 ± 0.351
1.791GlyLys: 1.791 ± 1.263
5.97GlyLeu: 5.97 ± 1.755
0.597GlyMet: 0.597 ± 0.398
1.791GlyAsn: 1.791 ± 0.374
1.791GlyPro: 1.791 ± 0.374
1.791GlyGln: 1.791 ± 1.193
4.179GlyArg: 4.179 ± 0.327
4.776GlySer: 4.776 ± 0.094
3.582GlyThr: 3.582 ± 0.889
7.761GlyVal: 7.761 ± 1.381
2.388GlyTrp: 2.388 ± 1.684
1.194GlyTyr: 1.194 ± 0.842
0.0GlyXaa: 0.0 ± 0.0
His
2.388HisAla: 2.388 ± 0.047
0.0HisCys: 0.0 ± 0.0
2.388HisAsp: 2.388 ± 0.772
1.791HisGlu: 1.791 ± 0.374
0.0HisPhe: 0.0 ± 0.0
1.194HisGly: 1.194 ± 0.023
1.194HisHis: 1.194 ± 0.795
2.388HisIle: 2.388 ± 0.866
0.597HisLys: 0.597 ± 0.398
1.791HisLeu: 1.791 ± 0.445
0.0HisMet: 0.0 ± 0.0
0.597HisAsn: 0.597 ± 0.421
1.194HisPro: 1.194 ± 0.023
1.194HisGln: 1.194 ± 0.023
1.194HisArg: 1.194 ± 0.023
2.985HisSer: 2.985 ± 1.169
0.597HisThr: 0.597 ± 0.398
1.194HisVal: 1.194 ± 0.795
1.194HisTrp: 1.194 ± 0.023
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.179IleAla: 4.179 ± 1.146
0.0IleCys: 0.0 ± 0.0
4.179IleAsp: 4.179 ± 0.491
2.985IleGlu: 2.985 ± 0.468
0.0IlePhe: 0.0 ± 0.0
2.388IleGly: 2.388 ± 0.047
1.791IleHis: 1.791 ± 0.374
1.194IleIle: 1.194 ± 0.023
3.582IleLys: 3.582 ± 0.889
3.582IleLeu: 3.582 ± 0.889
1.791IleMet: 1.791 ± 1.193
2.985IleAsn: 2.985 ± 0.351
5.373IlePro: 5.373 ± 0.515
1.194IleGln: 1.194 ± 0.023
0.0IleArg: 0.0 ± 0.0
4.179IleSer: 4.179 ± 0.327
1.791IleThr: 1.791 ± 0.374
2.388IleVal: 2.388 ± 1.591
1.791IleTrp: 1.791 ± 0.374
1.194IleTyr: 1.194 ± 0.842
0.0IleXaa: 0.0 ± 0.0
Lys
2.985LysAla: 2.985 ± 0.351
0.0LysCys: 0.0 ± 0.0
1.194LysAsp: 1.194 ± 0.842
2.985LysGlu: 2.985 ± 0.351
2.388LysPhe: 2.388 ± 0.047
1.194LysGly: 1.194 ± 0.842
1.791LysHis: 1.791 ± 0.374
1.791LysIle: 1.791 ± 1.263
1.791LysLys: 1.791 ± 1.263
4.179LysLeu: 4.179 ± 0.327
0.597LysMet: 0.597 ± 0.398
2.388LysAsn: 2.388 ± 0.866
1.791LysPro: 1.791 ± 0.445
1.194LysGln: 1.194 ± 0.842
2.388LysArg: 2.388 ± 0.866
3.582LysSer: 3.582 ± 0.07
3.582LysThr: 3.582 ± 0.889
1.791LysVal: 1.791 ± 1.193
2.388LysTrp: 2.388 ± 0.866
0.597LysTyr: 0.597 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
7.761LeuAla: 7.761 ± 0.562
1.791LeuCys: 1.791 ± 0.445
4.179LeuAsp: 4.179 ± 1.146
4.179LeuGlu: 4.179 ± 1.146
4.179LeuPhe: 4.179 ± 1.31
7.164LeuGly: 7.164 ± 0.678
2.985LeuHis: 2.985 ± 0.351
2.985LeuIle: 2.985 ± 0.351
2.985LeuLys: 2.985 ± 0.468
8.358LeuLeu: 8.358 ± 0.164
2.985LeuMet: 2.985 ± 0.468
4.179LeuAsn: 4.179 ± 0.327
2.985LeuPro: 2.985 ± 0.468
7.761LeuGln: 7.761 ± 1.076
7.164LeuArg: 7.164 ± 0.141
4.179LeuSer: 4.179 ± 0.491
5.373LeuThr: 5.373 ± 1.123
4.179LeuVal: 4.179 ± 0.327
0.597LeuTrp: 0.597 ± 0.421
4.179LeuTyr: 4.179 ± 0.491
0.0LeuXaa: 0.0 ± 0.0
Met
2.985MetAla: 2.985 ± 0.468
2.388MetCys: 2.388 ± 0.772
1.791MetAsp: 1.791 ± 0.445
0.597MetGlu: 0.597 ± 0.421
0.597MetPhe: 0.597 ± 0.398
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.194MetIle: 1.194 ± 0.023
1.791MetLys: 1.791 ± 0.445
4.179MetLeu: 4.179 ± 1.965
0.597MetMet: 0.597 ± 0.398
1.194MetAsn: 1.194 ± 0.795
3.582MetPro: 3.582 ± 1.567
1.791MetGln: 1.791 ± 0.374
1.791MetArg: 1.791 ± 1.193
3.582MetSer: 3.582 ± 1.708
0.597MetThr: 0.597 ± 0.421
3.582MetVal: 3.582 ± 0.07
0.0MetTrp: 0.0 ± 0.0
1.194MetTyr: 1.194 ± 0.842
0.0MetXaa: 0.0 ± 0.0
Asn
0.597AsnAla: 0.597 ± 0.398
0.597AsnCys: 0.597 ± 0.398
2.985AsnAsp: 2.985 ± 1.169
2.388AsnGlu: 2.388 ± 0.772
0.597AsnPhe: 0.597 ± 0.421
1.194AsnGly: 1.194 ± 0.023
0.597AsnHis: 0.597 ± 0.398
2.985AsnIle: 2.985 ± 0.351
1.791AsnLys: 1.791 ± 1.263
4.776AsnLeu: 4.776 ± 0.725
0.597AsnMet: 0.597 ± 0.421
2.388AsnAsn: 2.388 ± 0.772
2.985AsnPro: 2.985 ± 1.988
0.597AsnGln: 0.597 ± 0.421
1.791AsnArg: 1.791 ± 0.445
2.985AsnSer: 2.985 ± 0.351
1.791AsnThr: 1.791 ± 0.445
3.582AsnVal: 3.582 ± 0.07
2.388AsnTrp: 2.388 ± 0.772
1.791AsnTyr: 1.791 ± 1.193
0.0AsnXaa: 0.0 ± 0.0
Pro
4.776ProAla: 4.776 ± 2.362
1.194ProCys: 1.194 ± 0.023
4.776ProAsp: 4.776 ± 2.362
2.985ProGlu: 2.985 ± 0.468
1.194ProPhe: 1.194 ± 0.795
4.179ProGly: 4.179 ± 0.491
0.597ProHis: 0.597 ± 0.421
2.985ProIle: 2.985 ± 0.468
1.791ProLys: 1.791 ± 1.263
4.179ProLeu: 4.179 ± 1.965
1.791ProMet: 1.791 ± 0.445
2.985ProAsn: 2.985 ± 2.105
1.791ProPro: 1.791 ± 1.263
2.985ProGln: 2.985 ± 0.468
3.582ProArg: 3.582 ± 0.748
2.388ProSer: 2.388 ± 0.772
3.582ProThr: 3.582 ± 0.07
3.582ProVal: 3.582 ± 0.748
3.582ProTrp: 3.582 ± 0.889
1.791ProTyr: 1.791 ± 1.263
0.0ProXaa: 0.0 ± 0.0
Gln
4.776GlnAla: 4.776 ± 0.094
0.597GlnCys: 0.597 ± 0.398
1.791GlnAsp: 1.791 ± 0.445
1.194GlnGlu: 1.194 ± 0.023
1.194GlnPhe: 1.194 ± 0.842
3.582GlnGly: 3.582 ± 0.07
1.194GlnHis: 1.194 ± 0.023
2.985GlnIle: 2.985 ± 0.468
1.194GlnLys: 1.194 ± 0.795
4.179GlnLeu: 4.179 ± 0.491
1.194GlnMet: 1.194 ± 0.795
1.194GlnAsn: 1.194 ± 0.795
2.388GlnPro: 2.388 ± 0.866
0.597GlnGln: 0.597 ± 0.421
2.985GlnArg: 2.985 ± 0.468
2.985GlnSer: 2.985 ± 0.351
2.388GlnThr: 2.388 ± 0.772
3.582GlnVal: 3.582 ± 1.567
1.791GlnTrp: 1.791 ± 1.263
1.194GlnTyr: 1.194 ± 0.023
0.0GlnXaa: 0.0 ± 0.0
Arg
5.97ArgAla: 5.97 ± 0.936
1.791ArgCys: 1.791 ± 0.445
0.597ArgAsp: 0.597 ± 0.398
2.388ArgGlu: 2.388 ± 0.866
1.791ArgPhe: 1.791 ± 0.445
1.194ArgGly: 1.194 ± 0.842
1.194ArgHis: 1.194 ± 0.023
2.388ArgIle: 2.388 ± 1.591
0.597ArgLys: 0.597 ± 0.398
3.582ArgLeu: 3.582 ± 0.889
1.791ArgMet: 1.791 ± 0.374
1.194ArgAsn: 1.194 ± 0.795
5.373ArgPro: 5.373 ± 0.515
2.388ArgGln: 2.388 ± 0.866
2.388ArgArg: 2.388 ± 0.047
7.164ArgSer: 7.164 ± 3.416
4.776ArgThr: 4.776 ± 1.731
6.567ArgVal: 6.567 ± 0.538
3.582ArgTrp: 3.582 ± 0.889
2.388ArgTyr: 2.388 ± 0.772
0.0ArgXaa: 0.0 ± 0.0
Ser
5.97SerAla: 5.97 ± 2.573
1.194SerCys: 1.194 ± 0.842
3.582SerAsp: 3.582 ± 0.748
1.791SerGlu: 1.791 ± 0.374
3.582SerPhe: 3.582 ± 0.07
6.567SerGly: 6.567 ± 0.538
1.791SerHis: 1.791 ± 0.374
5.373SerIle: 5.373 ± 1.123
4.179SerLys: 4.179 ± 0.491
4.776SerLeu: 4.776 ± 0.094
3.582SerMet: 3.582 ± 0.748
4.179SerAsn: 4.179 ± 1.965
4.179SerPro: 4.179 ± 0.327
2.388SerGln: 2.388 ± 0.866
5.97SerArg: 5.97 ± 0.117
7.761SerSer: 7.761 ± 0.257
3.582SerThr: 3.582 ± 1.567
5.373SerVal: 5.373 ± 2.76
2.985SerTrp: 2.985 ± 0.351
3.582SerTyr: 3.582 ± 0.889
0.0SerXaa: 0.0 ± 0.0
Thr
5.373ThrAla: 5.373 ± 1.123
2.388ThrCys: 2.388 ± 0.047
2.388ThrAsp: 2.388 ± 0.866
1.194ThrGlu: 1.194 ± 0.023
0.597ThrPhe: 0.597 ± 0.398
4.179ThrGly: 4.179 ± 1.965
0.597ThrHis: 0.597 ± 0.421
0.597ThrIle: 0.597 ± 0.398
1.791ThrLys: 1.791 ± 0.374
2.388ThrLeu: 2.388 ± 0.866
2.388ThrMet: 2.388 ± 0.866
2.985ThrAsn: 2.985 ± 1.169
4.776ThrPro: 4.776 ± 0.094
1.194ThrGln: 1.194 ± 0.023
3.582ThrArg: 3.582 ± 0.889
7.164ThrSer: 7.164 ± 1.497
2.388ThrThr: 2.388 ± 0.047
2.388ThrVal: 2.388 ± 0.047
1.194ThrTrp: 1.194 ± 0.023
1.791ThrTyr: 1.791 ± 0.374
0.0ThrXaa: 0.0 ± 0.0
Val
5.97ValAla: 5.97 ± 3.158
0.597ValCys: 0.597 ± 0.421
2.388ValAsp: 2.388 ± 0.772
4.776ValGlu: 4.776 ± 0.913
2.388ValPhe: 2.388 ± 0.047
7.761ValGly: 7.761 ± 0.562
1.791ValHis: 1.791 ± 1.193
2.388ValIle: 2.388 ± 0.866
2.388ValLys: 2.388 ± 0.047
8.358ValLeu: 8.358 ± 0.654
2.985ValMet: 2.985 ± 0.468
1.791ValAsn: 1.791 ± 0.445
4.776ValPro: 4.776 ± 0.725
5.97ValGln: 5.97 ± 0.701
2.985ValArg: 2.985 ± 0.351
6.567ValSer: 6.567 ± 1.918
3.582ValThr: 3.582 ± 0.748
2.388ValVal: 2.388 ± 0.047
1.194ValTrp: 1.194 ± 0.795
1.791ValTyr: 1.791 ± 0.374
0.0ValXaa: 0.0 ± 0.0
Trp
4.776TrpAla: 4.776 ± 2.55
0.0TrpCys: 0.0 ± 0.0
1.194TrpAsp: 1.194 ± 0.023
1.194TrpGlu: 1.194 ± 0.023
1.791TrpPhe: 1.791 ± 1.263
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
2.388TrpIle: 2.388 ± 0.772
1.791TrpLys: 1.791 ± 0.374
6.567TrpLeu: 6.567 ± 0.538
1.791TrpMet: 1.791 ± 0.999
1.791TrpAsn: 1.791 ± 1.193
1.194TrpPro: 1.194 ± 0.023
1.791TrpGln: 1.791 ± 0.374
0.597TrpArg: 0.597 ± 0.421
2.388TrpSer: 2.388 ± 0.047
1.194TrpThr: 1.194 ± 0.842
6.567TrpVal: 6.567 ± 1.357
1.194TrpTrp: 1.194 ± 0.795
0.597TrpTyr: 0.597 ± 0.421
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.388TyrAla: 2.388 ± 0.047
0.597TyrCys: 0.597 ± 0.421
0.0TyrAsp: 0.0 ± 0.0
1.791TyrGlu: 1.791 ± 1.263
1.194TyrPhe: 1.194 ± 0.842
1.791TyrGly: 1.791 ± 0.445
1.194TyrHis: 1.194 ± 0.023
1.194TyrIle: 1.194 ± 0.023
1.194TyrLys: 1.194 ± 0.023
5.373TyrLeu: 5.373 ± 0.515
1.791TyrMet: 1.791 ± 1.193
0.597TyrAsn: 0.597 ± 0.398
0.597TyrPro: 0.597 ± 0.398
1.194TyrGln: 1.194 ± 0.842
2.388TyrArg: 2.388 ± 1.684
3.582TyrSer: 3.582 ± 0.748
1.194TyrThr: 1.194 ± 0.795
1.791TyrVal: 1.791 ± 0.445
1.791TyrTrp: 1.791 ± 1.263
0.597TyrTyr: 0.597 ± 0.398
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1676 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski