Amino acid dipepetide frequency for Beihai tombus-like virus 17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.412AlaAla: 5.412 ± 1.766
1.203AlaCys: 1.203 ± 1.732
2.405AlaAsp: 2.405 ± 2.259
3.608AlaGlu: 3.608 ± 0.832
1.203AlaPhe: 1.203 ± 0.526
9.621AlaGly: 9.621 ± 6.623
1.203AlaHis: 1.203 ± 0.526
6.013AlaIle: 6.013 ± 1.426
6.013AlaLys: 6.013 ± 2.191
7.817AlaLeu: 7.817 ± 0.407
1.804AlaMet: 1.804 ± 1.393
1.804AlaAsn: 1.804 ± 1.393
3.608AlaPro: 3.608 ± 1.579
2.405AlaGln: 2.405 ± 1.053
3.608AlaArg: 3.608 ± 0.832
4.209AlaSer: 4.209 ± 1.24
4.811AlaThr: 4.811 ± 2.106
5.412AlaVal: 5.412 ± 0.646
0.601AlaTrp: 0.601 ± 0.34
3.007AlaTyr: 3.007 ± 0.493
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.601CysAsp: 0.601 ± 0.866
0.601CysGlu: 0.601 ± 0.34
0.601CysPhe: 0.601 ± 0.34
1.203CysGly: 1.203 ± 0.526
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.203CysAsn: 1.203 ± 0.679
1.804CysPro: 1.804 ± 1.393
0.601CysGln: 0.601 ± 0.34
1.804CysArg: 1.804 ± 0.187
2.405CysSer: 2.405 ± 1.359
0.601CysThr: 0.601 ± 0.34
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.601CysTyr: 0.601 ± 0.866
0.0CysXaa: 0.0 ± 0.0
Asp
4.811AspAla: 4.811 ± 0.9
0.601AspCys: 0.601 ± 0.34
4.811AspAsp: 4.811 ± 0.9
1.203AspGlu: 1.203 ± 0.679
3.608AspPhe: 3.608 ± 0.832
3.007AspGly: 3.007 ± 0.713
1.203AspHis: 1.203 ± 0.679
0.601AspIle: 0.601 ± 0.866
6.615AspLys: 6.615 ± 1.325
3.608AspLeu: 3.608 ± 2.038
1.203AspMet: 1.203 ± 0.679
6.013AspAsn: 6.013 ± 0.22
2.405AspPro: 2.405 ± 0.153
1.804AspGln: 1.804 ± 1.019
4.811AspArg: 4.811 ± 1.512
3.007AspSer: 3.007 ± 1.919
2.405AspThr: 2.405 ± 2.259
3.007AspVal: 3.007 ± 0.493
1.203AspTrp: 1.203 ± 0.526
3.007AspTyr: 3.007 ± 0.713
0.0AspXaa: 0.0 ± 0.0
Glu
6.013GluAla: 6.013 ± 0.985
1.203GluCys: 1.203 ± 0.679
3.007GluAsp: 3.007 ± 0.493
3.007GluGlu: 3.007 ± 0.493
2.405GluPhe: 2.405 ± 0.153
1.804GluGly: 1.804 ± 0.187
0.601GluHis: 0.601 ± 0.34
1.203GluIle: 1.203 ± 0.526
3.608GluLys: 3.608 ± 2.038
4.811GluLeu: 4.811 ± 0.9
3.007GluMet: 3.007 ± 1.553
3.608GluAsn: 3.608 ± 2.038
3.608GluPro: 3.608 ± 2.038
0.601GluGln: 0.601 ± 0.34
1.203GluArg: 1.203 ± 0.679
1.804GluSer: 1.804 ± 0.187
4.811GluThr: 4.811 ± 2.106
3.608GluVal: 3.608 ± 0.832
0.601GluTrp: 0.601 ± 0.34
1.804GluTyr: 1.804 ± 1.019
0.0GluXaa: 0.0 ± 0.0
Phe
3.007PheAla: 3.007 ± 0.493
0.601PheCys: 0.601 ± 0.34
3.608PheAsp: 3.608 ± 2.038
3.608PheGlu: 3.608 ± 1.579
5.412PhePhe: 5.412 ± 3.057
2.405PheGly: 2.405 ± 0.153
1.804PheHis: 1.804 ± 0.187
0.601PheIle: 0.601 ± 0.866
3.608PheLys: 3.608 ± 0.832
4.209PheLeu: 4.209 ± 1.172
0.601PheMet: 0.601 ± 0.34
3.007PheAsn: 3.007 ± 0.493
2.405PhePro: 2.405 ± 1.359
0.601PheGln: 0.601 ± 0.34
3.007PheArg: 3.007 ± 0.493
3.608PheSer: 3.608 ± 0.373
3.007PheThr: 3.007 ± 1.699
1.804PheVal: 1.804 ± 1.019
0.0PheTrp: 0.0 ± 0.0
1.203PheTyr: 1.203 ± 0.679
0.0PheXaa: 0.0 ± 0.0
Gly
3.608GlyAla: 3.608 ± 2.785
1.203GlyCys: 1.203 ± 1.732
7.216GlyAsp: 7.216 ± 0.747
1.203GlyGlu: 1.203 ± 0.526
4.209GlyPhe: 4.209 ± 1.172
7.817GlyGly: 7.817 ± 2.005
1.203GlyHis: 1.203 ± 0.679
3.608GlyIle: 3.608 ± 2.038
3.608GlyLys: 3.608 ± 2.038
6.013GlyLeu: 6.013 ± 1.426
3.007GlyMet: 3.007 ± 1.699
2.405GlyAsn: 2.405 ± 1.359
3.007GlyPro: 3.007 ± 1.699
3.608GlyGln: 3.608 ± 1.579
4.209GlyArg: 4.209 ± 1.172
5.412GlySer: 5.412 ± 0.56
3.007GlyThr: 3.007 ± 0.713
3.608GlyVal: 3.608 ± 1.579
0.0GlyTrp: 0.0 ± 0.0
4.209GlyTyr: 4.209 ± 1.24
0.0GlyXaa: 0.0 ± 0.0
His
1.804HisAla: 1.804 ± 0.187
0.0HisCys: 0.0 ± 0.0
0.601HisAsp: 0.601 ± 0.866
0.601HisGlu: 0.601 ± 0.34
0.601HisPhe: 0.601 ± 0.34
3.007HisGly: 3.007 ± 0.493
0.601HisHis: 0.601 ± 0.866
1.203HisIle: 1.203 ± 0.526
1.203HisLys: 1.203 ± 0.679
1.203HisLeu: 1.203 ± 0.526
0.601HisMet: 0.601 ± 0.34
0.601HisAsn: 0.601 ± 0.866
0.601HisPro: 0.601 ± 0.34
1.804HisGln: 1.804 ± 1.393
2.405HisArg: 2.405 ± 0.153
1.804HisSer: 1.804 ± 1.019
1.203HisThr: 1.203 ± 0.679
1.804HisVal: 1.804 ± 1.019
0.601HisTrp: 0.601 ± 0.34
0.601HisTyr: 0.601 ± 0.866
0.0HisXaa: 0.0 ± 0.0
Ile
4.209IleAla: 4.209 ± 0.034
0.601IleCys: 0.601 ± 0.34
1.203IleAsp: 1.203 ± 0.679
2.405IleGlu: 2.405 ± 2.259
2.405IlePhe: 2.405 ± 1.359
2.405IleGly: 2.405 ± 1.053
1.203IleHis: 1.203 ± 0.679
0.601IleIle: 0.601 ± 0.866
4.209IleLys: 4.209 ± 0.034
3.608IleLeu: 3.608 ± 2.038
0.601IleMet: 0.601 ± 0.34
1.804IleAsn: 1.804 ± 1.019
0.601IlePro: 0.601 ± 0.34
1.203IleGln: 1.203 ± 0.526
6.013IleArg: 6.013 ± 0.22
4.209IleSer: 4.209 ± 0.034
4.811IleThr: 4.811 ± 2.106
3.608IleVal: 3.608 ± 0.832
0.601IleTrp: 0.601 ± 0.34
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
7.216LysAla: 7.216 ± 3.159
1.804LysCys: 1.804 ± 1.019
1.804LysAsp: 1.804 ± 1.019
9.02LysGlu: 9.02 ± 5.096
3.007LysPhe: 3.007 ± 0.493
3.608LysGly: 3.608 ± 0.832
2.405LysHis: 2.405 ± 1.359
2.405LysIle: 2.405 ± 1.359
4.811LysLys: 4.811 ± 0.306
6.013LysLeu: 6.013 ± 2.191
0.601LysMet: 0.601 ± 0.212
2.405LysAsn: 2.405 ± 1.359
1.804LysPro: 1.804 ± 1.019
3.608LysGln: 3.608 ± 1.579
5.412LysArg: 5.412 ± 3.057
3.007LysSer: 3.007 ± 0.713
3.608LysThr: 3.608 ± 0.832
4.811LysVal: 4.811 ± 2.106
0.0LysTrp: 0.0 ± 0.0
0.601LysTyr: 0.601 ± 0.34
0.0LysXaa: 0.0 ± 0.0
Leu
4.209LeuAla: 4.209 ± 1.24
1.203LeuCys: 1.203 ± 0.526
6.615LeuAsp: 6.615 ± 0.119
6.013LeuGlu: 6.013 ± 3.397
3.608LeuPhe: 3.608 ± 2.038
8.419LeuGly: 8.419 ± 2.344
0.0LeuHis: 0.0 ± 0.0
2.405LeuIle: 2.405 ± 1.359
4.209LeuLys: 4.209 ± 1.172
4.209LeuLeu: 4.209 ± 1.172
0.601LeuMet: 0.601 ± 0.34
3.007LeuAsn: 3.007 ± 1.919
6.013LeuPro: 6.013 ± 0.22
1.203LeuGln: 1.203 ± 0.679
3.608LeuArg: 3.608 ± 0.373
9.621LeuSer: 9.621 ± 0.612
7.817LeuThr: 7.817 ± 0.407
5.412LeuVal: 5.412 ± 0.56
1.203LeuTrp: 1.203 ± 0.679
3.608LeuTyr: 3.608 ± 1.579
0.0LeuXaa: 0.0 ± 0.0
Met
4.209MetAla: 4.209 ± 1.24
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.601MetGlu: 0.601 ± 0.34
1.203MetPhe: 1.203 ± 0.679
1.203MetGly: 1.203 ± 0.679
0.0MetHis: 0.0 ± 0.0
1.804MetIle: 1.804 ± 0.187
1.203MetLys: 1.203 ± 0.526
2.405MetLeu: 2.405 ± 0.153
2.405MetMet: 2.405 ± 0.153
0.601MetAsn: 0.601 ± 0.866
0.601MetPro: 0.601 ± 0.866
0.0MetGln: 0.0 ± 0.0
0.601MetArg: 0.601 ± 0.34
1.203MetSer: 1.203 ± 0.679
1.804MetThr: 1.804 ± 1.019
1.203MetVal: 1.203 ± 0.679
0.0MetTrp: 0.0 ± 0.0
1.203MetTyr: 1.203 ± 0.526
0.0MetXaa: 0.0 ± 0.0
Asn
3.007AsnAla: 3.007 ± 0.493
0.601AsnCys: 0.601 ± 0.34
4.811AsnAsp: 4.811 ± 3.312
3.007AsnGlu: 3.007 ± 0.493
0.601AsnPhe: 0.601 ± 0.866
1.203AsnGly: 1.203 ± 0.526
0.601AsnHis: 0.601 ± 0.866
3.007AsnIle: 3.007 ± 0.493
4.209AsnLys: 4.209 ± 1.172
4.811AsnLeu: 4.811 ± 0.306
0.0AsnMet: 0.0 ± 0.0
0.601AsnAsn: 0.601 ± 0.866
0.601AsnPro: 0.601 ± 0.866
1.804AsnGln: 1.804 ± 1.393
2.405AsnArg: 2.405 ± 1.359
2.405AsnSer: 2.405 ± 1.359
6.013AsnThr: 6.013 ± 0.985
3.608AsnVal: 3.608 ± 0.373
1.804AsnTrp: 1.804 ± 0.187
0.601AsnTyr: 0.601 ± 0.34
0.0AsnXaa: 0.0 ± 0.0
Pro
3.007ProAla: 3.007 ± 1.919
0.0ProCys: 0.0 ± 0.0
2.405ProAsp: 2.405 ± 1.053
3.007ProGlu: 3.007 ± 0.493
2.405ProPhe: 2.405 ± 2.259
3.608ProGly: 3.608 ± 0.373
1.804ProHis: 1.804 ± 1.019
2.405ProIle: 2.405 ± 1.053
3.007ProLys: 3.007 ± 0.493
4.811ProLeu: 4.811 ± 0.306
1.203ProMet: 1.203 ± 0.679
1.203ProAsn: 1.203 ± 1.732
3.007ProPro: 3.007 ± 3.125
1.203ProGln: 1.203 ± 0.679
3.007ProArg: 3.007 ± 0.713
1.804ProSer: 1.804 ± 0.187
2.405ProThr: 2.405 ± 2.259
4.811ProVal: 4.811 ± 1.512
0.601ProTrp: 0.601 ± 0.34
1.804ProTyr: 1.804 ± 1.019
0.0ProXaa: 0.0 ± 0.0
Gln
4.209GlnAla: 4.209 ± 2.445
0.601GlnCys: 0.601 ± 0.34
0.601GlnAsp: 0.601 ± 0.34
0.601GlnGlu: 0.601 ± 0.866
0.601GlnPhe: 0.601 ± 0.34
3.608GlnGly: 3.608 ± 0.373
1.203GlnHis: 1.203 ± 0.526
4.209GlnIle: 4.209 ± 1.24
3.608GlnLys: 3.608 ± 0.832
1.804GlnLeu: 1.804 ± 1.393
0.601GlnMet: 0.601 ± 0.866
2.405GlnAsn: 2.405 ± 0.153
2.405GlnPro: 2.405 ± 3.465
0.601GlnGln: 0.601 ± 0.34
1.804GlnArg: 1.804 ± 0.187
0.601GlnSer: 0.601 ± 0.34
0.0GlnThr: 0.0 ± 0.0
1.804GlnVal: 1.804 ± 0.187
0.0GlnTrp: 0.0 ± 0.0
0.601GlnTyr: 0.601 ± 0.34
0.0GlnXaa: 0.0 ± 0.0
Arg
3.608ArgAla: 3.608 ± 0.832
0.0ArgCys: 0.0 ± 0.0
3.007ArgAsp: 3.007 ± 0.493
5.412ArgGlu: 5.412 ± 0.646
2.405ArgPhe: 2.405 ± 1.359
4.811ArgGly: 4.811 ± 2.718
1.203ArgHis: 1.203 ± 0.679
3.608ArgIle: 3.608 ± 2.038
6.013ArgLys: 6.013 ± 0.22
4.811ArgLeu: 4.811 ± 1.512
1.203ArgMet: 1.203 ± 0.526
4.811ArgAsn: 4.811 ± 0.9
3.608ArgPro: 3.608 ± 0.373
2.405ArgGln: 2.405 ± 0.153
2.405ArgArg: 2.405 ± 1.359
3.608ArgSer: 3.608 ± 1.579
2.405ArgThr: 2.405 ± 2.259
1.804ArgVal: 1.804 ± 1.019
0.601ArgTrp: 0.601 ± 0.34
1.203ArgTyr: 1.203 ± 0.679
0.0ArgXaa: 0.0 ± 0.0
Ser
2.405SerAla: 2.405 ± 2.259
0.0SerCys: 0.0 ± 0.0
4.209SerAsp: 4.209 ± 2.378
3.608SerGlu: 3.608 ± 0.832
3.007SerPhe: 3.007 ± 0.493
4.811SerGly: 4.811 ± 0.9
0.601SerHis: 0.601 ± 0.34
5.412SerIle: 5.412 ± 1.852
4.209SerLys: 4.209 ± 0.034
6.615SerLeu: 6.615 ± 0.119
1.203SerMet: 1.203 ± 0.526
3.608SerAsn: 3.608 ± 0.373
2.405SerPro: 2.405 ± 2.259
1.804SerGln: 1.804 ± 1.393
4.209SerArg: 4.209 ± 1.24
4.209SerSer: 4.209 ± 1.24
4.209SerThr: 4.209 ± 2.445
7.216SerVal: 7.216 ± 0.747
1.203SerTrp: 1.203 ± 0.679
3.608SerTyr: 3.608 ± 0.373
0.0SerXaa: 0.0 ± 0.0
Thr
6.013ThrAla: 6.013 ± 0.22
1.203ThrCys: 1.203 ± 0.679
3.007ThrAsp: 3.007 ± 0.493
1.804ThrGlu: 1.804 ± 0.187
4.811ThrPhe: 4.811 ± 1.512
3.608ThrGly: 3.608 ± 0.832
1.804ThrHis: 1.804 ± 2.598
2.405ThrIle: 2.405 ± 1.053
4.209ThrLys: 4.209 ± 1.24
7.817ThrLeu: 7.817 ± 1.613
0.601ThrMet: 0.601 ± 0.866
1.804ThrAsn: 1.804 ± 0.187
3.007ThrPro: 3.007 ± 0.493
1.804ThrGln: 1.804 ± 1.393
2.405ThrArg: 2.405 ± 2.259
4.209ThrSer: 4.209 ± 2.445
4.209ThrThr: 4.209 ± 2.445
9.02ThrVal: 9.02 ± 1.478
2.405ThrTrp: 2.405 ± 0.153
1.203ThrTyr: 1.203 ± 0.526
0.0ThrXaa: 0.0 ± 0.0
Val
7.216ValAla: 7.216 ± 4.364
0.601ValCys: 0.601 ± 0.34
6.615ValAsp: 6.615 ± 2.531
1.804ValGlu: 1.804 ± 1.019
2.405ValPhe: 2.405 ± 1.359
3.608ValGly: 3.608 ± 0.832
3.007ValHis: 3.007 ± 0.493
3.608ValIle: 3.608 ± 0.373
3.007ValLys: 3.007 ± 1.699
3.608ValLeu: 3.608 ± 2.038
1.804ValMet: 1.804 ± 0.187
1.804ValAsn: 1.804 ± 0.187
4.209ValPro: 4.209 ± 1.24
2.405ValGln: 2.405 ± 1.053
3.007ValArg: 3.007 ± 1.699
6.615ValSer: 6.615 ± 0.119
5.412ValThr: 5.412 ± 0.56
4.209ValVal: 4.209 ± 0.034
1.804ValTrp: 1.804 ± 1.019
4.209ValTyr: 4.209 ± 1.24
0.0ValXaa: 0.0 ± 0.0
Trp
0.601TrpAla: 0.601 ± 0.34
0.0TrpCys: 0.0 ± 0.0
1.203TrpAsp: 1.203 ± 0.679
0.0TrpGlu: 0.0 ± 0.0
2.405TrpPhe: 2.405 ± 0.153
0.601TrpGly: 0.601 ± 0.34
1.203TrpHis: 1.203 ± 0.679
1.203TrpIle: 1.203 ± 0.679
0.0TrpLys: 0.0 ± 0.0
1.203TrpLeu: 1.203 ± 0.526
0.0TrpMet: 0.0 ± 0.0
0.601TrpAsn: 0.601 ± 0.34
0.601TrpPro: 0.601 ± 0.34
0.601TrpGln: 0.601 ± 0.34
0.601TrpArg: 0.601 ± 0.34
0.601TrpSer: 0.601 ± 0.866
1.804TrpThr: 1.804 ± 1.019
0.601TrpVal: 0.601 ± 0.34
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.203TyrAla: 1.203 ± 0.679
0.601TyrCys: 0.601 ± 0.866
1.203TyrAsp: 1.203 ± 0.526
1.203TyrGlu: 1.203 ± 0.679
1.203TyrPhe: 1.203 ± 0.526
1.804TyrGly: 1.804 ± 1.019
1.203TyrHis: 1.203 ± 1.732
0.0TyrIle: 0.0 ± 0.0
1.203TyrLys: 1.203 ± 0.526
3.608TyrLeu: 3.608 ± 0.832
0.601TyrMet: 0.601 ± 0.866
2.405TyrAsn: 2.405 ± 0.153
1.203TyrPro: 1.203 ± 0.526
1.804TyrGln: 1.804 ± 1.393
2.405TyrArg: 2.405 ± 0.153
4.209TyrSer: 4.209 ± 1.24
3.007TyrThr: 3.007 ± 1.699
3.608TyrVal: 3.608 ± 0.373
0.601TyrTrp: 0.601 ± 0.34
0.601TyrTyr: 0.601 ± 0.866
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1664 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski