Amino acid dipepetide frequency for Black grass varicosavirus-like virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.161AlaAla: 3.161 ± 2.193
0.0AlaCys: 0.0 ± 0.0
2.371AlaAsp: 2.371 ± 1.644
4.346AlaGlu: 4.346 ± 0.748
2.371AlaPhe: 2.371 ± 0.169
3.161AlaGly: 3.161 ± 0.528
1.976AlaHis: 1.976 ± 0.897
3.161AlaIle: 3.161 ± 0.379
3.556AlaLys: 3.556 ± 1.107
4.741AlaLeu: 4.741 ± 1.475
3.556AlaMet: 3.556 ± 0.2
1.58AlaAsn: 1.58 ± 1.096
2.371AlaPro: 2.371 ± 0.169
0.79AlaGln: 0.79 ± 0.359
4.346AlaArg: 4.346 ± 2.561
3.556AlaSer: 3.556 ± 0.2
3.161AlaThr: 3.161 ± 0.528
2.371AlaVal: 2.371 ± 0.738
0.395AlaTrp: 0.395 ± 0.179
2.371AlaTyr: 2.371 ± 1.644
0.0AlaXaa: 0.0 ± 0.0
Cys
1.185CysAla: 1.185 ± 0.369
0.395CysCys: 0.395 ± 0.727
1.976CysAsp: 1.976 ± 0.917
0.395CysGlu: 0.395 ± 0.179
0.395CysPhe: 0.395 ± 0.179
0.0CysGly: 0.0 ± 0.0
1.185CysHis: 1.185 ± 0.538
1.58CysIle: 1.58 ± 0.19
1.58CysLys: 1.58 ± 0.19
2.766CysLeu: 2.766 ± 0.558
0.79CysMet: 0.79 ± 0.359
0.395CysAsn: 0.395 ± 0.179
0.79CysPro: 0.79 ± 0.359
1.185CysGln: 1.185 ± 0.538
1.976CysArg: 1.976 ± 0.01
0.79CysSer: 0.79 ± 0.359
0.395CysThr: 0.395 ± 0.179
0.79CysVal: 0.79 ± 0.359
0.79CysTrp: 0.79 ± 0.359
0.395CysTyr: 0.395 ± 0.727
0.0CysXaa: 0.0 ± 0.0
Asp
3.161AspAla: 3.161 ± 2.193
1.58AspCys: 1.58 ± 0.717
3.556AspAsp: 3.556 ± 1.614
5.927AspGlu: 5.927 ± 3.658
0.79AspPhe: 0.79 ± 0.359
3.951AspGly: 3.951 ± 1.834
0.79AspHis: 0.79 ± 0.359
7.112AspIle: 7.112 ± 0.399
3.556AspLys: 3.556 ± 1.614
1.976AspLeu: 1.976 ± 0.897
4.346AspMet: 4.346 ± 0.405
2.371AspAsn: 2.371 ± 1.644
4.741AspPro: 4.741 ± 0.569
1.976AspGln: 1.976 ± 0.897
1.58AspArg: 1.58 ± 0.19
3.556AspSer: 3.556 ± 0.2
3.556AspThr: 3.556 ± 0.2
2.371AspVal: 2.371 ± 0.169
0.395AspTrp: 0.395 ± 0.179
2.371AspTyr: 2.371 ± 0.169
0.0AspXaa: 0.0 ± 0.0
Glu
3.556GluAla: 3.556 ± 1.107
2.766GluCys: 2.766 ± 1.255
5.136GluAsp: 5.136 ± 2.203
7.112GluGlu: 7.112 ± 6.747
2.371GluPhe: 2.371 ± 0.738
3.161GluGly: 3.161 ± 0.528
1.58GluHis: 1.58 ± 0.717
4.346GluIle: 4.346 ± 0.159
3.161GluLys: 3.161 ± 0.379
7.902GluLeu: 7.902 ± 0.948
3.161GluMet: 3.161 ± 0.322
3.161GluAsn: 3.161 ± 1.286
1.976GluPro: 1.976 ± 0.917
2.766GluGln: 2.766 ± 3.279
3.951GluArg: 3.951 ± 0.927
2.766GluSer: 2.766 ± 0.348
2.766GluThr: 2.766 ± 1.255
4.741GluVal: 4.741 ± 1.245
0.79GluTrp: 0.79 ± 0.359
2.371GluTyr: 2.371 ± 0.169
0.0GluXaa: 0.0 ± 0.0
Phe
1.185PheAla: 1.185 ± 0.369
0.395PheCys: 0.395 ± 0.179
3.161PheAsp: 3.161 ± 0.528
2.766PheGlu: 2.766 ± 0.558
0.0PhePhe: 0.0 ± 0.0
1.185PheGly: 1.185 ± 0.538
0.395PheHis: 0.395 ± 0.179
3.951PheIle: 3.951 ± 1.793
1.976PheLys: 1.976 ± 0.01
4.346PheLeu: 4.346 ± 0.159
1.185PheMet: 1.185 ± 0.538
0.395PheAsn: 0.395 ± 0.179
2.766PhePro: 2.766 ± 0.558
2.371PheGln: 2.371 ± 1.076
1.976PheArg: 1.976 ± 0.897
2.766PheSer: 2.766 ± 1.465
1.185PheThr: 1.185 ± 0.369
2.766PheVal: 2.766 ± 0.348
0.79PheTrp: 0.79 ± 0.359
0.395PheTyr: 0.395 ± 0.727
0.0PheXaa: 0.0 ± 0.0
Gly
1.976GlyAla: 1.976 ± 0.917
0.79GlyCys: 0.79 ± 0.548
2.766GlyAsp: 2.766 ± 0.558
2.371GlyGlu: 2.371 ± 0.169
2.766GlyPhe: 2.766 ± 0.348
5.136GlyGly: 5.136 ± 0.389
2.371GlyHis: 2.371 ± 1.076
5.136GlyIle: 5.136 ± 1.296
3.556GlyLys: 3.556 ± 0.707
5.927GlyLeu: 5.927 ± 0.031
3.951GlyMet: 3.951 ± 0.927
3.951GlyAsn: 3.951 ± 1.834
0.79GlyPro: 0.79 ± 0.359
2.766GlyGln: 2.766 ± 0.348
4.346GlyArg: 4.346 ± 0.159
4.346GlySer: 4.346 ± 1.066
3.951GlyThr: 3.951 ± 0.886
4.741GlyVal: 4.741 ± 1.245
1.185GlyTrp: 1.185 ± 0.538
2.766GlyTyr: 2.766 ± 0.348
0.0GlyXaa: 0.0 ± 0.0
His
1.185HisAla: 1.185 ± 0.369
1.185HisCys: 1.185 ± 0.369
1.185HisAsp: 1.185 ± 0.538
1.58HisGlu: 1.58 ± 0.717
1.185HisPhe: 1.185 ± 0.369
0.395HisGly: 0.395 ± 0.179
1.185HisHis: 1.185 ± 0.538
2.371HisIle: 2.371 ± 1.076
1.58HisLys: 1.58 ± 0.717
2.766HisLeu: 2.766 ± 1.465
0.0HisMet: 0.0 ± 0.0
0.395HisAsn: 0.395 ± 0.179
1.58HisPro: 1.58 ± 0.717
0.0HisGln: 0.0 ± 0.0
1.976HisArg: 1.976 ± 0.01
0.79HisSer: 0.79 ± 0.359
0.395HisThr: 0.395 ± 0.727
1.58HisVal: 1.58 ± 0.717
0.0HisTrp: 0.0 ± 0.0
1.185HisTyr: 1.185 ± 0.538
0.0HisXaa: 0.0 ± 0.0
Ile
4.741IleAla: 4.741 ± 2.382
2.371IleCys: 2.371 ± 0.169
5.136IleAsp: 5.136 ± 0.389
5.927IleGlu: 5.927 ± 0.031
1.976IlePhe: 1.976 ± 0.897
7.112IleGly: 7.112 ± 1.414
0.395IleHis: 0.395 ± 0.179
2.371IleIle: 2.371 ± 1.076
8.692IleLys: 8.692 ± 0.589
7.112IleLeu: 7.112 ± 1.414
3.556IleMet: 3.556 ± 0.2
4.741IleAsn: 4.741 ± 2.152
3.556IlePro: 3.556 ± 0.2
3.161IleGln: 3.161 ± 0.528
0.79IleArg: 0.79 ± 0.548
3.556IleSer: 3.556 ± 1.614
2.371IleThr: 2.371 ± 1.076
3.161IleVal: 3.161 ± 0.528
0.79IleTrp: 0.79 ± 0.359
2.371IleTyr: 2.371 ± 0.169
0.0IleXaa: 0.0 ± 0.0
Lys
3.161LysAla: 3.161 ± 0.379
0.79LysCys: 0.79 ± 0.359
3.161LysAsp: 3.161 ± 0.379
4.346LysGlu: 4.346 ± 0.748
1.58LysPhe: 1.58 ± 0.717
6.717LysGly: 6.717 ± 2.392
1.185LysHis: 1.185 ± 0.369
4.741LysIle: 4.741 ± 1.245
5.136LysLys: 5.136 ± 3.11
4.741LysLeu: 4.741 ± 1.245
1.976LysMet: 1.976 ± 0.897
2.371LysAsn: 2.371 ± 0.738
2.371LysPro: 2.371 ± 1.076
0.79LysGln: 0.79 ± 0.359
2.766LysArg: 2.766 ± 1.255
2.371LysSer: 2.371 ± 0.738
4.741LysThr: 4.741 ± 0.338
5.531LysVal: 5.531 ± 1.117
1.58LysTrp: 1.58 ± 0.19
1.185LysTyr: 1.185 ± 0.369
0.0LysXaa: 0.0 ± 0.0
Leu
6.322LeuAla: 6.322 ± 1.055
1.976LeuCys: 1.976 ± 0.897
4.741LeuAsp: 4.741 ± 0.569
3.951LeuGlu: 3.951 ± 0.886
3.161LeuPhe: 3.161 ± 1.286
6.717LeuGly: 6.717 ± 0.579
1.58LeuHis: 1.58 ± 0.717
5.927LeuIle: 5.927 ± 0.031
3.556LeuLys: 3.556 ± 0.2
7.902LeuLeu: 7.902 ± 0.866
3.556LeuMet: 3.556 ± 1.614
3.161LeuAsn: 3.161 ± 0.528
5.136LeuPro: 5.136 ± 0.389
2.766LeuGln: 2.766 ± 1.255
5.136LeuArg: 5.136 ± 1.296
7.902LeuSer: 7.902 ± 0.041
6.717LeuThr: 6.717 ± 3.299
3.951LeuVal: 3.951 ± 0.927
0.79LeuTrp: 0.79 ± 0.548
3.951LeuTyr: 3.951 ± 0.02
0.0LeuXaa: 0.0 ± 0.0
Met
2.766MetAla: 2.766 ± 2.372
0.395MetCys: 0.395 ± 0.727
1.976MetAsp: 1.976 ± 0.897
3.161MetGlu: 3.161 ± 0.379
1.976MetPhe: 1.976 ± 0.897
3.951MetGly: 3.951 ± 0.02
1.58MetHis: 1.58 ± 1.096
3.161MetIle: 3.161 ± 0.528
1.185MetLys: 1.185 ± 0.538
3.951MetLeu: 3.951 ± 0.886
1.976MetMet: 1.976 ± 0.897
2.371MetAsn: 2.371 ± 0.169
2.371MetPro: 2.371 ± 0.169
0.0MetGln: 0.0 ± 0.0
3.951MetArg: 3.951 ± 0.02
2.371MetSer: 2.371 ± 1.076
1.58MetThr: 1.58 ± 0.19
1.185MetVal: 1.185 ± 0.538
0.395MetTrp: 0.395 ± 0.179
3.161MetTyr: 3.161 ± 0.528
0.0MetXaa: 0.0 ± 0.0
Asn
1.58AsnAla: 1.58 ± 2.003
1.185AsnCys: 1.185 ± 0.369
3.556AsnAsp: 3.556 ± 1.107
3.556AsnGlu: 3.556 ± 0.2
0.79AsnPhe: 0.79 ± 0.359
0.79AsnGly: 0.79 ± 0.359
0.395AsnHis: 0.395 ± 0.179
3.951AsnIle: 3.951 ± 0.02
1.58AsnLys: 1.58 ± 0.19
4.741AsnLeu: 4.741 ± 1.475
1.185AsnMet: 1.185 ± 0.538
0.79AsnAsn: 0.79 ± 0.359
1.58AsnPro: 1.58 ± 0.19
3.951AsnGln: 3.951 ± 0.02
2.371AsnArg: 2.371 ± 0.169
3.161AsnSer: 3.161 ± 0.528
1.58AsnThr: 1.58 ± 0.717
4.741AsnVal: 4.741 ± 1.245
0.79AsnTrp: 0.79 ± 0.359
1.58AsnTyr: 1.58 ± 0.19
0.0AsnXaa: 0.0 ± 0.0
Pro
1.58ProAla: 1.58 ± 0.717
0.79ProCys: 0.79 ± 0.548
5.531ProAsp: 5.531 ± 0.697
2.766ProGlu: 2.766 ± 2.372
1.58ProPhe: 1.58 ± 0.19
4.346ProGly: 4.346 ± 1.066
0.79ProHis: 0.79 ± 0.359
2.371ProIle: 2.371 ± 0.169
2.371ProLys: 2.371 ± 2.551
3.161ProLeu: 3.161 ± 0.528
1.58ProMet: 1.58 ± 0.19
1.185ProAsn: 1.185 ± 0.538
2.766ProPro: 2.766 ± 0.558
2.766ProGln: 2.766 ± 0.348
1.976ProArg: 1.976 ± 0.897
1.58ProSer: 1.58 ± 0.717
3.556ProThr: 3.556 ± 1.614
1.976ProVal: 1.976 ± 0.897
0.395ProTrp: 0.395 ± 0.179
1.976ProTyr: 1.976 ± 0.01
0.0ProXaa: 0.0 ± 0.0
Gln
1.185GlnAla: 1.185 ± 0.538
0.79GlnCys: 0.79 ± 0.359
1.58GlnAsp: 1.58 ± 0.717
2.766GlnGlu: 2.766 ± 1.465
1.976GlnPhe: 1.976 ± 0.897
2.766GlnGly: 2.766 ± 0.558
0.79GlnHis: 0.79 ± 0.548
3.951GlnIle: 3.951 ± 0.02
3.161GlnLys: 3.161 ± 1.434
1.976GlnLeu: 1.976 ± 0.01
0.395GlnMet: 0.395 ± 0.179
4.741GlnAsn: 4.741 ± 0.569
1.185GlnPro: 1.185 ± 0.538
1.185GlnGln: 1.185 ± 0.369
0.79GlnArg: 0.79 ± 0.359
0.79GlnSer: 0.79 ± 0.359
0.79GlnThr: 0.79 ± 0.548
1.976GlnVal: 1.976 ± 0.897
0.395GlnTrp: 0.395 ± 0.179
0.79GlnTyr: 0.79 ± 0.548
0.0GlnXaa: 0.0 ± 0.0
Arg
4.346ArgAla: 4.346 ± 0.159
1.185ArgCys: 1.185 ± 0.369
3.951ArgAsp: 3.951 ± 0.927
4.346ArgGlu: 4.346 ± 1.066
1.185ArgPhe: 1.185 ± 0.369
4.741ArgGly: 4.741 ± 1.245
0.79ArgHis: 0.79 ± 0.359
4.741ArgIle: 4.741 ± 0.569
4.741ArgLys: 4.741 ± 1.245
3.951ArgLeu: 3.951 ± 3.648
4.346ArgMet: 4.346 ± 0.748
2.766ArgAsn: 2.766 ± 0.558
0.0ArgPro: 0.0 ± 0.0
1.976ArgGln: 1.976 ± 0.01
5.136ArgArg: 5.136 ± 2.203
3.161ArgSer: 3.161 ± 1.434
3.951ArgThr: 3.951 ± 0.02
2.766ArgVal: 2.766 ± 0.348
1.185ArgTrp: 1.185 ± 0.538
1.185ArgTyr: 1.185 ± 0.538
0.0ArgXaa: 0.0 ± 0.0
Ser
3.161SerAla: 3.161 ± 0.379
0.0SerCys: 0.0 ± 0.0
1.58SerAsp: 1.58 ± 1.096
5.927SerGlu: 5.927 ± 0.937
3.556SerPhe: 3.556 ± 0.2
1.976SerGly: 1.976 ± 0.01
2.371SerHis: 2.371 ± 0.738
4.741SerIle: 4.741 ± 2.152
1.976SerLys: 1.976 ± 0.01
7.507SerLeu: 7.507 ± 1.593
0.79SerMet: 0.79 ± 0.359
2.371SerAsn: 2.371 ± 1.076
1.976SerPro: 1.976 ± 0.897
1.58SerGln: 1.58 ± 0.19
3.556SerArg: 3.556 ± 0.707
5.927SerSer: 5.927 ± 0.876
4.346SerThr: 4.346 ± 0.159
2.766SerVal: 2.766 ± 0.348
1.185SerTrp: 1.185 ± 0.538
1.58SerTyr: 1.58 ± 0.717
0.0SerXaa: 0.0 ± 0.0
Thr
1.185ThrAla: 1.185 ± 0.369
0.0ThrCys: 0.0 ± 0.0
4.346ThrAsp: 4.346 ± 0.748
2.766ThrGlu: 2.766 ± 0.348
2.766ThrPhe: 2.766 ± 0.348
3.556ThrGly: 3.556 ± 0.2
0.79ThrHis: 0.79 ± 0.359
4.346ThrIle: 4.346 ± 1.066
2.766ThrLys: 2.766 ± 1.465
3.951ThrLeu: 3.951 ± 0.02
2.371ThrMet: 2.371 ± 0.169
2.371ThrAsn: 2.371 ± 0.169
3.951ThrPro: 3.951 ± 0.02
1.58ThrGln: 1.58 ± 0.717
4.346ThrArg: 4.346 ± 0.748
3.556ThrSer: 3.556 ± 0.707
1.976ThrThr: 1.976 ± 0.917
3.556ThrVal: 3.556 ± 1.614
1.185ThrTrp: 1.185 ± 1.276
1.976ThrTyr: 1.976 ± 0.01
0.0ThrXaa: 0.0 ± 0.0
Val
3.951ValAla: 3.951 ± 1.793
2.766ValCys: 2.766 ± 0.558
0.79ValAsp: 0.79 ± 0.359
3.556ValGlu: 3.556 ± 1.614
3.161ValPhe: 3.161 ± 0.528
2.371ValGly: 2.371 ± 0.169
0.79ValHis: 0.79 ± 0.548
3.161ValIle: 3.161 ± 0.528
4.346ValLys: 4.346 ± 0.159
4.741ValLeu: 4.741 ± 0.338
2.371ValMet: 2.371 ± 1.076
2.766ValAsn: 2.766 ± 1.255
2.766ValPro: 2.766 ± 0.348
0.79ValGln: 0.79 ± 0.548
5.531ValArg: 5.531 ± 1.604
3.161ValSer: 3.161 ± 1.286
2.766ValThr: 2.766 ± 0.558
4.346ValVal: 4.346 ± 1.066
0.0ValTrp: 0.0 ± 0.0
4.346ValTyr: 4.346 ± 1.066
0.0ValXaa: 0.0 ± 0.0
Trp
1.185TrpAla: 1.185 ± 0.369
0.0TrpCys: 0.0 ± 0.0
0.395TrpAsp: 0.395 ± 0.179
0.0TrpGlu: 0.0 ± 0.0
0.395TrpPhe: 0.395 ± 0.179
1.976TrpGly: 1.976 ± 0.897
0.395TrpHis: 0.395 ± 0.727
0.395TrpIle: 0.395 ± 0.179
0.395TrpLys: 0.395 ± 0.179
1.58TrpLeu: 1.58 ± 0.717
0.79TrpMet: 0.79 ± 0.548
0.79TrpAsn: 0.79 ± 0.359
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.395TrpArg: 0.395 ± 0.179
0.79TrpSer: 0.79 ± 0.359
1.976TrpThr: 1.976 ± 0.01
0.79TrpVal: 0.79 ± 0.359
0.0TrpTrp: 0.0 ± 0.0
0.79TrpTyr: 0.79 ± 0.359
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.766TyrAla: 2.766 ± 0.348
0.395TyrCys: 0.395 ± 0.179
2.766TyrAsp: 2.766 ± 0.348
2.371TyrGlu: 2.371 ± 0.169
1.976TyrPhe: 1.976 ± 0.01
1.58TyrGly: 1.58 ± 0.19
1.185TyrHis: 1.185 ± 0.538
2.371TyrIle: 2.371 ± 0.169
2.371TyrLys: 2.371 ± 0.169
3.161TyrLeu: 3.161 ± 0.379
1.185TyrMet: 1.185 ± 0.369
1.185TyrAsn: 1.185 ± 0.369
2.371TyrPro: 2.371 ± 1.076
1.58TyrGln: 1.58 ± 0.19
3.556TyrArg: 3.556 ± 0.2
1.976TyrSer: 1.976 ± 0.01
1.58TyrThr: 1.58 ± 0.19
2.371TyrVal: 2.371 ± 0.738
0.0TyrTrp: 0.0 ± 0.0
2.371TyrTyr: 2.371 ± 0.738
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2532 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski