Amino acid dipeptide frequency for Wenling narna-like virus 7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
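The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function name `dipeptide_permille` and the toy sequences are hypothetical, and it assumes that overlapping pairs are counted within each protein and that results are normalized by the total number of pairs.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and frequencies are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not the real proteome.
toy = ["MAAL", "AAR"]
freqs = dipeptide_permille(toy)

# Expected frequency if all 400 standard dipeptides were equally likely.
uniform = 1000.0 / 400

print(freqs["AA"])  # 400.0 (2 of the 5 pairs are AA)
print(uniform)      # 2.5
```

In the toy input, AA occurs far more often than the uniform expectation of 2.5‰, which is the kind of deviation the red and blue marking is meant to highlight.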

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
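As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and the simple greater-than or less-than comparison is an assumption; the page only says that dipeptides present "more than expected" are marked.

```python
# 2.5 per mille: expectation if all 400 dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def representation(value_permille, expected=UNIFORM_PERMILLE):
    """Label a value relative to the expectation (assumed simple comparison)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# AlaAla (16.32) and AlaCys (1.484) are values from the table, in per mille.
ala_ala_fraction = permille_to_fraction(16.32)  # about 0.01632, i.e. 1.632%
print(representation(16.32))   # over
print(representation(1.484))   # under
```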

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 16.32 ± 4.393 | 1.484 ± 0.858 | 6.677 ± 1.184 | 5.193 ± 3.003 | 0.0 ± 0.0 | 2.226 ± 0.026 | 5.193 ± 0.781 | 5.193 ± 0.781 | 4.451 ± 0.051 | 9.644 ± 4.316 | 1.484 ± 0.858 | 0.742 ± 0.429 | 6.677 ± 0.077 | 2.967 ± 0.807 | 9.644 ± 1.793 | 2.967 ± 1.716 | 2.967 ± 0.455 | 5.935 ± 0.91 | 2.226 ± 1.287 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Cys | 0.742 ± 0.832 | 0.742 ± 0.832 | 0.742 ± 0.429 | 2.226 ± 2.497 | 0.0 ± 0.0 | 2.226 ± 1.287 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.967 ± 0.455 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.742 ± 0.832 | 3.709 ± 1.639 | 1.484 ± 1.665 | 0.742 ± 0.429 | 2.226 ± 1.287 | 1.484 ± 0.858 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 2.226 ± 2.497 | 0.742 ± 0.429 | 2.226 ± 0.026 | 0.0 ± 0.0 | 2.226 ± 1.287 | 2.967 ± 0.455 | 1.484 ± 0.858 | 0.742 ± 0.429 | 1.484 ± 0.403 | 8.902 ± 1.364 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.484 ± 1.665 | 1.484 ± 1.665 | 5.935 ± 2.875 | 3.709 ± 0.884 | 1.484 ± 0.858 | 4.451 ± 0.051 | 1.484 ± 0.858 | 1.484 ± 0.403 | 0.0 ± 0.0 |
| Glu | 8.16 ± 0.935 | 0.0 ± 0.0 | 2.967 ± 2.068 | 8.902 ± 0.103 | 0.0 ± 0.0 | 6.677 ± 0.077 | 0.742 ± 0.429 | 2.226 ± 0.026 | 1.484 ± 0.403 | 5.193 ± 2.042 | 1.484 ± 0.25 | 0.742 ± 0.429 | 0.0 ± 0.0 | 0.742 ± 0.832 | 3.709 ± 0.884 | 2.226 ± 1.236 | 5.935 ± 2.875 | 8.16 ± 0.326 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.0 ± 0.0 |
| Phe | 3.709 ± 2.145 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.484 ± 0.858 | 1.484 ± 0.858 | 2.226 ± 0.026 | 1.484 ± 0.403 | 0.0 ± 0.0 | 0.742 ± 0.429 | 4.451 ± 1.21 | 0.0 ± 0.0 | 0.0 ± 0.0 | 5.193 ± 0.48 | 2.226 ± 0.026 | 3.709 ± 0.378 | 3.709 ± 0.378 | 2.226 ± 1.236 | 0.742 ± 0.429 | 1.484 ± 0.403 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gly | 5.193 ± 1.742 | 0.742 ± 0.429 | 2.226 ± 1.287 | 4.451 ± 3.733 | 5.193 ± 1.742 | 5.935 ± 0.352 | 2.967 ± 2.068 | 2.226 ± 1.287 | 1.484 ± 0.858 | 8.16 ± 3.458 | 0.742 ± 0.429 | 4.451 ± 1.21 | 7.418 ± 3.278 | 2.226 ± 1.236 | 8.902 ± 4.942 | 5.193 ± 0.781 | 3.709 ± 0.884 | 2.967 ± 0.455 | 1.484 ± 0.403 | 2.226 ± 1.236 | 0.0 ± 0.0 |
| His | 4.451 ± 1.21 | 0.0 ± 0.0 | 0.742 ± 0.832 | 0.742 ± 0.832 | 2.226 ± 1.236 | 1.484 ± 1.665 | 1.484 ± 1.665 | 0.742 ± 0.429 | 0.742 ± 0.429 | 1.484 ± 0.858 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.709 ± 0.884 | 0.0 ± 0.0 | 4.451 ± 2.471 | 0.742 ± 0.832 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.742 ± 0.429 | 0.742 ± 0.429 | 0.0 ± 0.0 |
| Ile | 2.226 ± 1.287 | 1.484 ± 0.403 | 1.484 ± 0.403 | 0.0 ± 0.0 | 0.742 ± 0.429 | 1.484 ± 0.858 | 2.226 ± 1.236 | 0.0 ± 0.0 | 0.742 ± 0.832 | 2.967 ± 0.455 | 0.742 ± 0.429 | 0.0 ± 0.0 | 1.484 ± 1.665 | 2.226 ± 0.026 | 3.709 ± 0.884 | 0.742 ± 0.832 | 0.0 ± 0.0 | 2.226 ± 1.287 | 0.742 ± 0.429 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Lys | 2.226 ± 0.026 | 0.742 ± 0.429 | 3.709 ± 0.884 | 1.484 ± 0.403 | 3.709 ± 0.884 | 2.226 ± 1.236 | 0.742 ± 0.429 | 0.742 ± 0.832 | 0.0 ± 0.0 | 1.484 ± 0.858 | 0.0 ± 0.0 | 0.742 ± 0.832 | 3.709 ± 0.378 | 1.484 ± 0.403 | 4.451 ± 1.313 | 0.742 ± 0.429 | 2.226 ± 1.287 | 3.709 ± 0.378 | 1.484 ± 0.858 | 0.742 ± 0.832 | 0.0 ± 0.0 |
| Leu | 9.644 ± 0.532 | 2.226 ± 1.236 | 2.967 ± 0.455 | 3.709 ± 0.378 | 3.709 ± 0.884 | 9.644 ± 1.991 | 2.967 ± 0.807 | 1.484 ± 0.858 | 2.967 ± 0.807 | 10.386 ± 1.562 | 0.742 ± 0.429 | 3.709 ± 0.378 | 13.353 ± 1.416 | 2.226 ± 0.026 | 14.095 ± 4.367 | 9.644 ± 1.793 | 3.709 ± 0.884 | 5.935 ± 2.171 | 0.742 ± 0.429 | 1.484 ± 0.858 | 0.0 ± 0.0 |
| Met | 0.742 ± 0.429 | 0.0 ± 0.0 | 1.484 ± 0.858 | 0.0 ± 0.0 | 0.742 ± 0.832 | 0.742 ± 0.429 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.0 ± 0.0 | 1.484 ± 0.858 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.742 ± 0.429 | 0.742 ± 0.429 | 1.484 ± 0.403 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 2.226 ± 0.026 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.742 ± 0.832 | 0.0 ± 0.0 | 2.226 ± 1.287 | 0.0 ± 0.0 | 1.484 ± 0.858 | 0.742 ± 0.429 | 2.967 ± 0.455 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.226 ± 2.497 | 0.742 ± 0.429 | 0.742 ± 0.832 | 2.967 ± 0.455 | 0.742 ± 0.429 | 2.226 ± 1.236 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Pro | 5.193 ± 0.781 | 0.742 ± 0.832 | 5.935 ± 1.613 | 3.709 ± 1.639 | 3.709 ± 0.378 | 6.677 ± 3.707 | 0.742 ± 0.429 | 0.742 ± 0.832 | 4.451 ± 0.051 | 9.644 ± 1.991 | 0.0 ± 0.0 | 2.967 ± 0.807 | 8.902 ± 0.103 | 0.742 ± 0.429 | 13.353 ± 3.63 | 2.967 ± 1.716 | 2.967 ± 1.716 | 5.935 ± 2.171 | 0.742 ± 0.429 | 0.742 ± 0.429 | 0.0 ± 0.0 |
| Gln | 1.484 ± 0.403 | 0.0 ± 0.0 | 2.226 ± 0.026 | 1.484 ± 0.858 | 2.226 ± 1.236 | 5.935 ± 2.875 | 0.0 ± 0.0 | 0.742 ± 0.832 | 2.226 ± 1.236 | 0.742 ± 0.429 | 0.742 ± 0.429 | 0.742 ± 0.429 | 2.226 ± 1.236 | 0.0 ± 0.0 | 5.193 ± 3.304 | 0.0 ± 0.0 | 2.967 ± 0.807 | 2.226 ± 1.236 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Arg | 6.677 ± 3.861 | 5.935 ± 2.875 | 3.709 ± 0.378 | 11.869 ± 4.488 | 2.967 ± 0.455 | 9.644 ± 0.532 | 2.967 ± 0.807 | 2.226 ± 0.026 | 7.418 ± 3.029 | 9.644 ± 0.729 | 0.742 ± 0.429 | 1.484 ± 0.858 | 8.902 ± 1.158 | 4.451 ± 3.733 | 17.804 ± 3.578 | 8.16 ± 1.587 | 4.451 ± 1.313 | 8.16 ± 0.326 | 2.967 ± 0.455 | 2.967 ± 1.716 | 0.0 ± 0.0 |
| Ser | 5.193 ± 1.742 | 0.0 ± 0.0 | 1.484 ± 0.403 | 2.226 ± 1.236 | 1.484 ± 0.403 | 5.193 ± 0.781 | 0.742 ± 0.832 | 1.484 ± 0.403 | 2.226 ± 0.026 | 9.644 ± 0.532 | 0.0 ± 0.0 | 0.0 ± 0.0 | 5.193 ± 3.304 | 3.709 ± 2.9 | 7.418 ± 3.029 | 5.193 ± 0.48 | 2.967 ± 1.716 | 5.193 ± 0.48 | 2.967 ± 1.716 | 0.742 ± 0.429 | 0.0 ± 0.0 |
| Thr | 4.451 ± 2.574 | 0.742 ± 0.832 | 2.967 ± 0.807 | 5.193 ± 1.742 | 0.742 ± 0.429 | 3.709 ± 0.884 | 1.484 ± 0.403 | 2.226 ± 0.026 | 0.742 ± 0.429 | 4.451 ± 1.313 | 1.484 ± 0.858 | 0.742 ± 0.429 | 2.967 ± 0.455 | 0.742 ± 0.429 | 1.484 ± 0.858 | 1.484 ± 1.665 | 2.967 ± 0.455 | 3.709 ± 1.639 | 2.226 ± 1.287 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Val | 8.16 ± 2.197 | 3.709 ± 0.884 | 2.226 ± 1.287 | 3.709 ± 0.884 | 2.967 ± 2.068 | 4.451 ± 1.21 | 0.0 ± 0.0 | 0.742 ± 0.832 | 2.967 ± 0.807 | 9.644 ± 0.729 | 0.742 ± 0.657 | 3.709 ± 0.378 | 3.709 ± 2.145 | 2.967 ± 0.455 | 8.16 ± 3.458 | 5.193 ± 0.781 | 2.226 ± 0.026 | 2.967 ± 0.807 | 0.742 ± 0.429 | 0.742 ± 0.429 | 0.0 ± 0.0 |
| Trp | 2.226 ± 1.287 | 1.484 ± 0.858 | 0.742 ± 0.429 | 1.484 ± 0.858 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.742 ± 0.429 | 2.226 ± 1.287 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.484 ± 0.403 | 0.742 ± 0.429 | 3.709 ± 0.378 | 2.226 ± 1.287 | 1.484 ± 0.858 | 1.484 ± 0.858 | 0.742 ± 0.832 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 0.742 ± 0.832 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.484 ± 0.858 | 0.742 ± 0.429 | 1.484 ± 0.858 | 0.0 ± 0.0 | 1.484 ± 0.858 | 0.0 ± 0.0 | 0.742 ± 0.429 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.742 ± 0.832 | 2.226 ± 0.026 | 2.967 ± 0.455 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 2 proteins (1349 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
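A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the toy sequences, and the choice of standard deviation as the error statistic are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Dipeptide frequencies in per mille (pairs counted within each protein)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation over the resamples.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome of two short sequences.
toy = ["MAAL", "AAR"]
error_aa = bootstrap_error(toy, "AA")
print(f"AA error estimate: {error_aa:.3f} per mille")
```

With only two proteins, as in this proteome, each resample can take just a few distinct compositions, so the resulting error estimates are coarse.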

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski