Amino acid dipeptide frequency for Beihai narna-like virus 23

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
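The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and counting overlapping pairs within each protein (never across protein boundaries) is an assumption, since the page does not state how the database counts dipeptides.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides, in per mille (1/400 = 2.5).
EXPECTED_PERMILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each sequence only.
    """
    counts = Counter()
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency as over- or underrepresented relative to the expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

For example, `classify(15.106)` labels the LeuSer value from the table as overrepresented, while `classify(1.007)` labels AlaCys as underrepresented.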

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
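As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction or a percentage and relates it to the uniform expectation of 2.5‰. The helper names are hypothetical and not part of the database.

```python
UNIFORM_PERMILLE = 2.5  # 1/400 expressed in per mille


def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


def fold_over_uniform(value_permille):
    """Ratio of an observed frequency to the uniform expectation."""
    return value_permille / UNIFORM_PERMILLE


# LeuSer from the table: 15.106 per mille
leu_ser = 15.106
print(permille_to_fraction(leu_ser), permille_to_percent(leu_ser), fold_over_uniform(leu_ser))
```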

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.021 ± 0.588
AlaCys: 1.007 ± 0.476
AlaAsp: 6.042 ± 3.192
AlaGlu: 3.021 ± 1.428
AlaPhe: 3.021 ± 0.588
AlaGly: 4.028 ± 1.904
AlaHis: 4.028 ± 0.112
AlaIle: 2.014 ± 0.952
AlaLys: 2.014 ± 0.952
AlaLeu: 11.078 ± 1.205
AlaMet: 1.007 ± 0.476
AlaAsn: 2.014 ± 0.952
AlaPro: 4.028 ± 0.112
AlaGln: 2.014 ± 0.952
AlaArg: 7.049 ± 0.699
AlaSer: 5.035 ± 0.364
AlaThr: 2.014 ± 0.952
AlaVal: 5.035 ± 0.364
AlaTrp: 1.007 ± 0.476
AlaTyr: 2.014 ± 0.952
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.021 ± 0.588
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.007 ± 0.476
CysPhe: 0.0 ± 0.0
CysGly: 1.007 ± 0.476
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 2.014 ± 1.064
CysLeu: 2.014 ± 0.952
CysMet: 0.0 ± 0.0
CysAsn: 1.007 ± 0.476
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 2.014 ± 0.952
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.007 ± 0.476
CysTrp: 0.0 ± 0.0
CysTyr: 1.007 ± 0.476
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.021 ± 1.428
AspCys: 0.0 ± 0.0
AspAsp: 4.028 ± 2.128
AspGlu: 2.014 ± 0.952
AspPhe: 3.021 ± 2.604
AspGly: 4.028 ± 2.128
AspHis: 1.007 ± 0.476
AspIle: 3.021 ± 1.428
AspLys: 1.007 ± 1.54
AspLeu: 11.078 ± 0.811
AspMet: 1.007 ± 0.476
AspAsn: 1.007 ± 0.476
AspPro: 8.056 ± 4.256
AspGln: 0.0 ± 0.0
AspArg: 6.042 ± 0.841
AspSer: 4.028 ± 2.128
AspThr: 2.014 ± 1.064
AspVal: 6.042 ± 1.176
AspTrp: 0.0 ± 0.0
AspTyr: 5.035 ± 1.652
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.021 ± 0.588
GluCys: 2.014 ± 0.952
GluAsp: 3.021 ± 0.588
GluGlu: 3.021 ± 0.588
GluPhe: 4.028 ± 0.112
GluGly: 3.021 ± 0.588
GluHis: 1.007 ± 1.54
GluIle: 3.021 ± 1.428
GluLys: 4.028 ± 1.904
GluLeu: 2.014 ± 1.064
GluMet: 1.007 ± 0.476
GluAsn: 1.007 ± 0.476
GluPro: 3.021 ± 1.428
GluGln: 0.0 ± 0.0
GluArg: 7.049 ± 2.716
GluSer: 1.007 ± 0.476
GluThr: 6.042 ± 1.176
GluVal: 4.028 ± 1.904
GluTrp: 0.0 ± 0.0
GluTyr: 1.007 ± 1.54
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.035 ± 0.364
PheCys: 0.0 ± 0.0
PheAsp: 2.014 ± 0.952
PheGlu: 1.007 ± 0.476
PhePhe: 1.007 ± 0.476
PheGly: 4.028 ± 2.128
PheHis: 2.014 ± 0.952
PheIle: 5.035 ± 0.364
PheLys: 2.014 ± 0.952
PheLeu: 6.042 ± 3.192
PheMet: 0.0 ± 0.43
PheAsn: 0.0 ± 0.0
PhePro: 3.021 ± 1.428
PheGln: 3.021 ± 1.428
PheArg: 5.035 ± 1.652
PheSer: 0.0 ± 0.0
PheThr: 4.028 ± 4.144
PheVal: 3.021 ± 2.604
PheTrp: 0.0 ± 0.0
PheTyr: 4.028 ± 0.112
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.014 ± 0.952
GlyCys: 0.0 ± 0.0
GlyAsp: 5.035 ± 0.364
GlyGlu: 1.007 ± 0.476
GlyPhe: 8.056 ± 0.223
GlyGly: 3.021 ± 0.588
GlyHis: 2.014 ± 0.952
GlyIle: 4.028 ± 1.904
GlyLys: 3.021 ± 1.428
GlyLeu: 8.056 ± 1.793
GlyMet: 1.007 ± 0.476
GlyAsn: 0.0 ± 0.0
GlyPro: 3.021 ± 1.428
GlyGln: 3.021 ± 0.588
GlyArg: 3.021 ± 1.428
GlySer: 1.007 ± 1.54
GlyThr: 2.014 ± 0.952
GlyVal: 4.028 ± 4.144
GlyTrp: 1.007 ± 0.476
GlyTyr: 4.028 ± 4.144
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.021 ± 0.588
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.007 ± 0.476
HisPhe: 3.021 ± 1.428
HisGly: 4.028 ± 1.904
HisHis: 0.0 ± 0.0
HisIle: 4.028 ± 2.128
HisLys: 2.014 ± 0.952
HisLeu: 1.007 ± 0.476
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 2.014 ± 0.952
HisGln: 1.007 ± 0.476
HisArg: 1.007 ± 1.54
HisSer: 3.021 ± 0.588
HisThr: 1.007 ± 0.476
HisVal: 1.007 ± 1.54
HisTrp: 2.014 ± 1.064
HisTyr: 1.007 ± 0.476
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.028 ± 0.112
IleCys: 1.007 ± 0.476
IleAsp: 2.014 ± 1.064
IleGlu: 3.021 ± 0.588
IlePhe: 1.007 ± 0.476
IleGly: 1.007 ± 0.476
IleHis: 3.021 ± 0.588
IleIle: 3.021 ± 1.428
IleLys: 3.021 ± 1.428
IleLeu: 7.049 ± 0.699
IleMet: 0.0 ± 0.0
IleAsn: 2.014 ± 0.952
IlePro: 5.035 ± 1.652
IleGln: 3.021 ± 0.588
IleArg: 4.028 ± 0.112
IleSer: 7.049 ± 3.333
IleThr: 0.0 ± 0.0
IleVal: 1.007 ± 0.476
IleTrp: 0.0 ± 0.0
IleTyr: 1.007 ± 1.54
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.028 ± 1.904
LysCys: 0.0 ± 0.0
LysAsp: 2.014 ± 1.064
LysGlu: 2.014 ± 0.952
LysPhe: 2.014 ± 0.952
LysGly: 1.007 ± 0.476
LysHis: 0.0 ± 0.0
LysIle: 3.021 ± 0.588
LysLys: 1.007 ± 0.476
LysLeu: 4.028 ± 1.904
LysMet: 0.0 ± 0.0
LysAsn: 0.0 ± 0.0
LysPro: 0.0 ± 0.0
LysGln: 1.007 ± 0.476
LysArg: 4.028 ± 0.112
LysSer: 4.028 ± 0.112
LysThr: 2.014 ± 0.952
LysVal: 9.063 ± 2.269
LysTrp: 1.007 ± 0.476
LysTyr: 1.007 ± 0.476
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.07 ± 0.729
LeuCys: 2.014 ± 0.952
LeuAsp: 4.028 ± 1.904
LeuGlu: 10.07 ± 0.729
LeuPhe: 4.028 ± 0.112
LeuGly: 4.028 ± 1.904
LeuHis: 3.021 ± 2.604
LeuIle: 2.014 ± 0.952
LeuLys: 3.021 ± 1.428
LeuLeu: 12.085 ± 3.697
LeuMet: 2.014 ± 0.952
LeuAsn: 3.021 ± 0.588
LeuPro: 6.042 ± 0.841
LeuGln: 3.021 ± 1.428
LeuArg: 6.042 ± 1.176
LeuSer: 15.106 ± 0.923
LeuThr: 8.056 ± 1.793
LeuVal: 7.049 ± 2.716
LeuTrp: 1.007 ± 0.476
LeuTyr: 3.021 ± 1.428
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.007 ± 0.476
MetCys: 0.0 ± 0.0
MetAsp: 2.014 ± 1.064
MetGlu: 1.007 ± 0.476
MetPhe: 0.0 ± 0.0
MetGly: 2.014 ± 1.064
MetHis: 0.0 ± 0.0
MetIle: 1.007 ± 0.476
MetLys: 2.014 ± 0.952
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 1.007 ± 1.54
MetPro: 1.007 ± 0.476
MetGln: 0.0 ± 0.0
MetArg: 3.021 ± 0.588
MetSer: 1.007 ± 0.476
MetThr: 2.014 ± 0.952
MetVal: 3.021 ± 0.588
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.014 ± 1.064
AsnCys: 1.007 ± 0.476
AsnAsp: 1.007 ± 0.476
AsnGlu: 3.021 ± 2.604
AsnPhe: 2.014 ± 1.064
AsnGly: 2.014 ± 0.952
AsnHis: 0.0 ± 0.0
AsnIle: 2.014 ± 1.064
AsnLys: 1.007 ± 0.476
AsnLeu: 1.007 ± 0.476
AsnMet: 1.007 ± 1.54
AsnAsn: 2.014 ± 0.952
AsnPro: 0.0 ± 0.0
AsnGln: 0.0 ± 0.0
AsnArg: 1.007 ± 0.476
AsnSer: 1.007 ± 0.476
AsnThr: 0.0 ± 0.0
AsnVal: 0.0 ± 0.0
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.014 ± 0.952
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.035 ± 1.652
ProCys: 2.014 ± 1.064
ProAsp: 6.042 ± 1.176
ProGlu: 2.014 ± 0.952
ProPhe: 4.028 ± 2.128
ProGly: 3.021 ± 0.588
ProHis: 3.021 ± 0.588
ProIle: 2.014 ± 1.064
ProLys: 1.007 ± 0.476
ProLeu: 5.035 ± 0.364
ProMet: 0.0 ± 0.0
ProAsn: 2.014 ± 3.08
ProPro: 4.028 ± 2.128
ProGln: 2.014 ± 0.952
ProArg: 4.028 ± 1.904
ProSer: 5.035 ± 0.364
ProThr: 1.007 ± 1.54
ProVal: 5.035 ± 2.381
ProTrp: 0.0 ± 0.0
ProTyr: 1.007 ± 1.54
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.035 ± 2.381
GlnCys: 0.0 ± 0.0
GlnAsp: 4.028 ± 2.128
GlnGlu: 1.007 ± 0.476
GlnPhe: 1.007 ± 0.476
GlnGly: 1.007 ± 0.476
GlnHis: 1.007 ± 0.476
GlnIle: 2.014 ± 0.952
GlnLys: 0.0 ± 0.0
GlnLeu: 2.014 ± 0.952
GlnMet: 2.014 ± 0.952
GlnAsn: 1.007 ± 0.476
GlnPro: 2.014 ± 1.064
GlnGln: 1.007 ± 0.476
GlnArg: 3.021 ± 1.428
GlnSer: 0.0 ± 0.0
GlnThr: 1.007 ± 1.54
GlnVal: 1.007 ± 1.54
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.042 ± 2.857
ArgCys: 1.007 ± 0.476
ArgAsp: 11.078 ± 6.86
ArgGlu: 4.028 ± 2.128
ArgPhe: 5.035 ± 3.668
ArgGly: 6.042 ± 3.192
ArgHis: 0.0 ± 0.0
ArgIle: 7.049 ± 1.317
ArgLys: 4.028 ± 1.904
ArgLeu: 5.035 ± 0.364
ArgMet: 5.035 ± 0.364
ArgAsn: 2.014 ± 0.952
ArgPro: 2.014 ± 0.952
ArgGln: 1.007 ± 0.476
ArgArg: 7.049 ± 0.699
ArgSer: 5.035 ± 0.364
ArgThr: 6.042 ± 2.857
ArgVal: 4.028 ± 0.112
ArgTrp: 3.021 ± 1.428
ArgTyr: 4.028 ± 0.112
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.035 ± 0.364
SerCys: 1.007 ± 0.476
SerAsp: 1.007 ± 0.476
SerGlu: 6.042 ± 0.841
SerPhe: 1.007 ± 1.54
SerGly: 4.028 ± 1.904
SerHis: 1.007 ± 0.476
SerIle: 3.021 ± 2.604
SerLys: 6.042 ± 1.176
SerLeu: 7.049 ± 1.317
SerMet: 0.0 ± 0.0
SerAsn: 1.007 ± 0.476
SerPro: 2.014 ± 1.064
SerGln: 3.021 ± 0.588
SerArg: 7.049 ± 0.699
SerSer: 3.021 ± 1.428
SerThr: 3.021 ± 1.428
SerVal: 3.021 ± 1.428
SerTrp: 1.007 ± 0.476
SerTyr: 5.035 ± 0.364
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.007 ± 0.476
ThrCys: 0.0 ± 0.0
ThrAsp: 4.028 ± 1.904
ThrGlu: 3.021 ± 2.604
ThrPhe: 4.028 ± 0.112
ThrGly: 6.042 ± 1.176
ThrHis: 2.014 ± 0.952
ThrIle: 3.021 ± 0.588
ThrLys: 2.014 ± 1.064
ThrLeu: 4.028 ± 1.904
ThrMet: 0.0 ± 0.0
ThrAsn: 1.007 ± 0.476
ThrPro: 2.014 ± 0.952
ThrGln: 2.014 ± 1.064
ThrArg: 5.035 ± 1.652
ThrSer: 2.014 ± 0.952
ThrThr: 3.021 ± 2.604
ThrVal: 4.028 ± 1.904
ThrTrp: 2.014 ± 1.064
ThrTyr: 1.007 ± 0.476
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.028 ± 0.112
ValCys: 2.014 ± 1.064
ValAsp: 5.035 ± 1.652
ValGlu: 3.021 ± 0.588
ValPhe: 4.028 ± 1.904
ValGly: 1.007 ± 0.476
ValHis: 4.028 ± 1.904
ValIle: 2.014 ± 1.064
ValLys: 1.007 ± 0.476
ValLeu: 13.092 ± 2.157
ValMet: 2.014 ± 0.952
ValAsn: 1.007 ± 1.54
ValPro: 6.042 ± 3.192
ValGln: 0.0 ± 0.0
ValArg: 5.035 ± 0.364
ValSer: 5.035 ± 0.364
ValThr: 3.021 ± 0.588
ValVal: 7.049 ± 0.699
ValTrp: 0.0 ± 0.0
ValTyr: 2.014 ± 1.064
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.007 ± 0.476
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.007 ± 0.476
TrpGly: 2.014 ± 0.952
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.007 ± 0.476
TrpMet: 1.007 ± 0.842
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 2.014 ± 0.952
TrpSer: 0.0 ± 0.0
TrpThr: 2.014 ± 1.064
TrpVal: 1.007 ± 0.476
TrpTrp: 1.007 ± 0.476
TrpTyr: 2.014 ± 0.952
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.007 ± 0.476
TyrAsp: 3.021 ± 1.428
TyrGlu: 2.014 ± 1.064
TyrPhe: 0.0 ± 0.0
TyrGly: 2.014 ± 0.952
TyrHis: 3.021 ± 0.588
TyrIle: 0.0 ± 0.0
TyrLys: 1.007 ± 0.476
TyrLeu: 6.042 ± 1.176
TyrMet: 2.014 ± 3.08
TyrAsn: 1.007 ± 1.54
TyrPro: 4.028 ± 4.144
TyrGln: 3.021 ± 0.588
TyrArg: 6.042 ± 0.841
TyrSer: 1.007 ± 0.476
TyrThr: 3.021 ± 1.428
TyrVal: 1.007 ± 0.476
TyrTrp: 1.007 ± 0.476
TyrTyr: 1.007 ± 0.476
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (994 amino acids)

Note: The error has been estimated by bootstrapping (100×) at the protein level.
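A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's actual implementation. The function names are hypothetical, and reporting the standard deviation of the bootstrap estimates as the error, counting pairs only within each protein, and using 100 rounds are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. "LS") in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping residue pairs within a single protein (assumption).
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimates.

    Assumption: the reported error is the standard deviation over bootstrap rounds.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only 2 proteins in this proteome the bootstrap has very few distinct samples to draw from.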

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski