Amino acid dipepetide frequency for Wenzhou narna-like virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.895AlaAla: 10.895 ± 4.664
4.669AlaCys: 4.669 ± 1.999
3.891AlaAsp: 3.891 ± 0.904
0.0AlaGlu: 0.0 ± 0.0
3.113AlaPhe: 3.113 ± 1.714
6.226AlaGly: 6.226 ± 4.189
0.0AlaHis: 0.0 ± 0.0
4.669AlaIle: 4.669 ± 3.522
4.669AlaLys: 4.669 ± 3.522
8.56AlaLeu: 8.56 ± 4.426
0.0AlaMet: 0.0 ± 0.0
3.891AlaAsn: 3.891 ± 0.904
3.113AlaPro: 3.113 ± 1.333
3.891AlaGln: 3.891 ± 2.427
5.447AlaArg: 5.447 ± 1.57
2.335AlaSer: 2.335 ± 0.238
5.447AlaThr: 5.447 ± 0.047
4.669AlaVal: 4.669 ± 0.475
1.556AlaTrp: 1.556 ± 0.857
2.335AlaTyr: 2.335 ± 0.238
0.0AlaXaa: 0.0 ± 0.0
Cys
1.556CysAla: 1.556 ± 0.666
0.778CysCys: 0.778 ± 0.429
0.0CysAsp: 0.0 ± 0.0
0.778CysGlu: 0.778 ± 0.429
0.0CysPhe: 0.0 ± 0.0
1.556CysGly: 1.556 ± 0.666
1.556CysHis: 1.556 ± 0.666
0.0CysIle: 0.0 ± 0.0
1.556CysLys: 1.556 ± 0.666
3.891CysLeu: 3.891 ± 0.619
0.0CysMet: 0.0 ± 0.0
0.778CysAsn: 0.778 ± 0.429
1.556CysPro: 1.556 ± 0.857
1.556CysGln: 1.556 ± 0.666
0.0CysArg: 0.0 ± 0.0
1.556CysSer: 1.556 ± 0.857
2.335CysThr: 2.335 ± 1.286
3.891CysVal: 3.891 ± 0.904
0.778CysTrp: 0.778 ± 1.095
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.113AspAla: 3.113 ± 0.191
1.556AspCys: 1.556 ± 0.857
1.556AspAsp: 1.556 ± 0.666
2.335AspGlu: 2.335 ± 1.286
2.335AspPhe: 2.335 ± 1.286
3.891AspGly: 3.891 ± 0.619
0.778AspHis: 0.778 ± 0.429
0.778AspIle: 0.778 ± 0.429
3.113AspLys: 3.113 ± 1.714
1.556AspLeu: 1.556 ± 0.666
0.778AspMet: 0.778 ± 1.095
0.778AspAsn: 0.778 ± 1.095
3.891AspPro: 3.891 ± 0.904
0.0AspGln: 0.0 ± 0.0
2.335AspArg: 2.335 ± 0.238
5.447AspSer: 5.447 ± 1.477
3.113AspThr: 3.113 ± 1.333
3.113AspVal: 3.113 ± 1.333
0.778AspTrp: 0.778 ± 1.095
1.556AspTyr: 1.556 ± 0.857
0.0AspXaa: 0.0 ± 0.0
Glu
3.891GluAla: 3.891 ± 0.904
1.556GluCys: 1.556 ± 0.857
3.113GluAsp: 3.113 ± 1.714
6.226GluGlu: 6.226 ± 1.905
0.0GluPhe: 0.0 ± 0.0
3.113GluGly: 3.113 ± 1.714
2.335GluHis: 2.335 ± 0.238
4.669GluIle: 4.669 ± 2.572
0.778GluLys: 0.778 ± 0.429
7.004GluLeu: 7.004 ± 2.334
0.778GluMet: 0.778 ± 0.429
0.778GluAsn: 0.778 ± 0.429
1.556GluPro: 1.556 ± 0.666
3.113GluGln: 3.113 ± 0.191
2.335GluArg: 2.335 ± 1.286
5.447GluSer: 5.447 ± 3.0
3.891GluThr: 3.891 ± 2.143
3.891GluVal: 3.891 ± 2.143
2.335GluTrp: 2.335 ± 3.285
0.778GluTyr: 0.778 ± 0.429
0.0GluXaa: 0.0 ± 0.0
Phe
2.335PheAla: 2.335 ± 1.761
1.556PheCys: 1.556 ± 0.666
3.113PheAsp: 3.113 ± 1.333
2.335PheGlu: 2.335 ± 0.238
2.335PhePhe: 2.335 ± 1.286
1.556PheGly: 1.556 ± 0.666
2.335PheHis: 2.335 ± 1.286
0.778PheIle: 0.778 ± 0.429
3.113PheLys: 3.113 ± 1.714
3.113PheLeu: 3.113 ± 1.714
0.778PheMet: 0.778 ± 0.429
3.113PheAsn: 3.113 ± 0.191
2.335PhePro: 2.335 ± 1.286
0.778PheGln: 0.778 ± 0.429
3.891PheArg: 3.891 ± 0.619
1.556PheSer: 1.556 ± 0.857
1.556PheThr: 1.556 ± 0.857
1.556PheVal: 1.556 ± 0.857
2.335PheTrp: 2.335 ± 1.286
0.778PheTyr: 0.778 ± 0.429
0.0PheXaa: 0.0 ± 0.0
Gly
3.113GlyAla: 3.113 ± 2.856
2.335GlyCys: 2.335 ± 0.238
3.113GlyAsp: 3.113 ± 0.191
3.891GlyGlu: 3.891 ± 0.619
3.891GlyPhe: 3.891 ± 0.619
4.669GlyGly: 4.669 ± 1.999
0.778GlyHis: 0.778 ± 1.095
6.226GlyIle: 6.226 ± 0.382
5.447GlyLys: 5.447 ± 0.047
3.891GlyLeu: 3.891 ± 2.427
1.556GlyMet: 1.556 ± 0.666
1.556GlyAsn: 1.556 ± 2.19
4.669GlyPro: 4.669 ± 1.048
1.556GlyGln: 1.556 ± 0.666
6.226GlyArg: 6.226 ± 1.142
4.669GlySer: 4.669 ± 1.999
3.891GlyThr: 3.891 ± 2.427
1.556GlyVal: 1.556 ± 0.857
0.0GlyTrp: 0.0 ± 0.0
1.556GlyTyr: 1.556 ± 0.666
0.0GlyXaa: 0.0 ± 0.0
His
0.778HisAla: 0.778 ± 1.095
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.556HisGlu: 1.556 ± 0.857
0.778HisPhe: 0.778 ± 0.429
0.778HisGly: 0.778 ± 0.429
0.778HisHis: 0.778 ± 0.429
0.0HisIle: 0.0 ± 0.0
0.778HisLys: 0.778 ± 0.429
1.556HisLeu: 1.556 ± 0.666
0.778HisMet: 0.778 ± 0.335
0.778HisAsn: 0.778 ± 1.095
1.556HisPro: 1.556 ± 0.857
2.335HisGln: 2.335 ± 1.286
2.335HisArg: 2.335 ± 1.286
1.556HisSer: 1.556 ± 0.666
2.335HisThr: 2.335 ± 1.286
0.778HisVal: 0.778 ± 1.095
0.0HisTrp: 0.0 ± 0.0
5.447HisTyr: 5.447 ± 1.57
0.0HisXaa: 0.0 ± 0.0
Ile
5.447IleAla: 5.447 ± 3.094
0.0IleCys: 0.0 ± 0.0
4.669IleAsp: 4.669 ± 1.048
1.556IleGlu: 1.556 ± 0.857
1.556IlePhe: 1.556 ± 0.857
5.447IleGly: 5.447 ± 1.57
1.556IleHis: 1.556 ± 0.857
0.778IleIle: 0.778 ± 0.429
3.891IleLys: 3.891 ± 2.143
4.669IleLeu: 4.669 ± 2.572
0.0IleMet: 0.0 ± 0.0
2.335IleAsn: 2.335 ± 0.238
2.335IlePro: 2.335 ± 1.286
1.556IleGln: 1.556 ± 0.666
3.891IleArg: 3.891 ± 0.619
3.113IleSer: 3.113 ± 0.191
0.778IleThr: 0.778 ± 0.429
3.891IleVal: 3.891 ± 0.904
0.778IleTrp: 0.778 ± 0.429
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
6.226LysAla: 6.226 ± 1.142
0.0LysCys: 0.0 ± 0.0
1.556LysAsp: 1.556 ± 0.666
2.335LysGlu: 2.335 ± 1.286
2.335LysPhe: 2.335 ± 1.286
1.556LysGly: 1.556 ± 0.857
3.113LysHis: 3.113 ± 0.191
3.113LysIle: 3.113 ± 0.191
3.891LysLys: 3.891 ± 2.143
5.447LysLeu: 5.447 ± 1.477
0.778LysMet: 0.778 ± 0.429
2.335LysAsn: 2.335 ± 0.238
6.226LysPro: 6.226 ± 1.905
1.556LysGln: 1.556 ± 0.666
5.447LysArg: 5.447 ± 0.047
9.339LysSer: 9.339 ± 0.951
3.891LysThr: 3.891 ± 0.619
3.113LysVal: 3.113 ± 0.191
1.556LysTrp: 1.556 ± 0.857
3.113LysTyr: 3.113 ± 1.714
0.0LysXaa: 0.0 ± 0.0
Leu
7.004LeuAla: 7.004 ± 2.237
2.335LeuCys: 2.335 ± 0.238
3.891LeuAsp: 3.891 ± 2.427
5.447LeuGlu: 5.447 ± 1.477
3.113LeuPhe: 3.113 ± 1.714
2.335LeuGly: 2.335 ± 1.286
2.335LeuHis: 2.335 ± 1.286
1.556LeuIle: 1.556 ± 0.666
4.669LeuLys: 4.669 ± 2.572
3.891LeuLeu: 3.891 ± 0.619
0.778LeuMet: 0.778 ± 0.284
7.782LeuAsn: 7.782 ± 1.239
6.226LeuPro: 6.226 ± 0.382
3.891LeuGln: 3.891 ± 0.619
4.669LeuArg: 4.669 ± 1.999
6.226LeuSer: 6.226 ± 0.382
1.556LeuThr: 1.556 ± 2.19
4.669LeuVal: 4.669 ± 1.048
2.335LeuTrp: 2.335 ± 1.286
3.113LeuTyr: 3.113 ± 1.714
0.0LeuXaa: 0.0 ± 0.0
Met
0.778MetAla: 0.778 ± 0.429
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.556MetGlu: 1.556 ± 0.666
0.0MetPhe: 0.0 ± 0.0
0.778MetGly: 0.778 ± 0.429
0.778MetHis: 0.778 ± 0.429
0.778MetIle: 0.778 ± 1.095
0.0MetLys: 0.0 ± 0.0
1.556MetLeu: 1.556 ± 0.666
0.0MetMet: 0.0 ± 0.0
0.778MetAsn: 0.778 ± 1.095
0.0MetPro: 0.0 ± 0.0
0.778MetGln: 0.778 ± 0.429
0.0MetArg: 0.0 ± 0.0
2.335MetSer: 2.335 ± 1.761
2.335MetThr: 2.335 ± 1.761
1.556MetVal: 1.556 ± 0.857
1.556MetTrp: 1.556 ± 0.857
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.891AsnAla: 3.891 ± 0.904
1.556AsnCys: 1.556 ± 0.666
0.778AsnAsp: 0.778 ± 0.429
3.113AsnGlu: 3.113 ± 1.714
0.778AsnPhe: 0.778 ± 0.429
2.335AsnGly: 2.335 ± 0.238
0.778AsnHis: 0.778 ± 1.095
0.778AsnIle: 0.778 ± 0.429
1.556AsnLys: 1.556 ± 0.666
3.113AsnLeu: 3.113 ± 0.191
0.778AsnMet: 0.778 ± 1.095
0.0AsnAsn: 0.0 ± 0.0
1.556AsnPro: 1.556 ± 0.666
0.778AsnGln: 0.778 ± 0.429
2.335AsnArg: 2.335 ± 1.286
3.113AsnSer: 3.113 ± 0.191
4.669AsnThr: 4.669 ± 3.522
3.113AsnVal: 3.113 ± 1.333
0.778AsnTrp: 0.778 ± 0.429
2.335AsnTyr: 2.335 ± 0.238
0.0AsnXaa: 0.0 ± 0.0
Pro
3.113ProAla: 3.113 ± 1.333
3.891ProCys: 3.891 ± 0.619
3.891ProAsp: 3.891 ± 2.143
3.891ProGlu: 3.891 ± 2.143
4.669ProPhe: 4.669 ± 0.475
4.669ProGly: 4.669 ± 1.999
0.0ProHis: 0.0 ± 0.0
4.669ProIle: 4.669 ± 1.048
4.669ProLys: 4.669 ± 1.048
3.113ProLeu: 3.113 ± 0.191
3.891ProMet: 3.891 ± 0.904
2.335ProAsn: 2.335 ± 0.238
3.113ProPro: 3.113 ± 1.714
1.556ProGln: 1.556 ± 0.857
2.335ProArg: 2.335 ± 1.286
2.335ProSer: 2.335 ± 0.238
3.891ProThr: 3.891 ± 2.143
3.891ProVal: 3.891 ± 2.427
0.778ProTrp: 0.778 ± 0.429
0.778ProTyr: 0.778 ± 0.429
0.0ProXaa: 0.0 ± 0.0
Gln
4.669GlnAla: 4.669 ± 3.522
0.0GlnCys: 0.0 ± 0.0
1.556GlnAsp: 1.556 ± 0.857
2.335GlnGlu: 2.335 ± 1.286
1.556GlnPhe: 1.556 ± 0.857
2.335GlnGly: 2.335 ± 0.238
0.778GlnHis: 0.778 ± 0.429
2.335GlnIle: 2.335 ± 0.238
1.556GlnLys: 1.556 ± 0.857
2.335GlnLeu: 2.335 ± 1.286
0.778GlnMet: 0.778 ± 1.095
1.556GlnAsn: 1.556 ± 0.857
2.335GlnPro: 2.335 ± 1.761
3.113GlnGln: 3.113 ± 0.191
3.113GlnArg: 3.113 ± 1.714
5.447GlnSer: 5.447 ± 1.57
3.891GlnThr: 3.891 ± 2.427
0.778GlnVal: 0.778 ± 0.429
0.778GlnTrp: 0.778 ± 0.429
3.113GlnTyr: 3.113 ± 0.191
0.0GlnXaa: 0.0 ± 0.0
Arg
2.335ArgAla: 2.335 ± 1.761
2.335ArgCys: 2.335 ± 0.238
2.335ArgAsp: 2.335 ± 1.286
4.669ArgGlu: 4.669 ± 1.048
2.335ArgPhe: 2.335 ± 0.238
4.669ArgGly: 4.669 ± 1.999
0.778ArgHis: 0.778 ± 0.429
3.113ArgIle: 3.113 ± 0.191
4.669ArgLys: 4.669 ± 1.048
8.56ArgLeu: 8.56 ± 0.144
0.778ArgMet: 0.778 ± 0.429
2.335ArgAsn: 2.335 ± 1.286
3.113ArgPro: 3.113 ± 2.856
0.0ArgGln: 0.0 ± 0.0
0.778ArgArg: 0.778 ± 1.095
4.669ArgSer: 4.669 ± 1.048
4.669ArgThr: 4.669 ± 1.048
4.669ArgVal: 4.669 ± 1.048
1.556ArgTrp: 1.556 ± 0.857
1.556ArgTyr: 1.556 ± 0.666
0.0ArgXaa: 0.0 ± 0.0
Ser
3.891SerAla: 3.891 ± 0.619
0.0SerCys: 0.0 ± 0.0
1.556SerAsp: 1.556 ± 0.666
5.447SerGlu: 5.447 ± 1.477
3.891SerPhe: 3.891 ± 0.619
9.339SerGly: 9.339 ± 2.474
0.778SerHis: 0.778 ± 1.095
3.113SerIle: 3.113 ± 1.714
5.447SerLys: 5.447 ± 0.047
5.447SerLeu: 5.447 ± 1.477
0.0SerMet: 0.0 ± 0.0
2.335SerAsn: 2.335 ± 3.285
4.669SerPro: 4.669 ± 1.048
3.891SerGln: 3.891 ± 0.619
5.447SerArg: 5.447 ± 4.617
4.669SerSer: 4.669 ± 1.999
4.669SerThr: 4.669 ± 1.048
4.669SerVal: 4.669 ± 1.048
0.778SerTrp: 0.778 ± 0.429
1.556SerTyr: 1.556 ± 0.666
0.0SerXaa: 0.0 ± 0.0
Thr
3.891ThrAla: 3.891 ± 0.904
0.0ThrCys: 0.0 ± 0.0
5.447ThrAsp: 5.447 ± 1.57
3.113ThrGlu: 3.113 ± 0.191
0.778ThrPhe: 0.778 ± 1.095
5.447ThrGly: 5.447 ± 3.094
3.113ThrHis: 3.113 ± 0.191
6.226ThrIle: 6.226 ± 1.905
8.56ThrLys: 8.56 ± 2.903
3.891ThrLeu: 3.891 ± 0.619
0.778ThrMet: 0.778 ± 0.429
0.0ThrAsn: 0.0 ± 0.0
7.004ThrPro: 7.004 ± 2.334
3.113ThrGln: 3.113 ± 1.333
2.335ThrArg: 2.335 ± 1.286
2.335ThrSer: 2.335 ± 1.761
3.113ThrThr: 3.113 ± 1.714
3.891ThrVal: 3.891 ± 2.427
3.113ThrTrp: 3.113 ± 1.714
1.556ThrTyr: 1.556 ± 0.857
0.0ThrXaa: 0.0 ± 0.0
Val
8.56ValAla: 8.56 ± 4.426
0.778ValCys: 0.778 ± 0.429
1.556ValAsp: 1.556 ± 0.857
2.335ValGlu: 2.335 ± 1.286
5.447ValPhe: 5.447 ± 0.047
2.335ValGly: 2.335 ± 0.238
1.556ValHis: 1.556 ± 0.666
2.335ValIle: 2.335 ± 1.286
3.113ValLys: 3.113 ± 0.191
3.891ValLeu: 3.891 ± 0.904
0.778ValMet: 0.778 ± 0.429
3.113ValAsn: 3.113 ± 0.191
3.891ValPro: 3.891 ± 0.619
3.891ValGln: 3.891 ± 0.619
3.891ValArg: 3.891 ± 0.619
1.556ValSer: 1.556 ± 0.666
3.891ValThr: 3.891 ± 0.904
1.556ValVal: 1.556 ± 0.857
2.335ValTrp: 2.335 ± 0.238
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.778TrpAla: 0.778 ± 0.429
0.778TrpCys: 0.778 ± 0.429
0.778TrpAsp: 0.778 ± 0.429
3.113TrpGlu: 3.113 ± 0.191
1.556TrpPhe: 1.556 ± 0.666
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.556TrpIle: 1.556 ± 0.857
2.335TrpLys: 2.335 ± 1.286
1.556TrpLeu: 1.556 ± 0.857
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.778TrpPro: 0.778 ± 0.429
3.891TrpGln: 3.891 ± 0.619
0.778TrpArg: 0.778 ± 0.429
0.778TrpSer: 0.778 ± 1.095
4.669TrpThr: 4.669 ± 1.048
0.778TrpVal: 0.778 ± 0.429
0.778TrpTrp: 0.778 ± 0.429
0.778TrpTyr: 0.778 ± 1.095
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.113TyrAla: 3.113 ± 1.714
0.0TyrCys: 0.0 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
1.556TyrGlu: 1.556 ± 0.666
0.778TyrPhe: 0.778 ± 0.429
1.556TyrGly: 1.556 ± 0.666
0.778TyrHis: 0.778 ± 0.429
0.778TyrIle: 0.778 ± 1.095
3.113TyrLys: 3.113 ± 1.714
1.556TyrLeu: 1.556 ± 0.857
0.778TyrMet: 0.778 ± 1.095
1.556TyrAsn: 1.556 ± 0.857
1.556TyrPro: 1.556 ± 0.857
3.113TyrGln: 3.113 ± 1.333
2.335TyrArg: 2.335 ± 1.286
3.113TyrSer: 3.113 ± 0.191
3.113TyrThr: 3.113 ± 2.856
0.778TyrVal: 0.778 ± 0.429
0.778TyrTrp: 0.778 ± 0.429
0.778TyrTyr: 0.778 ± 0.429
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1286 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski