Amino acid dipeptide frequency for Fig cryptic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
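As an illustration of the definition above, the following minimal Python sketch counts overlapping residue pairs and reports them in per mille. It is not the code used by Proteome-pI: the function name `dipeptide_permille`, the choice to count pairs within each protein separately, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the residue order used by the table on this page.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: pairs are counted as overlapping windows inside each protein,
    and any residue outside the 20 standard ones is mapped to Xaa.
    """
    counts = Counter()
    for seq in proteins:
        codes = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(THREE_LETTER.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Hypothetical toy sequences, not the Fig cryptic virus proteins.
freqs = dipeptide_permille(["MAAL", "ALX"])
print(freqs["AlaLeu"], freqs["LeuXaa"])
```

In the toy input there are five dipeptides in total, two of which are AlaLeu, so the reported frequency for AlaLeu is two fifths of 1000‰.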

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
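The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the helper names are hypothetical, and the strict greater-than or less-than rule is an assumption, since the page does not state the exact threshold used for the red and blue marking. The sample values are copied from the table on this page.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation in per mille: 1000 / (number of ordered pairs)."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed strict rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


expected = expected_permille()  # 20 * 20 = 400 pairs
sample = {"AlaAla": 7.426, "AlaGlu": 1.238, "LeuAla": 12.376, "AlaCys": 0.0}
labels = {name: classify(value, expected) for name, value in sample.items()}
print(expected, labels)
```

With 20 residue types the expectation is 2.5‰, so AlaAla and LeuAla come out as overrepresented and AlaGlu and AlaCys as underrepresented. Passing `21` to `expected_permille` gives the slightly lower expectation for the 441-pair case that includes Xaa.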

For more information, see the sequence space article on Wikipedia.

Residue order (first residue in rows, second residue in columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.426 ± 2.774
AlaCys: 0.0 ± 0.0
AlaAsp: 3.713 ± 0.461
AlaGlu: 1.238 ± 1.078
AlaPhe: 8.663 ± 5.387
AlaGly: 8.663 ± 2.004
AlaHis: 1.238 ± 1.078
AlaIle: 4.95 ± 2.465
AlaLys: 3.713 ± 2.309
AlaLeu: 6.188 ± 5.391
AlaMet: 0.0 ± 0.0
AlaAsn: 2.475 ± 0.309
AlaPro: 7.426 ± 2.774
AlaGln: 0.0 ± 0.0
AlaArg: 3.713 ± 0.461
AlaSer: 2.475 ± 1.539
AlaThr: 2.475 ± 0.309
AlaVal: 3.713 ± 0.461
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.238 ± 0.77
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 2.475 ± 0.309
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.238 ± 0.77
CysLys: 1.238 ± 0.77
CysLeu: 1.238 ± 0.77
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.238 ± 0.77
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 4.95 ± 2.465
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.188 ± 0.152
AspCys: 0.0 ± 0.0
AspAsp: 1.238 ± 0.77
AspGlu: 1.238 ± 0.77
AspPhe: 3.713 ± 1.387
AspGly: 4.95 ± 0.617
AspHis: 2.475 ± 1.539
AspIle: 3.713 ± 0.461
AspLys: 0.0 ± 0.0
AspLeu: 2.475 ± 0.309
AspMet: 2.475 ± 0.309
AspAsn: 2.475 ± 0.309
AspPro: 7.426 ± 2.769
AspGln: 2.475 ± 0.309
AspArg: 3.713 ± 1.387
AspSer: 3.713 ± 2.309
AspThr: 1.238 ± 0.77
AspVal: 4.95 ± 0.617
AspTrp: 1.238 ± 0.77
AspTyr: 2.475 ± 0.309
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.188 ± 0.152
GluCys: 1.238 ± 1.078
GluAsp: 4.95 ± 0.617
GluGlu: 3.713 ± 3.235
GluPhe: 2.475 ± 0.309
GluGly: 6.188 ± 3.848
GluHis: 0.0 ± 0.0
GluIle: 3.713 ± 2.309
GluLys: 0.0 ± 0.0
GluLeu: 2.475 ± 0.309
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 2.475 ± 0.309
GluGln: 2.475 ± 0.309
GluArg: 4.95 ± 1.23
GluSer: 3.713 ± 0.461
GluThr: 1.238 ± 0.77
GluVal: 1.238 ± 0.77
GluTrp: 0.0 ± 0.0
GluTyr: 2.475 ± 1.539
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.238 ± 1.078
PheCys: 0.0 ± 0.0
PheAsp: 7.426 ± 0.926
PheGlu: 3.713 ± 0.461
PhePhe: 0.0 ± 0.0
PheGly: 3.713 ± 0.461
PheHis: 6.188 ± 1.696
PheIle: 2.475 ± 1.539
PheLys: 2.475 ± 1.539
PheLeu: 4.95 ± 0.617
PheMet: 1.238 ± 1.078
PheAsn: 1.238 ± 0.77
PhePro: 6.188 ± 0.152
PheGln: 1.238 ± 0.77
PheArg: 0.0 ± 0.0
PheSer: 2.475 ± 1.539
PheThr: 4.95 ± 1.23
PheVal: 2.475 ± 0.309
PheTrp: 1.238 ± 1.078
PheTyr: 2.475 ± 1.539
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 0.0 ± 0.0
GlyCys: 2.475 ± 2.157
GlyAsp: 2.475 ± 1.539
GlyGlu: 6.188 ± 2.0
GlyPhe: 2.475 ± 0.309
GlyGly: 2.475 ± 1.539
GlyHis: 1.238 ± 1.078
GlyIle: 4.95 ± 1.23
GlyLys: 3.713 ± 0.461
GlyLeu: 3.713 ± 2.309
GlyMet: 2.475 ± 2.157
GlyAsn: 2.475 ± 1.539
GlyPro: 1.238 ± 1.078
GlyGln: 0.0 ± 0.0
GlyArg: 6.188 ± 3.544
GlySer: 4.95 ± 1.23
GlyThr: 7.426 ± 4.622
GlyVal: 1.238 ± 1.078
GlyTrp: 4.95 ± 3.078
GlyTyr: 4.95 ± 1.23
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.238 ± 0.77
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.238 ± 0.77
HisPhe: 0.0 ± 0.0
HisGly: 1.238 ± 1.078
HisHis: 1.238 ± 1.078
HisIle: 1.238 ± 0.77
HisLys: 1.238 ± 0.77
HisLeu: 4.95 ± 1.23
HisMet: 0.0 ± 0.0
HisAsn: 1.238 ± 0.77
HisPro: 1.238 ± 1.078
HisGln: 1.238 ± 0.77
HisArg: 2.475 ± 0.309
HisSer: 2.475 ± 2.157
HisThr: 0.0 ± 0.0
HisVal: 1.238 ± 0.77
HisTrp: 0.0 ± 0.0
HisTyr: 2.475 ± 2.157
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.188 ± 0.152
IleCys: 1.238 ± 0.77
IleAsp: 4.95 ± 1.23
IleGlu: 3.713 ± 2.309
IlePhe: 2.475 ± 2.157
IleGly: 4.95 ± 1.23
IleHis: 2.475 ± 1.539
IleIle: 6.188 ± 1.696
IleLys: 1.238 ± 0.77
IleLeu: 6.188 ± 2.0
IleMet: 2.475 ± 1.539
IleAsn: 3.713 ± 1.387
IlePro: 3.713 ± 2.309
IleGln: 3.713 ± 0.461
IleArg: 3.713 ± 1.387
IleSer: 3.713 ± 1.387
IleThr: 4.95 ± 2.465
IleVal: 3.713 ± 1.387
IleTrp: 1.238 ± 0.77
IleTyr: 3.713 ± 0.461
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.475 ± 1.539
LysCys: 1.238 ± 0.77
LysAsp: 1.238 ± 0.77
LysGlu: 0.0 ± 0.0
LysPhe: 1.238 ± 0.77
LysGly: 2.475 ± 1.539
LysHis: 0.0 ± 0.0
LysIle: 4.95 ± 1.23
LysLys: 1.238 ± 0.77
LysLeu: 3.713 ± 0.461
LysMet: 1.238 ± 0.77
LysAsn: 0.0 ± 0.0
LysPro: 2.475 ± 1.539
LysGln: 1.238 ± 0.77
LysArg: 4.95 ± 3.078
LysSer: 1.238 ± 0.77
LysThr: 1.238 ± 0.77
LysVal: 3.713 ± 2.309
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 12.376 ± 1.544
LeuCys: 0.0 ± 0.0
LeuAsp: 2.475 ± 0.309
LeuGlu: 3.713 ± 1.387
LeuPhe: 3.713 ± 0.461
LeuGly: 4.95 ± 1.23
LeuHis: 1.238 ± 0.77
LeuIle: 7.426 ± 2.769
LeuLys: 2.475 ± 1.539
LeuLeu: 2.475 ± 1.539
LeuMet: 2.475 ± 1.539
LeuAsn: 2.475 ± 1.539
LeuPro: 3.713 ± 2.309
LeuGln: 4.95 ± 1.23
LeuArg: 7.426 ± 2.774
LeuSer: 3.713 ± 0.461
LeuThr: 4.95 ± 0.617
LeuVal: 3.713 ± 0.461
LeuTrp: 4.95 ± 0.617
LeuTyr: 3.713 ± 1.387
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.475 ± 0.309
MetCys: 1.238 ± 0.77
MetAsp: 2.475 ± 0.309
MetGlu: 2.475 ± 0.309
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 1.238 ± 0.77
MetIle: 0.0 ± 0.0
MetLys: 1.238 ± 0.77
MetLeu: 2.475 ± 0.309
MetMet: 1.238 ± 0.572
MetAsn: 1.238 ± 1.078
MetPro: 0.0 ± 0.0
MetGln: 2.475 ± 1.539
MetArg: 2.475 ± 0.309
MetSer: 1.238 ± 0.77
MetThr: 2.475 ± 2.157
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.238 ± 1.078
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.713 ± 0.461
AsnCys: 1.238 ± 0.77
AsnAsp: 2.475 ± 0.309
AsnGlu: 2.475 ± 0.309
AsnPhe: 2.475 ± 0.309
AsnGly: 2.475 ± 0.309
AsnHis: 1.238 ± 1.078
AsnIle: 8.663 ± 1.691
AsnLys: 1.238 ± 0.77
AsnLeu: 0.0 ± 0.0
AsnMet: 1.238 ± 0.77
AsnAsn: 3.713 ± 1.387
AsnPro: 1.238 ± 1.078
AsnGln: 1.238 ± 0.77
AsnArg: 6.188 ± 3.544
AsnSer: 0.0 ± 0.0
AsnThr: 1.238 ± 1.078
AsnVal: 1.238 ± 1.078
AsnTrp: 1.238 ± 1.078
AsnTyr: 2.475 ± 0.309
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.475 ± 0.309
ProCys: 0.0 ± 0.0
ProAsp: 4.95 ± 0.617
ProGlu: 2.475 ± 1.539
ProPhe: 3.713 ± 0.461
ProGly: 1.238 ± 1.078
ProHis: 1.238 ± 0.77
ProIle: 4.95 ± 2.465
ProLys: 3.713 ± 0.461
ProLeu: 3.713 ± 0.461
ProMet: 0.0 ± 0.0
ProAsn: 2.475 ± 0.309
ProPro: 1.238 ± 0.77
ProGln: 2.475 ± 1.539
ProArg: 2.475 ± 0.309
ProSer: 6.188 ± 1.696
ProThr: 8.663 ± 2.004
ProVal: 3.713 ± 1.387
ProTrp: 0.0 ± 0.0
ProTyr: 1.238 ± 1.078
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.238 ± 0.77
GlnCys: 0.0 ± 0.0
GlnAsp: 2.475 ± 1.539
GlnGlu: 1.238 ± 0.77
GlnPhe: 2.475 ± 1.539
GlnGly: 3.713 ± 2.309
GlnHis: 0.0 ± 0.0
GlnIle: 1.238 ± 1.078
GlnLys: 1.238 ± 0.77
GlnLeu: 12.376 ± 2.152
GlnMet: 2.475 ± 1.539
GlnAsn: 1.238 ± 1.078
GlnPro: 3.713 ± 1.387
GlnGln: 2.475 ± 1.539
GlnArg: 1.238 ± 0.77
GlnSer: 1.238 ± 0.77
GlnThr: 0.0 ± 0.0
GlnVal: 3.713 ± 1.387
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.95 ± 2.465
ArgCys: 0.0 ± 0.0
ArgAsp: 2.475 ± 1.539
ArgGlu: 3.713 ± 2.309
ArgPhe: 3.713 ± 1.387
ArgGly: 2.475 ± 2.157
ArgHis: 1.238 ± 0.77
ArgIle: 8.663 ± 2.004
ArgLys: 3.713 ± 2.309
ArgLeu: 11.139 ± 1.382
ArgMet: 3.713 ± 1.971
ArgAsn: 1.238 ± 0.77
ArgPro: 1.238 ± 1.078
ArgGln: 0.0 ± 0.0
ArgArg: 3.713 ± 1.387
ArgSer: 6.188 ± 1.696
ArgThr: 3.713 ± 1.387
ArgVal: 1.238 ± 0.77
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.475 ± 2.157
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.95 ± 0.617
SerCys: 1.238 ± 0.77
SerAsp: 1.238 ± 0.77
SerGlu: 2.475 ± 0.309
SerPhe: 4.95 ± 0.617
SerGly: 2.475 ± 0.309
SerHis: 2.475 ± 0.309
SerIle: 4.95 ± 1.23
SerLys: 1.238 ± 0.77
SerLeu: 4.95 ± 3.078
SerMet: 1.238 ± 1.078
SerAsn: 2.475 ± 2.157
SerPro: 0.0 ± 0.0
SerGln: 3.713 ± 0.461
SerArg: 2.475 ± 0.309
SerSer: 6.188 ± 0.152
SerThr: 4.95 ± 0.617
SerVal: 4.95 ± 1.23
SerTrp: 1.238 ± 1.078
SerTyr: 1.238 ± 0.77
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.475 ± 2.157
ThrCys: 1.238 ± 1.078
ThrAsp: 2.475 ± 0.309
ThrGlu: 3.713 ± 1.387
ThrPhe: 6.188 ± 0.152
ThrGly: 6.188 ± 1.696
ThrHis: 0.0 ± 0.0
ThrIle: 1.238 ± 0.77
ThrLys: 0.0 ± 0.0
ThrLeu: 3.713 ± 0.461
ThrMet: 0.0 ± 0.0
ThrAsn: 4.95 ± 0.617
ThrPro: 6.188 ± 3.544
ThrGln: 4.95 ± 1.23
ThrArg: 6.188 ± 0.152
ThrSer: 6.188 ± 3.544
ThrThr: 6.188 ± 3.544
ThrVal: 6.188 ± 3.544
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0

Val
ValAla: 1.238 ± 1.078
ValCys: 1.238 ± 0.77
ValAsp: 4.95 ± 0.617
ValGlu: 3.713 ± 2.309
ValPhe: 2.475 ± 0.309
ValGly: 3.713 ± 3.235
ValHis: 1.238 ± 1.078
ValIle: 2.475 ± 0.309
ValLys: 2.475 ± 1.539
ValLeu: 1.238 ± 0.77
ValMet: 1.238 ± 1.078
ValAsn: 3.713 ± 1.387
ValPro: 4.95 ± 0.617
ValGln: 6.188 ± 0.152
ValArg: 3.713 ± 2.309
ValSer: 1.238 ± 0.77
ValThr: 6.188 ± 3.544
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 1.238 ± 0.77
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.238 ± 0.77
TrpCys: 0.0 ± 0.0
TrpAsp: 1.238 ± 1.078
TrpGlu: 1.238 ± 0.77
TrpPhe: 1.238 ± 0.77
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.238 ± 0.77
TrpLeu: 1.238 ± 1.078
TrpMet: 1.238 ± 0.77
TrpAsn: 2.475 ± 0.309
TrpPro: 1.238 ± 1.078
TrpGln: 1.238 ± 1.078
TrpArg: 0.0 ± 0.0
TrpSer: 1.238 ± 0.77
TrpThr: 1.238 ± 0.77
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.238 ± 0.77
TyrCys: 0.0 ± 0.0
TyrAsp: 4.95 ± 1.23
TyrGlu: 1.238 ± 1.078
TyrPhe: 1.238 ± 0.77
TyrGly: 3.713 ± 0.461
TyrHis: 0.0 ± 0.0
TyrIle: 1.238 ± 1.078
TyrLys: 1.238 ± 0.77
TyrLeu: 3.713 ± 1.387
TyrMet: 0.0 ± 0.0
TyrAsn: 6.188 ± 1.696
TyrPro: 1.238 ± 1.078
TyrGln: 0.0 ± 0.0
TyrArg: 1.238 ± 1.078
TyrSer: 0.0 ± 0.0
TyrThr: 4.95 ± 0.617
TyrVal: 6.188 ± 0.152
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.238 ± 0.77
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (809 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
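Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a hedged illustration only. The function names, the use of the population standard deviation as the error measure, the fixed seed, and the toy sequences are assumptions and are not taken from the Proteome-pI implementation.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes, overlapping pairs)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency over resamples of whole proteins.

    Assumption: the error is the standard deviation of the bootstrap estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(resample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome of two sequences, not the real proteins.
toy = ["MAALAA", "MCCLCC"]
print(dipeptide_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only two proteins, as on this page, each resample is one of very few possible combinations, so the estimated errors can be large relative to the values themselves.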

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski