Amino acid dipeptide frequency for Striped jack nervous necrosis virus (SjNNV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
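The calculation described above can be sketched in Python as follows. This is only an illustration: the function names `dipeptide_per_mille` and `mark`, the toy sequences, and the decision to count overlapping pairs within each protein only are assumptions, since the page does not state how Proteome-pI treats protein boundaries.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

def dipeptide_per_mille(sequences, alphabet=STANDARD, unknown="X"):
    """Return {dipeptide: frequency in per mille} for all 21 x 21 pairs."""
    letters = alphabet + unknown
    counts = Counter()
    for seq in sequences:
        # Map every residue outside the standard alphabet to the unknown symbol.
        cleaned = [aa if aa in alphabet else unknown for aa in seq.upper()]
        # Overlapping adjacent pairs; assumption: pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(letters, repeat=2)
    }

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / len(STANDARD) ** 2

def mark(freq, expected=EXPECTED_UNIFORM):
    """Colour convention from the text: red if above expectation, blue if below."""
    if freq > expected:
        return "red"
    if freq < expected:
        return "blue"
    return "none"

# Toy input (hypothetical sequences, not SjNNV data).
toy = ["MAAGL", "AAK"]
freqs = dipeptide_per_mille(toy)
fraction_aa = freqs["AA"] * 1e-3  # per mille converted to a fraction
```

With the one-letter codes used here, `freqs["AA"]` corresponds to the AlaAla entry of the table.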

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.59 ± 7.399
AlaCys: 0.716 ± 0.397
AlaAsp: 4.295 ± 1.302
AlaGlu: 3.579 ± 2.556
AlaPhe: 2.863 ± 1.587
AlaGly: 8.59 ± 3.083
AlaHis: 0.0 ± 0.0
AlaIle: 2.147 ± 2.661
AlaLys: 4.295 ± 2.092
AlaLeu: 7.158 ± 4.682
AlaMet: 0.0 ± 0.0
AlaAsn: 5.727 ± 1.743
AlaPro: 8.59 ± 3.216
AlaGln: 1.432 ± 3.08
AlaArg: 7.874 ± 2.339
AlaSer: 5.727 ± 1.994
AlaThr: 5.727 ± 2.789
AlaVal: 8.59 ± 3.216
AlaTrp: 0.716 ± 0.397
AlaTyr: 3.579 ± 1.984
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.432 ± 0.794
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.432 ± 0.794
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.716 ± 0.397
CysLeu: 1.432 ± 0.794
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.716 ± 0.986
CysGln: 0.0 ± 0.0
CysArg: 2.863 ± 0.678
CysSer: 0.716 ± 0.397
CysThr: 0.716 ± 0.397
CysVal: 1.432 ± 0.697
CysTrp: 0.0 ± 0.0
CysTyr: 0.716 ± 0.397
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.727 ± 1.357
AspCys: 1.432 ± 0.794
AspAsp: 3.579 ± 0.959
AspGlu: 2.147 ± 1.19
AspPhe: 2.863 ± 0.678
AspGly: 7.158 ± 2.818
AspHis: 2.863 ± 0.678
AspIle: 2.863 ± 1.587
AspLys: 2.863 ± 1.394
AspLeu: 2.863 ± 0.678
AspMet: 0.716 ± 0.397
AspAsn: 0.716 ± 0.986
AspPro: 2.863 ± 0.678
AspGln: 2.863 ± 2.528
AspArg: 5.011 ± 1.668
AspSer: 2.863 ± 2.64
AspThr: 5.011 ± 1.181
AspVal: 5.727 ± 2.534
AspTrp: 1.432 ± 0.697
AspTyr: 5.011 ± 1.668
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.295 ± 2.381
GluCys: 0.0 ± 0.0
GluAsp: 2.863 ± 0.678
GluGlu: 0.716 ± 0.397
GluPhe: 2.863 ± 0.678
GluGly: 1.432 ± 0.794
GluHis: 2.147 ± 1.19
GluIle: 2.147 ± 0.562
GluLys: 1.432 ± 0.794
GluLeu: 10.021 ± 3.13
GluMet: 1.432 ± 5.022
GluAsn: 2.147 ± 0.562
GluPro: 2.147 ± 2.661
GluGln: 2.863 ± 5.733
GluArg: 0.716 ± 0.397
GluSer: 1.432 ± 0.697
GluThr: 2.863 ± 3.11
GluVal: 2.147 ± 1.19
GluTrp: 0.0 ± 0.0
GluTyr: 2.147 ± 1.19
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.147 ± 1.19
PheCys: 0.716 ± 0.397
PheAsp: 2.863 ± 1.394
PheGlu: 2.863 ± 0.678
PhePhe: 0.0 ± 0.0
PheGly: 1.432 ± 0.697
PheHis: 0.0 ± 0.0
PheIle: 0.716 ± 0.397
PheLys: 2.147 ± 2.661
PheLeu: 2.147 ± 0.562
PheMet: 0.716 ± 0.397
PheAsn: 1.432 ± 0.697
PhePro: 0.716 ± 0.397
PheGln: 2.147 ± 0.562
PheArg: 1.432 ± 0.697
PheSer: 0.0 ± 0.0
PheThr: 3.579 ± 1.203
PheVal: 3.579 ± 2.305
PheTrp: 1.432 ± 0.794
PheTyr: 0.716 ± 0.397
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.295 ± 2.092
GlyCys: 0.716 ± 0.397
GlyAsp: 5.727 ± 1.743
GlyGlu: 2.863 ± 1.587
GlyPhe: 3.579 ± 2.348
GlyGly: 3.579 ± 2.764
GlyHis: 2.147 ± 0.562
GlyIle: 2.863 ± 1.587
GlyLys: 3.579 ± 1.203
GlyLeu: 5.011 ± 1.668
GlyMet: 1.432 ± 0.697
GlyAsn: 3.579 ± 1.984
GlyPro: 4.295 ± 2.381
GlyGln: 0.0 ± 0.0
GlyArg: 5.727 ± 1.743
GlySer: 2.863 ± 0.678
GlyThr: 3.579 ± 2.348
GlyVal: 2.863 ± 2.528
GlyTrp: 0.716 ± 0.397
GlyTyr: 5.011 ± 1.884
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.716 ± 0.986
HisCys: 0.0 ± 0.0
HisAsp: 0.716 ± 0.397
HisGlu: 0.716 ± 0.397
HisPhe: 0.0 ± 0.0
HisGly: 0.716 ± 0.397
HisHis: 0.0 ± 0.0
HisIle: 1.432 ± 0.697
HisLys: 2.147 ± 1.19
HisLeu: 4.295 ± 2.134
HisMet: 0.0 ± 0.0
HisAsn: 0.716 ± 0.986
HisPro: 0.0 ± 0.0
HisGln: 1.432 ± 0.794
HisArg: 1.432 ± 0.794
HisSer: 0.716 ± 0.397
HisThr: 2.863 ± 0.678
HisVal: 2.863 ± 1.587
HisTrp: 0.716 ± 0.397
HisTyr: 1.432 ± 0.794
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.295 ± 1.302
IleCys: 0.0 ± 0.0
IleAsp: 2.863 ± 2.528
IleGlu: 0.716 ± 0.397
IlePhe: 0.716 ± 0.986
IleGly: 2.147 ± 1.19
IleHis: 0.716 ± 0.397
IleIle: 1.432 ± 0.794
IleLys: 0.716 ± 0.397
IleLeu: 3.579 ± 1.203
IleMet: 0.0 ± 0.0
IleAsn: 0.716 ± 0.986
IlePro: 0.716 ± 0.397
IleGln: 1.432 ± 0.697
IleArg: 1.432 ± 0.794
IleSer: 5.727 ± 1.994
IleThr: 5.011 ± 1.181
IleVal: 5.727 ± 1.357
IleTrp: 0.716 ± 0.397
IleTyr: 0.716 ± 0.397
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.579 ± 2.348
LysCys: 2.147 ± 1.19
LysAsp: 2.147 ± 1.19
LysGlu: 3.579 ± 0.959
LysPhe: 0.716 ± 0.397
LysGly: 1.432 ± 0.697
LysHis: 0.716 ± 0.397
LysIle: 2.147 ± 0.562
LysLys: 2.147 ± 1.66
LysLeu: 2.863 ± 0.678
LysMet: 2.863 ± 2.579
LysAsn: 1.432 ± 0.794
LysPro: 3.579 ± 1.203
LysGln: 2.147 ± 1.19
LysArg: 2.147 ± 1.19
LysSer: 0.716 ± 0.397
LysThr: 2.147 ± 0.562
LysVal: 0.716 ± 0.397
LysTrp: 1.432 ± 0.697
LysTyr: 0.716 ± 0.397
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.021 ± 3.934
LeuCys: 1.432 ± 0.697
LeuAsp: 5.011 ± 1.668
LeuGlu: 3.579 ± 2.764
LeuPhe: 2.147 ± 2.789
LeuGly: 9.306 ± 0.956
LeuHis: 1.432 ± 0.697
LeuIle: 0.716 ± 0.397
LeuLys: 4.295 ± 2.381
LeuLeu: 8.59 ± 3.083
LeuMet: 0.716 ± 2.982
LeuAsn: 0.0 ± 0.0
LeuPro: 4.295 ± 3.321
LeuGln: 5.011 ± 1.668
LeuArg: 5.011 ± 2.026
LeuSer: 5.727 ± 2.046
LeuThr: 6.442 ± 2.43
LeuVal: 5.011 ± 2.026
LeuTrp: 1.432 ± 1.971
LeuTyr: 1.432 ± 0.794
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 1.432 ± 1.971
MetAsp: 1.432 ± 2.798
MetGlu: 2.147 ± 5.77
MetPhe: 0.716 ± 0.397
MetGly: 0.716 ± 0.397
MetHis: 1.432 ± 0.794
MetIle: 1.432 ± 0.794
MetLys: 0.0 ± 0.0
MetLeu: 0.716 ± 2.982
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.432 ± 0.794
MetGln: 0.0 ± 0.0
MetArg: 2.863 ± 2.579
MetSer: 1.432 ± 0.697
MetThr: 0.716 ± 0.397
MetVal: 0.716 ± 0.986
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.579 ± 0.959
AsnCys: 0.716 ± 0.397
AsnAsp: 4.295 ± 2.092
AsnGlu: 3.579 ± 2.556
AsnPhe: 1.432 ± 0.794
AsnGly: 0.716 ± 0.986
AsnHis: 0.0 ± 0.0
AsnIle: 2.147 ± 1.19
AsnLys: 0.716 ± 0.986
AsnLeu: 2.147 ± 0.562
AsnMet: 0.716 ± 0.863
AsnAsn: 3.579 ± 0.959
AsnPro: 0.0 ± 0.0
AsnGln: 0.716 ± 0.397
AsnArg: 1.432 ± 0.794
AsnSer: 2.863 ± 0.678
AsnThr: 1.432 ± 1.971
AsnVal: 3.579 ± 0.959
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.432 ± 0.794
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.442 ± 4.404
ProCys: 0.0 ± 0.0
ProAsp: 1.432 ± 0.697
ProGlu: 1.432 ± 1.971
ProPhe: 2.863 ± 1.587
ProGly: 3.579 ± 0.959
ProHis: 0.716 ± 0.397
ProIle: 2.147 ± 1.66
ProLys: 1.432 ± 0.794
ProLeu: 7.158 ± 1.81
ProMet: 1.432 ± 3.08
ProAsn: 0.716 ± 0.397
ProPro: 1.432 ± 0.697
ProGln: 2.863 ± 0.678
ProArg: 7.158 ± 1.721
ProSer: 5.727 ± 2.046
ProThr: 3.579 ± 1.203
ProVal: 2.863 ± 1.587
ProTrp: 1.432 ± 0.794
ProTyr: 2.147 ± 0.562
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.874 ± 1.474
GlnCys: 0.0 ± 0.0
GlnAsp: 2.863 ± 1.587
GlnGlu: 1.432 ± 0.794
GlnPhe: 1.432 ± 0.794
GlnGly: 0.716 ± 0.397
GlnHis: 1.432 ± 2.798
GlnIle: 2.147 ± 0.562
GlnLys: 1.432 ± 0.794
GlnLeu: 2.147 ± 2.789
GlnMet: 0.716 ± 0.397
GlnAsn: 0.716 ± 0.397
GlnPro: 3.579 ± 2.348
GlnGln: 2.863 ± 2.528
GlnArg: 7.874 ± 1.474
GlnSer: 2.863 ± 1.587
GlnThr: 2.147 ± 1.19
GlnVal: 2.863 ± 5.733
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.432 ± 0.697
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.158 ± 2.81
ArgCys: 0.0 ± 0.0
ArgAsp: 4.295 ± 2.381
ArgGlu: 2.863 ± 1.587
ArgPhe: 1.432 ± 0.794
ArgGly: 2.863 ± 0.678
ArgHis: 2.863 ± 1.587
ArgIle: 3.579 ± 1.203
ArgLys: 2.863 ± 1.394
ArgLeu: 6.442 ± 3.205
ArgMet: 0.716 ± 0.397
ArgAsn: 4.295 ± 2.381
ArgPro: 2.147 ± 0.562
ArgGln: 5.727 ± 2.046
ArgArg: 10.021 ± 1.245
ArgSer: 10.737 ± 3.713
ArgThr: 5.727 ± 1.851
ArgVal: 5.727 ± 1.994
ArgTrp: 1.432 ± 0.697
ArgTyr: 2.863 ± 2.528
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.295 ± 5.323
SerCys: 0.716 ± 0.397
SerAsp: 4.295 ± 1.302
SerGlu: 3.579 ± 0.959
SerPhe: 0.0 ± 0.0
SerGly: 5.727 ± 1.357
SerHis: 0.716 ± 0.397
SerIle: 3.579 ± 1.203
SerLys: 3.579 ± 1.984
SerLeu: 5.011 ± 1.668
SerMet: 0.0 ± 0.0
SerAsn: 2.147 ± 0.562
SerPro: 6.442 ± 2.43
SerGln: 2.147 ± 0.562
SerArg: 6.442 ± 1.613
SerSer: 4.295 ± 2.381
SerThr: 5.727 ± 1.743
SerVal: 5.727 ± 1.743
SerTrp: 0.716 ± 0.397
SerTyr: 1.432 ± 0.794
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.442 ± 2.43
ThrCys: 1.432 ± 0.794
ThrAsp: 9.306 ± 2.271
ThrGlu: 2.863 ± 2.579
ThrPhe: 4.295 ± 2.433
ThrGly: 6.442 ± 3.735
ThrHis: 0.716 ± 0.397
ThrIle: 2.863 ± 1.394
ThrLys: 1.432 ± 0.697
ThrLeu: 4.295 ± 2.092
ThrMet: 2.147 ± 1.19
ThrAsn: 0.716 ± 0.986
ThrPro: 3.579 ± 0.959
ThrGln: 5.727 ± 2.789
ThrArg: 2.863 ± 2.64
ThrSer: 4.295 ± 1.124
ThrThr: 5.727 ± 1.743
ThrVal: 5.011 ± 2.125
ThrTrp: 0.716 ± 0.397
ThrTyr: 2.863 ± 1.587
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.727 ± 5.056
ValCys: 0.0 ± 0.0
ValAsp: 5.011 ± 1.884
ValGlu: 5.011 ± 2.026
ValPhe: 1.432 ± 0.697
ValGly: 4.295 ± 2.092
ValHis: 2.863 ± 0.678
ValIle: 2.863 ± 2.579
ValLys: 2.863 ± 1.587
ValLeu: 1.432 ± 0.697
ValMet: 1.432 ± 0.627
ValAsn: 2.147 ± 2.789
ValPro: 7.158 ± 2.405
ValGln: 5.011 ± 5.226
ValArg: 8.59 ± 1.177
ValSer: 4.295 ± 1.302
ValThr: 5.727 ± 1.994
ValVal: 7.158 ± 2.81
ValTrp: 0.0 ± 0.0
ValTyr: 2.147 ± 0.562
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.432 ± 0.794
TrpCys: 0.716 ± 0.397
TrpAsp: 0.716 ± 0.986
TrpGlu: 0.716 ± 0.986
TrpPhe: 0.0 ± 0.0
TrpGly: 0.716 ± 0.986
TrpHis: 0.716 ± 0.986
TrpIle: 0.716 ± 0.397
TrpLys: 0.0 ± 0.0
TrpLeu: 0.716 ± 0.397
TrpMet: 0.0 ± 0.0
TrpAsn: 1.432 ± 0.794
TrpPro: 0.716 ± 0.397
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.432 ± 0.697
TrpThr: 1.432 ± 0.697
TrpVal: 1.432 ± 0.794
TrpTrp: 0.716 ± 0.986
TrpTyr: 0.716 ± 0.397
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.579 ± 0.959
TyrCys: 0.0 ± 0.0
TyrAsp: 1.432 ± 0.794
TyrGlu: 1.432 ± 0.794
TyrPhe: 1.432 ± 0.697
TyrGly: 1.432 ± 0.794
TyrHis: 1.432 ± 0.794
TyrIle: 1.432 ± 0.794
TyrLys: 0.716 ± 0.397
TyrLeu: 3.579 ± 1.984
TyrMet: 1.432 ± 0.794
TyrAsn: 2.863 ± 0.678
TyrPro: 2.147 ± 2.661
TyrGln: 2.147 ± 1.19
TyrArg: 2.863 ± 1.587
TyrSer: 2.147 ± 0.562
TyrThr: 3.579 ± 1.203
TyrVal: 2.147 ± 0.562
TyrTrp: 0.716 ± 0.986
TyrTyr: 1.432 ± 0.697
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1398 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
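One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement and recompute the frequency each time. The sketch below follows that reading; the function name `bootstrap_sd`, the fixed seed, the use of the population standard deviation as the reported error, and the toy sequences are all assumptions, as the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_sd(sequences, dipeptide, n_resamples=100, seed=0):
    """Spread (in per mille) of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy input (hypothetical sequences, not SjNNV data).
toy = ["MAAGLAA", "AAKRR", "GLMW"]
sd_aa = bootstrap_sd(toy, "AA")
sd_absent = bootstrap_sd(toy, "WW")  # never observed, so every resample gives 0
```

With only a few proteins, as in this proteome, each resample can differ strongly from the original set, so under this reading large errors relative to the mean would not be surprising.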

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski