Amino acid dipeptide frequency for Australian Anopheles totivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
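The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's own pipeline: the function names `dipeptide_permille` and `classify` are hypothetical, the sketch uses one-letter codes (the table uses three-letter codes), and it assumes that overlapping dipeptides are counted within each protein and pooled over the proteome.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids
ALPHABET = STANDARD + "X"           # X (Xaa) stands for anything else

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and the
    counts are pooled before normalising.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

def classify(freq_permille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

freqs = dipeptide_permille(["MGGSLV", "GGAXL"])
print(len(freqs), freqs["GG"], classify(freqs["GG"]))
```

The returned dictionary always has 441 entries, so dipeptides that never occur appear with a frequency of 0.0, as in the table.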

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
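As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the GlyGly value (16.367‰) is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_permille / 10.0

gly_gly = 16.367  # GlyGly frequency from the table, in per mille
print(permille_to_fraction(gly_gly))  # about 0.0164
print(permille_to_percent(gly_gly))   # about 1.64 (percent)
```

For comparison, the uniform expectation of 0.25% corresponds to 2.5‰ on this scale.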

For more information, see the sequence space article on Wikipedia.

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.696 ± 0.729 | 0.0 ± 0.0 | 4.752 ± 0.841 | 2.64 ± 0.102 | 1.584 ± 0.94 | 5.808 ± 0.518 | 1.056 ± 0.626 | 4.224 ± 1.042 | 4.752 ± 0.841 | 7.92 ± 0.425 | 2.112 ± 0.521 | 2.64 ± 0.102 | 3.168 ± 1.147 | 1.056 ± 0.105 | 2.64 ± 0.102 | 6.336 ± 0.099 | 4.224 ± 1.774 | 4.224 ± 0.31 | 1.584 ± 0.94 | 0.528 ± 0.313 | 0.0 ± 0.0
Cys | 1.056 ± 0.626 | 0.528 ± 0.419 | 0.0 ± 0.0 | 0.528 ± 0.419 | 0.0 ± 0.0 | 0.528 ± 0.313 | 0.0 ± 0.0 | 1.056 ± 0.837 | 0.528 ± 0.313 | 1.056 ± 0.626 | 0.528 ± 0.313 | 0.528 ± 0.313 | 1.056 ± 0.837 | 0.528 ± 0.419 | 1.056 ± 0.837 | 1.056 ± 0.837 | 0.528 ± 0.313 | 3.168 ± 0.316 | 0.0 ± 0.0 | 0.528 ± 0.419 | 0.0 ± 0.0
Asp | 2.112 ± 0.211 | 1.056 ± 0.105 | 4.224 ± 0.422 | 4.752 ± 0.623 | 1.584 ± 0.94 | 5.808 ± 1.249 | 3.168 ± 0.415 | 3.168 ± 0.415 | 1.056 ± 0.105 | 4.224 ± 1.154 | 1.584 ± 0.202 | 1.056 ± 0.105 | 4.752 ± 2.087 | 1.056 ± 0.626 | 3.696 ± 1.467 | 6.336 ± 2.097 | 1.584 ± 0.208 | 6.864 ± 1.052 | 2.64 ± 0.834 | 1.056 ± 0.105 | 0.0 ± 0.0
Glu | 7.392 ± 2.921 | 0.528 ± 0.419 | 2.112 ± 0.943 | 3.696 ± 0.729 | 2.64 ± 1.361 | 1.584 ± 0.208 | 0.0 ± 0.0 | 2.112 ± 0.211 | 2.64 ± 1.566 | 3.168 ± 0.316 | 1.584 ± 0.524 | 1.056 ± 0.837 | 2.112 ± 1.253 | 0.528 ± 0.313 | 1.584 ± 0.524 | 4.752 ± 2.087 | 1.056 ± 0.105 | 6.336 ± 1.563 | 1.584 ± 0.524 | 1.584 ± 0.524 | 0.0 ± 0.0
Phe | 3.696 ± 0.003 | 0.0 ± 0.0 | 2.64 ± 0.63 | 1.584 ± 0.208 | 1.056 ± 0.105 | 5.808 ± 0.946 | 0.0 ± 0.0 | 2.64 ± 0.63 | 2.64 ± 1.361 | 3.168 ± 1.78 | 1.584 ± 1.256 | 0.0 ± 0.0 | 1.584 ± 0.208 | 0.0 ± 0.0 | 1.584 ± 0.524 | 3.168 ± 1.048 | 1.584 ± 0.94 | 4.752 ± 0.841 | 0.528 ± 0.419 | 1.584 ± 0.524 | 0.0 ± 0.0
Gly | 6.336 ± 2.294 | 0.528 ± 0.419 | 3.696 ± 0.729 | 4.752 ± 0.109 | 4.224 ± 0.31 | 16.367 ± 3.122 | 3.696 ± 1.46 | 2.64 ± 0.63 | 3.168 ± 0.316 | 6.336 ± 0.633 | 2.112 ± 0.521 | 5.28 ± 0.527 | 3.696 ± 0.729 | 1.056 ± 0.626 | 5.28 ± 1.991 | 6.864 ± 2.608 | 3.168 ± 1.879 | 4.752 ± 1.355 | 2.112 ± 0.943 | 2.64 ± 0.102 | 0.0 ± 0.0
His | 0.528 ± 0.419 | 1.584 ± 0.208 | 2.112 ± 1.253 | 0.528 ± 0.313 | 0.528 ± 0.419 | 1.584 ± 0.208 | 0.0 ± 0.0 | 1.056 ± 0.626 | 0.528 ± 0.313 | 1.584 ± 0.524 | 0.0 ± 0.0 | 1.056 ± 0.105 | 1.056 ± 0.626 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.752 ± 0.841 | 1.056 ± 0.105 | 1.584 ± 0.208 | 1.056 ± 0.626 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 1.584 ± 1.256 | 1.584 ± 0.208 | 3.696 ± 0.003 | 1.056 ± 0.626 | 1.056 ± 0.837 | 4.224 ± 1.774 | 1.584 ± 0.208 | 2.64 ± 1.361 | 2.112 ± 0.211 | 3.696 ± 1.46 | 1.056 ± 0.105 | 2.112 ± 0.211 | 1.584 ± 0.94 | 1.056 ± 0.626 | 3.168 ± 0.316 | 5.28 ± 0.204 | 4.752 ± 0.841 | 3.696 ± 2.199 | 0.0 ± 0.0 | 0.528 ± 0.313 | 0.0 ± 0.0
Lys | 3.168 ± 1.147 | 1.056 ± 0.837 | 2.112 ± 0.943 | 1.584 ± 0.208 | 3.696 ± 0.003 | 3.696 ± 0.729 | 1.056 ± 0.105 | 4.224 ± 1.042 | 5.808 ± 0.946 | 5.28 ± 0.204 | 1.056 ± 0.626 | 3.168 ± 1.78 | 2.112 ± 0.521 | 2.112 ± 0.521 | 0.528 ± 0.313 | 3.168 ± 0.415 | 3.696 ± 1.467 | 4.224 ± 0.31 | 3.696 ± 1.467 | 3.168 ± 1.048 | 0.0 ± 0.0
Leu | 5.808 ± 0.518 | 0.528 ± 0.419 | 4.752 ± 0.109 | 2.64 ± 0.102 | 3.168 ± 1.78 | 7.392 ± 2.202 | 2.112 ± 0.943 | 4.224 ± 1.774 | 4.752 ± 2.304 | 9.504 ± 0.218 | 2.112 ± 0.369 | 3.696 ± 1.46 | 3.696 ± 1.46 | 1.056 ± 0.837 | 4.752 ± 0.841 | 10.032 ± 4.295 | 4.752 ± 0.109 | 5.28 ± 0.527 | 0.528 ± 0.313 | 2.112 ± 0.211 | 0.0 ± 0.0
Met | 2.112 ± 0.211 | 0.528 ± 0.419 | 1.584 ± 0.208 | 0.528 ± 0.313 | 1.056 ± 0.837 | 0.528 ± 0.313 | 0.528 ± 0.313 | 0.0 ± 0.0 | 3.168 ± 1.147 | 1.584 ± 0.524 | 2.112 ± 0.211 | 2.64 ± 2.093 | 1.584 ± 0.94 | 0.0 ± 0.0 | 1.056 ± 0.105 | 3.168 ± 1.147 | 1.584 ± 0.524 | 1.056 ± 0.626 | 1.056 ± 0.105 | 0.528 ± 0.313 | 0.0 ± 0.0
Asn | 1.584 ± 1.256 | 0.0 ± 0.0 | 4.752 ± 0.109 | 2.112 ± 0.521 | 2.112 ± 0.211 | 5.808 ± 0.518 | 1.584 ± 0.524 | 2.112 ± 1.675 | 3.168 ± 0.415 | 2.112 ± 0.211 | 0.0 ± 0.0 | 3.696 ± 1.467 | 2.64 ± 0.63 | 0.0 ± 0.0 | 2.112 ± 0.521 | 4.224 ± 0.31 | 4.224 ± 0.31 | 2.64 ± 0.63 | 0.0 ± 0.0 | 1.584 ± 0.524 | 0.0 ± 0.0
Pro | 1.584 ± 0.94 | 0.528 ± 0.313 | 4.224 ± 1.774 | 0.528 ± 0.313 | 2.112 ± 0.521 | 3.696 ± 2.192 | 1.584 ± 0.208 | 1.056 ± 0.105 | 3.168 ± 1.147 | 4.752 ± 1.355 | 0.528 ± 0.419 | 0.528 ± 0.313 | 1.056 ± 0.837 | 1.056 ± 0.105 | 1.584 ± 1.256 | 4.752 ± 0.623 | 2.112 ± 1.253 | 4.224 ± 1.774 | 1.056 ± 0.105 | 1.584 ± 0.524 | 0.0 ± 0.0
Gln | 2.64 ± 0.834 | 0.528 ± 0.419 | 0.0 ± 0.0 | 1.056 ± 0.105 | 0.528 ± 0.419 | 2.112 ± 0.211 | 0.0 ± 0.0 | 1.056 ± 0.626 | 1.056 ± 0.626 | 0.528 ± 0.313 | 1.584 ± 0.208 | 1.056 ± 0.105 | 0.0 ± 0.0 | 0.528 ± 0.419 | 2.112 ± 0.211 | 2.112 ± 0.521 | 0.0 ± 0.0 | 2.64 ± 0.63 | 0.0 ± 0.0 | 2.112 ± 1.253 | 0.0 ± 0.0
Arg | 2.112 ± 0.521 | 1.056 ± 0.837 | 4.224 ± 1.154 | 0.528 ± 0.313 | 4.752 ± 2.304 | 3.696 ± 0.003 | 0.0 ± 0.0 | 1.056 ± 0.626 | 2.112 ± 0.943 | 4.224 ± 0.422 | 1.584 ± 0.524 | 2.64 ± 1.361 | 1.056 ± 0.105 | 2.112 ± 0.943 | 3.168 ± 1.879 | 5.808 ± 0.946 | 2.112 ± 0.521 | 4.752 ± 0.841 | 1.584 ± 1.256 | 1.584 ± 0.524 | 0.0 ± 0.0
Ser | 5.28 ± 0.204 | 2.112 ± 0.521 | 3.696 ± 0.003 | 4.752 ± 1.572 | 4.224 ± 1.154 | 8.976 ± 2.397 | 1.584 ± 0.208 | 4.224 ± 1.886 | 4.752 ± 2.304 | 8.976 ± 2.726 | 2.112 ± 0.211 | 4.224 ± 1.154 | 4.224 ± 0.31 | 2.64 ± 1.566 | 6.336 ± 1.365 | 7.92 ± 1.157 | 6.336 ± 0.099 | 9.504 ± 1.246 | 3.696 ± 1.46 | 3.168 ± 0.316 | 0.0 ± 0.0
Thr | 4.752 ± 0.109 | 1.584 ± 0.94 | 4.752 ± 2.819 | 6.336 ± 0.831 | 1.056 ± 0.837 | 1.584 ± 0.524 | 0.0 ± 0.0 | 3.168 ± 1.147 | 4.224 ± 0.31 | 3.696 ± 1.467 | 1.584 ± 0.208 | 3.168 ± 1.147 | 2.64 ± 0.102 | 1.056 ± 0.626 | 2.64 ± 1.361 | 7.392 ± 1.457 | 2.64 ± 0.102 | 1.584 ± 0.208 | 1.056 ± 0.626 | 0.0 ± 0.0 | 0.0 ± 0.0
Val | 5.28 ± 0.204 | 0.528 ± 0.419 | 6.336 ± 0.099 | 5.808 ± 1.249 | 2.112 ± 0.943 | 7.92 ± 1.038 | 1.584 ± 0.208 | 2.64 ± 0.63 | 6.336 ± 0.831 | 5.808 ± 2.41 | 1.056 ± 0.626 | 3.696 ± 0.729 | 2.64 ± 1.566 | 5.28 ± 0.204 | 2.64 ± 0.63 | 6.864 ± 3.979 | 5.808 ± 1.249 | 5.28 ± 1.668 | 0.528 ± 0.313 | 1.584 ± 0.208 | 0.0 ± 0.0
Trp | 1.584 ± 0.524 | 0.0 ± 0.0 | 1.056 ± 0.105 | 0.528 ± 0.313 | 2.112 ± 0.211 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.528 ± 0.419 | 1.056 ± 0.105 | 4.224 ± 0.31 | 0.528 ± 0.313 | 0.528 ± 0.313 | 0.528 ± 0.313 | 0.0 ± 0.0 | 3.168 ± 1.147 | 1.056 ± 0.837 | 3.168 ± 0.415 | 1.584 ± 0.524 | 2.112 ± 0.521 | 1.584 ± 0.208 | 0.0 ± 0.0
Tyr | 2.64 ± 0.102 | 0.0 ± 0.0 | 1.056 ± 0.837 | 2.112 ± 0.211 | 0.0 ± 0.0 | 1.584 ± 0.524 | 1.056 ± 0.105 | 2.64 ± 0.102 | 1.584 ± 0.208 | 1.584 ± 0.524 | 1.056 ± 0.105 | 3.696 ± 0.729 | 0.528 ± 0.419 | 0.528 ± 0.419 | 1.056 ± 0.837 | 3.696 ± 0.729 | 0.528 ± 0.313 | 1.584 ± 0.524 | 0.528 ± 0.313 | 1.584 ± 0.208 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2 proteins (1895 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
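One way to read "bootstrapping at the protein level" is sketched below. It is a guess at the procedure, not the database's actual code: whole proteins are resampled with replacement 100 times, the frequency of one dipeptide is recomputed for each replicate, and the spread of those values is reported as the error. The function names, the fixed seed, the use of one-letter codes, and the choice of the population standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide, pooled over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome holds, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

proteome = ["MGGSLVGG", "MKLSSAVD"]  # toy sequences, not the real proteome
print(pair_permille(proteome, "GG"), bootstrap_error(proteome, "GG"))
```

Under this reading, a dipeptide that never occurs gets an error of exactly zero, which matches the 0.0 ± 0.0 entries in the table. With only two proteins each replicate can take very few distinct values, so the estimated errors should be read with caution.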

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski