Amino acid dipeptide frequency for Aedes anphevirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
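The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the table: the function names, the use of one-letter codes, the toy sequences, and the choice to count overlapping pairs within each protein only are all assumptions of this sketch.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumptions of this sketch: overlapping pairs are counted within each
    protein, no pair spans two proteins, and every residue outside the
    20 standard ones is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy input; '?' stands for an unknown residue and becomes 'X'.
toy = ["MKLLSA", "LSLLK?"]
freqs = dipeptide_per_mille(toy)
```

With this helper, `classify(7.258)` returns `"over"` and `classify(0.403)` returns `"under"`, which corresponds to the red and blue marking described above.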

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
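As a small worked example of the unit conversion, the snippet below converts a per-mille value to a fraction and to percent. The function names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 7.258  # AlaAla entry from the table, in per mille
ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.007258
ala_ala_percent = per_mille_to_percent(ala_ala)    # about 0.7258 %

# The uniform expectation of 0.25 % corresponds to 2.5 per mille.
uniform_fraction = per_mille_to_fraction(2.5)
```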

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide: frequency ± error (‰), grouped by first residue
Ala
AlaAla: 7.258 ± 3.936
AlaCys: 2.016 ± 0.923
AlaAsp: 2.419 ± 1.108
AlaGlu: 4.435 ± 0.043
AlaPhe: 3.629 ± 1.661
AlaGly: 0.403 ± 0.185
AlaHis: 0.403 ± 0.185
AlaIle: 3.226 ± 0.597
AlaLys: 2.419 ± 0.966
AlaLeu: 7.258 ± 4.972
AlaMet: 2.016 ± 0.923
AlaAsn: 2.823 ± 0.255
AlaPro: 2.016 ± 0.114
AlaGln: 1.613 ± 1.335
AlaArg: 3.226 ± 0.597
AlaSer: 4.032 ± 1.265
AlaThr: 4.032 ± 0.809
AlaVal: 5.242 ± 1.748
AlaTrp: 0.403 ± 0.185
AlaTyr: 1.613 ± 0.299
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.226 ± 1.634
CysCys: 0.806 ± 0.369
CysAsp: 0.806 ± 0.369
CysGlu: 2.016 ± 0.923
CysPhe: 0.403 ± 0.185
CysGly: 2.016 ± 0.923
CysHis: 0.806 ± 0.668
CysIle: 1.21 ± 0.554
CysLys: 0.806 ± 0.369
CysLeu: 0.806 ± 0.369
CysMet: 0.403 ± 0.185
CysAsn: 0.403 ± 0.185
CysPro: 0.806 ± 0.668
CysGln: 0.0 ± 0.0
CysArg: 2.419 ± 1.108
CysSer: 0.806 ± 0.369
CysThr: 1.21 ± 0.554
CysVal: 0.806 ± 0.369
CysTrp: 0.0 ± 0.0
CysTyr: 1.613 ± 0.738
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.823 ± 1.292
AspCys: 1.21 ± 0.554
AspAsp: 1.613 ± 0.738
AspGlu: 4.032 ± 2.302
AspPhe: 2.823 ± 0.255
AspGly: 2.823 ± 1.819
AspHis: 1.21 ± 0.483
AspIle: 2.823 ± 0.782
AspLys: 3.226 ± 1.477
AspLeu: 5.645 ± 1.547
AspMet: 2.016 ± 0.114
AspAsn: 2.016 ± 0.114
AspPro: 3.629 ± 0.624
AspGln: 1.613 ± 0.299
AspArg: 2.419 ± 0.071
AspSer: 2.823 ± 1.292
AspThr: 2.016 ± 0.923
AspVal: 2.823 ± 1.292
AspTrp: 1.21 ± 0.483
AspTyr: 1.613 ± 0.738
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.016 ± 0.923
GluCys: 0.403 ± 0.185
GluAsp: 5.242 ± 1.363
GluGlu: 3.629 ± 2.486
GluPhe: 4.032 ± 3.339
GluGly: 4.839 ± 0.896
GluHis: 1.613 ± 0.738
GluIle: 4.032 ± 0.228
GluLys: 4.435 ± 0.994
GluLeu: 7.661 ± 0.64
GluMet: 1.613 ± 0.665
GluAsn: 2.419 ± 0.071
GluPro: 2.016 ± 0.923
GluGln: 0.806 ± 0.369
GluArg: 5.242 ± 0.711
GluSer: 3.629 ± 1.661
GluThr: 4.032 ± 1.265
GluVal: 2.419 ± 0.071
GluTrp: 1.613 ± 0.299
GluTyr: 0.806 ± 0.369
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.806 ± 0.369
PheCys: 1.613 ± 0.738
PheAsp: 2.419 ± 0.966
PheGlu: 2.016 ± 0.114
PhePhe: 2.419 ± 0.071
PheGly: 2.823 ± 1.292
PheHis: 1.21 ± 0.554
PheIle: 2.419 ± 0.071
PheLys: 3.629 ± 0.624
PheLeu: 6.048 ± 2.769
PheMet: 0.0 ± 0.453
PheAsn: 2.016 ± 0.923
PhePro: 2.419 ± 0.071
PheGln: 2.016 ± 0.114
PheArg: 2.016 ± 1.151
PheSer: 4.435 ± 0.994
PheThr: 1.613 ± 1.335
PheVal: 2.419 ± 0.966
PheTrp: 0.403 ± 0.852
PheTyr: 1.613 ± 0.299
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.016 ± 0.114
GlyCys: 1.21 ± 0.483
GlyAsp: 2.419 ± 0.071
GlyGlu: 2.823 ± 0.782
GlyPhe: 2.016 ± 0.114
GlyGly: 3.629 ± 0.624
GlyHis: 2.016 ± 0.923
GlyIle: 2.823 ± 0.782
GlyLys: 5.645 ± 0.526
GlyLeu: 4.435 ± 0.043
GlyMet: 2.823 ± 0.255
GlyAsn: 1.613 ± 0.738
GlyPro: 1.613 ± 0.738
GlyGln: 1.613 ± 0.299
GlyArg: 3.629 ± 0.624
GlySer: 4.032 ± 1.846
GlyThr: 3.629 ± 1.449
GlyVal: 6.048 ± 1.379
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.613 ± 0.299
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.806 ± 0.369
HisCys: 0.806 ± 0.369
HisAsp: 0.403 ± 0.185
HisGlu: 0.806 ± 0.369
HisPhe: 1.21 ± 0.554
HisGly: 0.806 ± 0.369
HisHis: 0.0 ± 0.0
HisIle: 0.806 ± 0.369
HisLys: 0.806 ± 0.668
HisLeu: 2.016 ± 0.114
HisMet: 1.21 ± 0.483
HisAsn: 0.403 ± 0.185
HisPro: 0.806 ± 0.369
HisGln: 2.419 ± 0.966
HisArg: 2.823 ± 1.292
HisSer: 1.21 ± 0.554
HisThr: 2.823 ± 0.782
HisVal: 0.806 ± 0.668
HisTrp: 0.0 ± 0.0
HisTyr: 0.403 ± 0.185
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.629 ± 1.449
IleCys: 2.419 ± 0.966
IleAsp: 5.242 ± 1.748
IleGlu: 3.226 ± 1.477
IlePhe: 4.032 ± 2.302
IleGly: 3.629 ± 0.624
IleHis: 0.403 ± 0.852
IleIle: 1.613 ± 0.738
IleLys: 5.242 ± 1.748
IleLeu: 7.258 ± 1.249
IleMet: 2.016 ± 0.114
IleAsn: 3.226 ± 1.634
IlePro: 4.032 ± 0.228
IleGln: 2.016 ± 0.114
IleArg: 6.048 ± 0.342
IleSer: 5.645 ± 1.563
IleThr: 2.823 ± 0.782
IleVal: 2.823 ± 0.782
IleTrp: 0.0 ± 0.0
IleTyr: 2.419 ± 0.071
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.629 ± 0.624
LysCys: 1.21 ± 0.554
LysAsp: 4.435 ± 2.03
LysGlu: 4.435 ± 0.994
LysPhe: 1.21 ± 0.554
LysGly: 4.032 ± 2.302
LysHis: 1.21 ± 0.554
LysIle: 5.242 ± 1.748
LysLys: 4.435 ± 0.994
LysLeu: 8.871 ± 1.123
LysMet: 1.613 ± 0.738
LysAsn: 1.613 ± 0.738
LysPro: 3.629 ± 0.412
LysGln: 1.21 ± 0.554
LysArg: 2.016 ± 0.114
LysSer: 4.435 ± 1.08
LysThr: 3.226 ± 0.597
LysVal: 2.823 ± 0.782
LysTrp: 1.613 ± 0.299
LysTyr: 1.21 ± 0.554
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.065 ± 0.456
LeuCys: 2.419 ± 0.966
LeuAsp: 4.839 ± 0.141
LeuGlu: 7.661 ± 2.47
LeuPhe: 3.226 ± 0.44
LeuGly: 5.242 ± 1.748
LeuHis: 1.21 ± 0.483
LeuIle: 6.048 ± 1.379
LeuLys: 7.258 ± 1.249
LeuLeu: 9.677 ± 1.791
LeuMet: 1.613 ± 0.299
LeuAsn: 4.435 ± 2.117
LeuPro: 6.855 ± 1.064
LeuGln: 4.032 ± 2.302
LeuArg: 4.839 ± 0.141
LeuSer: 12.5 ± 0.538
LeuThr: 5.242 ± 0.326
LeuVal: 4.839 ± 1.932
LeuTrp: 0.806 ± 0.369
LeuTyr: 2.823 ± 0.255
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.823 ± 0.782
MetCys: 0.0 ± 0.0
MetAsp: 1.613 ± 0.299
MetGlu: 1.21 ± 0.554
MetPhe: 1.613 ± 0.738
MetGly: 3.226 ± 1.477
MetHis: 1.21 ± 0.483
MetIle: 1.613 ± 0.299
MetLys: 1.21 ± 0.554
MetLeu: 2.419 ± 0.071
MetMet: 0.806 ± 0.369
MetAsn: 2.823 ± 0.782
MetPro: 0.0 ± 0.0
MetGln: 0.806 ± 0.369
MetArg: 3.629 ± 1.661
MetSer: 2.016 ± 0.923
MetThr: 3.226 ± 0.597
MetVal: 1.21 ± 1.52
MetTrp: 0.403 ± 0.185
MetTyr: 2.016 ± 0.923
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.419 ± 0.966
AsnCys: 0.403 ± 0.185
AsnAsp: 1.613 ± 0.738
AsnGlu: 2.419 ± 0.071
AsnPhe: 2.823 ± 0.782
AsnGly: 0.806 ± 0.369
AsnHis: 0.806 ± 0.369
AsnIle: 4.435 ± 0.043
AsnLys: 2.419 ± 0.071
AsnLeu: 2.823 ± 0.255
AsnMet: 2.016 ± 0.114
AsnAsn: 1.21 ± 0.554
AsnPro: 2.823 ± 0.782
AsnGln: 1.613 ± 0.299
AsnArg: 1.21 ± 0.554
AsnSer: 2.419 ± 1.108
AsnThr: 3.226 ± 0.44
AsnVal: 1.613 ± 0.738
AsnTrp: 1.21 ± 0.483
AsnTyr: 1.613 ± 0.299
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.016 ± 0.923
ProCys: 0.806 ± 0.369
ProAsp: 4.839 ± 2.215
ProGlu: 2.419 ± 3.04
ProPhe: 2.016 ± 0.114
ProGly: 2.823 ± 0.782
ProHis: 0.403 ± 0.185
ProIle: 4.032 ± 0.809
ProLys: 2.016 ± 2.188
ProLeu: 5.645 ± 0.526
ProMet: 1.21 ± 0.554
ProAsn: 2.016 ± 0.923
ProPro: 2.016 ± 0.923
ProGln: 2.016 ± 0.114
ProArg: 2.016 ± 1.151
ProSer: 2.016 ± 0.923
ProThr: 2.419 ± 1.108
ProVal: 2.823 ± 0.255
ProTrp: 0.0 ± 0.0
ProTyr: 2.823 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.613 ± 0.299
GlnCys: 1.21 ± 0.554
GlnAsp: 0.806 ± 0.668
GlnGlu: 2.823 ± 0.782
GlnPhe: 0.806 ± 0.369
GlnGly: 2.823 ± 0.255
GlnHis: 1.21 ± 0.483
GlnIle: 2.419 ± 1.108
GlnLys: 2.016 ± 0.114
GlnLeu: 2.419 ± 2.003
GlnMet: 0.806 ± 0.369
GlnAsn: 1.613 ± 0.738
GlnPro: 0.403 ± 0.185
GlnGln: 0.403 ± 0.185
GlnArg: 2.823 ± 0.255
GlnSer: 2.823 ± 2.855
GlnThr: 0.806 ± 0.668
GlnVal: 2.823 ± 1.292
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.403 ± 0.185
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.032 ± 1.265
ArgCys: 2.016 ± 0.923
ArgAsp: 2.016 ± 1.151
ArgGlu: 5.242 ± 1.363
ArgPhe: 2.016 ± 0.114
ArgGly: 2.016 ± 0.114
ArgHis: 1.21 ± 0.483
ArgIle: 4.839 ± 1.932
ArgLys: 2.823 ± 0.782
ArgLeu: 5.645 ± 0.526
ArgMet: 2.823 ± 1.292
ArgAsn: 1.21 ± 0.554
ArgPro: 2.016 ± 0.923
ArgGln: 2.419 ± 1.108
ArgArg: 3.629 ± 0.624
ArgSer: 6.855 ± 2.101
ArgThr: 2.823 ± 0.255
ArgVal: 3.226 ± 1.477
ArgTrp: 0.403 ± 0.852
ArgTyr: 2.419 ± 0.071
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.839 ± 0.141
SerCys: 0.806 ± 0.369
SerAsp: 2.419 ± 0.966
SerGlu: 3.226 ± 1.477
SerPhe: 4.435 ± 2.03
SerGly: 4.032 ± 0.228
SerHis: 2.419 ± 1.108
SerIle: 8.065 ± 1.493
SerLys: 3.226 ± 1.477
SerLeu: 10.081 ± 0.467
SerMet: 3.629 ± 0.624
SerAsn: 3.629 ± 0.624
SerPro: 3.226 ± 1.634
SerGln: 2.016 ± 0.114
SerArg: 3.226 ± 0.44
SerSer: 6.452 ± 1.917
SerThr: 4.839 ± 2.215
SerVal: 2.016 ± 0.923
SerTrp: 1.613 ± 1.335
SerTyr: 6.048 ± 1.379
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.629 ± 0.412
ThrCys: 0.806 ± 0.369
ThrAsp: 1.613 ± 0.738
ThrGlu: 4.032 ± 0.228
ThrPhe: 2.016 ± 0.923
ThrGly: 4.032 ± 0.809
ThrHis: 1.21 ± 0.554
ThrIle: 4.839 ± 1.932
ThrLys: 3.629 ± 1.661
ThrLeu: 4.839 ± 0.896
ThrMet: 2.823 ± 0.782
ThrAsn: 4.032 ± 0.228
ThrPro: 1.613 ± 0.299
ThrGln: 1.613 ± 0.738
ThrArg: 2.016 ± 0.114
ThrSer: 4.839 ± 1.932
ThrThr: 1.613 ± 0.299
ThrVal: 3.629 ± 1.661
ThrTrp: 2.016 ± 0.923
ThrTyr: 1.613 ± 0.738
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.629 ± 1.449
ValCys: 0.403 ± 0.185
ValAsp: 3.226 ± 0.44
ValGlu: 3.226 ± 0.597
ValPhe: 2.016 ± 0.114
ValGly: 2.419 ± 0.071
ValHis: 1.613 ± 0.299
ValIle: 5.645 ± 1.563
ValLys: 4.032 ± 1.265
ValLeu: 4.435 ± 0.043
ValMet: 1.613 ± 0.738
ValAsn: 1.21 ± 0.483
ValPro: 1.21 ± 1.52
ValGln: 1.613 ± 0.738
ValArg: 3.226 ± 1.477
ValSer: 5.645 ± 0.51
ValThr: 4.435 ± 2.03
ValVal: 5.242 ± 1.363
ValTrp: 0.806 ± 0.369
ValTyr: 0.403 ± 0.852
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.806 ± 0.369
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.21 ± 0.483
TrpPhe: 0.0 ± 0.0
TrpGly: 1.21 ± 0.554
TrpHis: 0.0 ± 0.0
TrpIle: 0.806 ± 0.668
TrpLys: 1.21 ± 0.483
TrpLeu: 1.613 ± 0.738
TrpMet: 1.21 ± 0.483
TrpAsn: 0.403 ± 0.185
TrpPro: 0.403 ± 0.185
TrpGln: 0.403 ± 0.852
TrpArg: 1.613 ± 0.299
TrpSer: 0.403 ± 0.852
TrpThr: 0.403 ± 0.185
TrpVal: 0.403 ± 0.852
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.403 ± 0.185
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.21 ± 1.52
TyrCys: 0.0 ± 0.0
TyrAsp: 2.419 ± 0.071
TyrGlu: 1.613 ± 0.299
TyrPhe: 2.016 ± 0.923
TyrGly: 2.016 ± 0.114
TyrHis: 1.21 ± 0.554
TyrIle: 0.806 ± 0.668
TyrLys: 1.613 ± 0.299
TyrLeu: 4.032 ± 1.265
TyrMet: 1.21 ± 0.483
TyrAsn: 0.806 ± 0.369
TyrPro: 4.839 ± 1.178
TyrGln: 0.806 ± 0.369
TyrArg: 1.613 ± 0.299
TyrSer: 3.226 ± 1.477
TyrThr: 2.016 ± 0.923
TyrVal: 2.016 ± 0.923
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.806 ± 0.369
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2481 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
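A protein-level bootstrap of this kind might look roughly like the sketch below. It resamples whole proteins with replacement and recomputes the frequency for each resample. The function names, the use of one-letter codes, and the use of the standard deviation across resamples as the error are assumptions of this sketch; the page does not state exactly how the reported error is derived.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein list."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping pairs within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples.

    Assumption: the error is taken as the population standard deviation
    of the resampled estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not the Aedes anphevirus proteins.
toy = ["MKLLSALLK", "AAGSTLSVV"]
ll_error = bootstrap_error(toy, "LL")
```

With only 2 proteins, a resample can take just three distinct compositions (the first protein twice, the second twice, or one of each), so error estimates from such a small proteome should be read with caution.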

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski