Amino acid dipepetide frequency for Sogatella furcifera totivirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.152AlaAla: 4.152 ± 0.249
0.0AlaCys: 0.0 ± 0.0
1.73AlaAsp: 1.73 ± 0.579
3.114AlaGlu: 3.114 ± 0.316
2.422AlaPhe: 2.422 ± 0.33
2.422AlaGly: 2.422 ± 0.188
1.038AlaHis: 1.038 ± 0.067
1.384AlaIle: 1.384 ± 0.256
2.422AlaLys: 2.422 ± 0.188
4.152AlaLeu: 4.152 ± 0.27
3.806AlaMet: 3.806 ± 0.621
2.076AlaAsn: 2.076 ± 0.135
2.076AlaPro: 2.076 ± 0.135
1.73AlaGln: 1.73 ± 0.579
3.46AlaArg: 3.46 ± 0.121
3.46AlaSer: 3.46 ± 0.121
4.498AlaThr: 4.498 ± 0.054
4.498AlaVal: 4.498 ± 0.572
3.114AlaTrp: 3.114 ± 0.316
7.612AlaTyr: 7.612 ± 1.185
0.0AlaXaa: 0.0 ± 0.0
Cys
1.384CysAla: 1.384 ± 0.256
0.0CysCys: 0.0 ± 0.0
1.384CysAsp: 1.384 ± 0.256
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
1.038CysHis: 1.038 ± 0.067
0.0CysIle: 0.0 ± 0.0
0.346CysLys: 0.346 ± 0.195
1.038CysLeu: 1.038 ± 0.067
1.038CysMet: 1.038 ± 0.067
0.692CysAsn: 0.692 ± 0.128
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.692CysSer: 0.692 ± 0.391
0.346CysThr: 0.346 ± 0.195
0.346CysVal: 0.346 ± 0.195
0.0CysTrp: 0.0 ± 0.0
0.692CysTyr: 0.692 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
5.536AspAla: 5.536 ± 0.014
0.0AspCys: 0.0 ± 0.0
3.114AspAsp: 3.114 ± 0.202
3.114AspGlu: 3.114 ± 0.316
2.422AspPhe: 2.422 ± 0.188
3.806AspGly: 3.806 ± 0.074
0.692AspHis: 0.692 ± 0.391
6.228AspIle: 6.228 ± 0.404
1.73AspLys: 1.73 ± 0.061
3.114AspLeu: 3.114 ± 0.202
1.384AspMet: 1.384 ± 0.256
1.384AspAsn: 1.384 ± 0.263
6.228AspPro: 6.228 ± 1.67
3.114AspGln: 3.114 ± 0.316
4.844AspArg: 4.844 ± 0.66
4.844AspSer: 4.844 ± 0.377
2.422AspThr: 2.422 ± 0.188
3.806AspVal: 3.806 ± 0.444
1.73AspTrp: 1.73 ± 0.061
4.152AspTyr: 4.152 ± 0.249
0.0AspXaa: 0.0 ± 0.0
Glu
3.46GluAla: 3.46 ± 0.121
0.0GluCys: 0.0 ± 0.0
1.038GluAsp: 1.038 ± 0.067
2.422GluGlu: 2.422 ± 0.707
1.038GluPhe: 1.038 ± 0.586
1.384GluGly: 1.384 ± 0.256
0.346GluHis: 0.346 ± 0.195
2.422GluIle: 2.422 ± 0.33
3.114GluLys: 3.114 ± 0.316
3.46GluLeu: 3.46 ± 0.397
0.0GluMet: 0.0 ± 0.0
1.038GluAsn: 1.038 ± 0.586
0.692GluPro: 0.692 ± 0.128
2.422GluGln: 2.422 ± 0.33
1.038GluArg: 1.038 ± 0.586
1.384GluSer: 1.384 ± 0.256
3.114GluThr: 3.114 ± 0.316
4.152GluVal: 4.152 ± 0.27
1.384GluTrp: 1.384 ± 0.263
2.768GluTyr: 2.768 ± 0.512
0.0GluXaa: 0.0 ± 0.0
Phe
1.384PheAla: 1.384 ± 0.263
0.0PheCys: 0.0 ± 0.0
4.844PheAsp: 4.844 ± 0.142
0.692PheGlu: 0.692 ± 0.128
2.768PhePhe: 2.768 ± 0.007
2.768PheGly: 2.768 ± 0.007
0.0PheHis: 0.0 ± 0.0
1.038PheIle: 1.038 ± 0.067
2.076PheLys: 2.076 ± 0.135
4.152PheLeu: 4.152 ± 0.249
2.422PheMet: 2.422 ± 0.188
2.768PheAsn: 2.768 ± 0.512
2.076PhePro: 2.076 ± 0.135
2.076PheGln: 2.076 ± 0.135
1.384PheArg: 1.384 ± 0.263
4.498PheSer: 4.498 ± 0.465
0.346PheThr: 0.346 ± 0.195
1.73PheVal: 1.73 ± 0.061
0.692PheTrp: 0.692 ± 0.128
2.768PheTyr: 2.768 ± 0.525
0.0PheXaa: 0.0 ± 0.0
Gly
3.46GlyAla: 3.46 ± 0.121
0.0GlyCys: 0.0 ± 0.0
3.114GlyAsp: 3.114 ± 0.721
1.73GlyGlu: 1.73 ± 0.458
3.806GlyPhe: 3.806 ± 0.074
8.997GlyGly: 8.997 ± 0.93
0.0GlyHis: 0.0 ± 0.0
3.806GlyIle: 3.806 ± 0.593
1.038GlyLys: 1.038 ± 0.067
4.152GlyLeu: 4.152 ± 0.249
0.692GlyMet: 0.692 ± 0.128
4.152GlyAsn: 4.152 ± 0.249
2.422GlyPro: 2.422 ± 0.849
2.422GlyGln: 2.422 ± 0.33
3.114GlyArg: 3.114 ± 0.202
4.152GlySer: 4.152 ± 0.249
3.114GlyThr: 3.114 ± 0.202
4.498GlyVal: 4.498 ± 0.054
0.692GlyTrp: 0.692 ± 0.391
2.422GlyTyr: 2.422 ± 0.188
0.0GlyXaa: 0.0 ± 0.0
His
1.038HisAla: 1.038 ± 0.067
0.0HisCys: 0.0 ± 0.0
0.692HisAsp: 0.692 ± 0.128
0.0HisGlu: 0.0 ± 0.0
1.038HisPhe: 1.038 ± 0.067
0.692HisGly: 0.692 ± 0.391
0.0HisHis: 0.0 ± 0.0
1.038HisIle: 1.038 ± 0.067
0.692HisLys: 0.692 ± 0.391
2.768HisLeu: 2.768 ± 0.525
0.0HisMet: 0.0 ± 0.0
3.114HisAsn: 3.114 ± 0.202
0.692HisPro: 0.692 ± 0.128
0.346HisGln: 0.346 ± 0.195
2.422HisArg: 2.422 ± 0.33
2.422HisSer: 2.422 ± 0.188
1.73HisThr: 1.73 ± 0.061
1.384HisVal: 1.384 ± 0.263
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.114IleAla: 3.114 ± 0.202
0.0IleCys: 0.0 ± 0.0
4.152IleAsp: 4.152 ± 0.27
2.422IleGlu: 2.422 ± 0.188
2.076IlePhe: 2.076 ± 0.135
2.422IleGly: 2.422 ± 0.33
1.038IleHis: 1.038 ± 0.067
2.422IleIle: 2.422 ± 0.33
3.114IleLys: 3.114 ± 0.202
1.038IleLeu: 1.038 ± 0.586
1.384IleMet: 1.384 ± 0.256
3.806IleAsn: 3.806 ± 0.074
3.806IlePro: 3.806 ± 0.074
2.768IleGln: 2.768 ± 0.007
3.806IleArg: 3.806 ± 0.074
6.574IleSer: 6.574 ± 0.081
6.228IleThr: 6.228 ± 1.151
1.73IleVal: 1.73 ± 0.458
3.114IleTrp: 3.114 ± 0.316
0.692IleTyr: 0.692 ± 0.128
0.0IleXaa: 0.0 ± 0.0
Lys
2.076LysAla: 2.076 ± 0.384
0.346LysCys: 0.346 ± 0.195
5.882LysAsp: 5.882 ± 0.828
1.384LysGlu: 1.384 ± 0.263
2.076LysPhe: 2.076 ± 0.135
1.384LysGly: 1.384 ± 0.263
0.692LysHis: 0.692 ± 0.128
2.768LysIle: 2.768 ± 0.512
1.384LysLys: 1.384 ± 0.256
3.46LysLeu: 3.46 ± 0.121
0.346LysMet: 0.346 ± 0.195
2.768LysAsn: 2.768 ± 0.512
1.73LysPro: 1.73 ± 0.061
4.152LysGln: 4.152 ± 0.27
2.076LysArg: 2.076 ± 0.653
3.806LysSer: 3.806 ± 0.074
6.228LysThr: 6.228 ± 0.114
4.844LysVal: 4.844 ± 0.377
2.422LysTrp: 2.422 ± 0.849
2.422LysTyr: 2.422 ± 0.33
0.0LysXaa: 0.0 ± 0.0
Leu
4.152LeuAla: 4.152 ± 0.249
1.384LeuCys: 1.384 ± 0.263
1.384LeuAsp: 1.384 ± 0.781
2.768LeuGlu: 2.768 ± 0.525
2.768LeuPhe: 2.768 ± 0.007
5.19LeuGly: 5.19 ± 0.855
2.422LeuHis: 2.422 ± 0.33
1.73LeuIle: 1.73 ± 0.458
2.422LeuLys: 2.422 ± 0.33
4.152LeuLeu: 4.152 ± 1.307
0.692LeuMet: 0.692 ± 0.128
3.46LeuAsn: 3.46 ± 0.121
3.46LeuPro: 3.46 ± 0.121
3.114LeuGln: 3.114 ± 0.202
3.46LeuArg: 3.46 ± 0.397
8.304LeuSer: 8.304 ± 0.02
6.574LeuThr: 6.574 ± 0.081
5.536LeuVal: 5.536 ± 0.014
0.0LeuTrp: 0.0 ± 0.0
1.73LeuTyr: 1.73 ± 0.458
0.0LeuXaa: 0.0 ± 0.0
Met
2.076MetAla: 2.076 ± 0.384
1.038MetCys: 1.038 ± 0.067
2.422MetAsp: 2.422 ± 0.33
0.346MetGlu: 0.346 ± 0.195
0.692MetPhe: 0.692 ± 0.128
2.422MetGly: 2.422 ± 0.33
0.0MetHis: 0.0 ± 0.0
1.73MetIle: 1.73 ± 0.061
0.692MetLys: 0.692 ± 0.128
2.422MetLeu: 2.422 ± 0.188
0.692MetMet: 0.692 ± 0.128
0.692MetAsn: 0.692 ± 0.128
2.076MetPro: 2.076 ± 0.384
0.692MetGln: 0.692 ± 0.128
1.038MetArg: 1.038 ± 0.586
1.384MetSer: 1.384 ± 0.263
3.46MetThr: 3.46 ± 0.121
2.076MetVal: 2.076 ± 0.384
0.692MetTrp: 0.692 ± 0.128
0.692MetTyr: 0.692 ± 0.128
0.0MetXaa: 0.0 ± 0.0
Asn
2.768AsnAla: 2.768 ± 0.007
0.692AsnCys: 0.692 ± 0.128
4.844AsnAsp: 4.844 ± 0.896
1.038AsnGlu: 1.038 ± 0.067
1.73AsnPhe: 1.73 ± 0.061
3.46AsnGly: 3.46 ± 0.121
0.692AsnHis: 0.692 ± 0.128
4.498AsnIle: 4.498 ± 0.054
4.498AsnLys: 4.498 ± 0.465
4.152AsnLeu: 4.152 ± 0.768
1.384AsnMet: 1.384 ± 0.263
1.038AsnAsn: 1.038 ± 0.067
1.384AsnPro: 1.384 ± 0.263
3.114AsnGln: 3.114 ± 0.316
2.768AsnArg: 2.768 ± 0.512
4.844AsnSer: 4.844 ± 0.377
3.46AsnThr: 3.46 ± 0.121
3.806AsnVal: 3.806 ± 0.074
0.692AsnTrp: 0.692 ± 0.128
2.076AsnTyr: 2.076 ± 0.135
0.0AsnXaa: 0.0 ± 0.0
Pro
2.076ProAla: 2.076 ± 0.653
0.692ProCys: 0.692 ± 0.128
3.806ProAsp: 3.806 ± 0.444
1.73ProGlu: 1.73 ± 0.061
0.692ProPhe: 0.692 ± 0.128
3.806ProGly: 3.806 ± 0.444
2.422ProHis: 2.422 ± 0.188
2.076ProIle: 2.076 ± 0.384
3.806ProLys: 3.806 ± 0.444
1.038ProLeu: 1.038 ± 0.067
1.038ProMet: 1.038 ± 0.586
4.152ProAsn: 4.152 ± 0.249
1.038ProPro: 1.038 ± 0.451
4.498ProGln: 4.498 ± 1.091
1.73ProArg: 1.73 ± 0.977
7.266ProSer: 7.266 ± 0.047
3.806ProThr: 3.806 ± 0.444
7.612ProVal: 7.612 ± 1.407
1.038ProTrp: 1.038 ± 0.067
1.73ProTyr: 1.73 ± 0.061
0.0ProXaa: 0.0 ± 0.0
Gln
3.46GlnAla: 3.46 ± 0.64
0.0GlnCys: 0.0 ± 0.0
0.692GlnAsp: 0.692 ± 0.128
3.46GlnGlu: 3.46 ± 0.121
3.806GlnPhe: 3.806 ± 0.444
1.73GlnGly: 1.73 ± 0.061
1.384GlnHis: 1.384 ± 0.263
4.498GlnIle: 4.498 ± 0.054
2.768GlnLys: 2.768 ± 0.512
1.038GlnLeu: 1.038 ± 0.067
0.692GlnMet: 0.692 ± 0.128
2.422GlnAsn: 2.422 ± 0.188
2.768GlnPro: 2.768 ± 0.512
3.46GlnGln: 3.46 ± 0.64
1.038GlnArg: 1.038 ± 0.067
4.844GlnSer: 4.844 ± 0.66
2.422GlnThr: 2.422 ± 0.188
4.498GlnVal: 4.498 ± 0.572
1.384GlnTrp: 1.384 ± 0.256
1.73GlnTyr: 1.73 ± 0.458
0.0GlnXaa: 0.0 ± 0.0
Arg
3.46ArgAla: 3.46 ± 0.121
0.346ArgCys: 0.346 ± 0.195
2.076ArgAsp: 2.076 ± 0.653
2.076ArgGlu: 2.076 ± 0.653
1.384ArgPhe: 1.384 ± 0.263
2.422ArgGly: 2.422 ± 0.849
1.384ArgHis: 1.384 ± 0.263
4.498ArgIle: 4.498 ± 0.054
3.114ArgLys: 3.114 ± 0.202
2.422ArgLeu: 2.422 ± 0.33
1.73ArgMet: 1.73 ± 0.458
2.768ArgAsn: 2.768 ± 0.007
1.73ArgPro: 1.73 ± 0.458
0.346ArgGln: 0.346 ± 0.195
1.384ArgArg: 1.384 ± 0.263
2.768ArgSer: 2.768 ± 0.007
2.768ArgThr: 2.768 ± 0.525
5.19ArgVal: 5.19 ± 0.337
2.422ArgTrp: 2.422 ± 0.33
0.346ArgTyr: 0.346 ± 0.195
0.0ArgXaa: 0.0 ± 0.0
Ser
4.498SerAla: 4.498 ± 0.572
0.692SerCys: 0.692 ± 0.128
7.612SerAsp: 7.612 ± 0.148
2.768SerGlu: 2.768 ± 0.525
3.806SerPhe: 3.806 ± 0.074
2.768SerGly: 2.768 ± 0.525
2.422SerHis: 2.422 ± 0.33
4.844SerIle: 4.844 ± 0.142
6.228SerLys: 6.228 ± 0.114
5.882SerLeu: 5.882 ± 0.727
1.73SerMet: 1.73 ± 0.061
6.228SerAsn: 6.228 ± 0.633
5.882SerPro: 5.882 ± 0.209
4.152SerGln: 4.152 ± 0.768
3.46SerArg: 3.46 ± 0.397
6.92SerSer: 6.92 ± 0.761
6.228SerThr: 6.228 ± 0.633
6.574SerVal: 6.574 ± 0.438
1.038SerTrp: 1.038 ± 0.067
4.844SerTyr: 4.844 ± 0.142
0.0SerXaa: 0.0 ± 0.0
Thr
4.844ThrAla: 4.844 ± 0.377
0.0ThrCys: 0.0 ± 0.0
4.152ThrAsp: 4.152 ± 0.768
5.19ThrGlu: 5.19 ± 0.182
1.73ThrPhe: 1.73 ± 0.061
5.19ThrGly: 5.19 ± 0.182
1.384ThrHis: 1.384 ± 0.263
4.498ThrIle: 4.498 ± 0.572
6.574ThrLys: 6.574 ± 0.438
3.114ThrLeu: 3.114 ± 1.239
2.768ThrMet: 2.768 ± 0.512
2.768ThrAsn: 2.768 ± 0.512
4.498ThrPro: 4.498 ± 0.572
3.806ThrGln: 3.806 ± 0.963
1.73ThrArg: 1.73 ± 0.458
7.958ThrSer: 7.958 ± 0.693
5.19ThrThr: 5.19 ± 0.7
4.498ThrVal: 4.498 ± 0.054
1.038ThrTrp: 1.038 ± 0.067
1.384ThrTyr: 1.384 ± 0.256
0.0ThrXaa: 0.0 ± 0.0
Val
2.076ValAla: 2.076 ± 0.135
2.076ValCys: 2.076 ± 0.384
9.343ValAsp: 9.343 ± 1.468
1.384ValGlu: 1.384 ± 0.263
2.076ValPhe: 2.076 ± 0.135
3.806ValGly: 3.806 ± 0.074
1.038ValHis: 1.038 ± 0.067
2.768ValIle: 2.768 ± 0.007
4.844ValLys: 4.844 ± 0.142
5.19ValLeu: 5.19 ± 0.337
2.076ValMet: 2.076 ± 0.135
3.46ValAsn: 3.46 ± 0.397
8.651ValPro: 8.651 ± 1.34
3.46ValGln: 3.46 ± 0.121
3.806ValArg: 3.806 ± 0.074
6.228ValSer: 6.228 ± 0.404
3.114ValThr: 3.114 ± 1.354
6.92ValVal: 6.92 ± 0.242
1.384ValTrp: 1.384 ± 0.256
3.806ValTyr: 3.806 ± 0.074
0.0ValXaa: 0.0 ± 0.0
Trp
2.076TrpAla: 2.076 ± 0.135
0.692TrpCys: 0.692 ± 0.128
1.038TrpAsp: 1.038 ± 0.067
0.346TrpGlu: 0.346 ± 0.195
0.346TrpPhe: 0.346 ± 0.195
1.038TrpGly: 1.038 ± 0.067
0.692TrpHis: 0.692 ± 0.128
2.076TrpIle: 2.076 ± 0.135
1.038TrpLys: 1.038 ± 0.067
3.114TrpLeu: 3.114 ± 0.202
1.73TrpMet: 1.73 ± 0.198
2.076TrpAsn: 2.076 ± 0.384
1.384TrpPro: 1.384 ± 0.256
0.692TrpGln: 0.692 ± 0.128
0.346TrpArg: 0.346 ± 0.195
1.038TrpSer: 1.038 ± 0.067
2.422TrpThr: 2.422 ± 0.188
2.076TrpVal: 2.076 ± 0.135
1.038TrpTrp: 1.038 ± 0.067
1.038TrpTyr: 1.038 ± 0.067
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.422TyrAla: 2.422 ± 0.188
1.038TyrCys: 1.038 ± 0.586
0.346TyrAsp: 0.346 ± 0.195
0.692TyrGlu: 0.692 ± 0.128
3.806TyrPhe: 3.806 ± 0.074
2.076TyrGly: 2.076 ± 0.135
1.384TyrHis: 1.384 ± 0.263
1.384TyrIle: 1.384 ± 0.263
0.346TyrLys: 0.346 ± 0.195
4.844TyrLeu: 4.844 ± 0.66
1.384TyrMet: 1.384 ± 0.263
2.076TyrAsn: 2.076 ± 0.384
3.46TyrPro: 3.46 ± 0.121
1.73TyrGln: 1.73 ± 0.458
1.73TyrArg: 1.73 ± 0.458
5.19TyrSer: 5.19 ± 0.182
4.844TyrThr: 4.844 ± 0.377
1.73TyrVal: 1.73 ± 0.061
2.422TyrTrp: 2.422 ± 0.188
3.806TyrTyr: 3.806 ± 0.444
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2891 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski