Amino acid dipepetide frequency for Torque teno mini virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.927AlaAla: 3.927 ± 1.852
1.309AlaCys: 1.309 ± 0.617
5.236AlaAsp: 5.236 ± 2.271
5.236AlaGlu: 5.236 ± 2.469
0.0AlaPhe: 0.0 ± 0.0
0.0AlaGly: 0.0 ± 0.0
1.309AlaHis: 1.309 ± 0.617
0.0AlaIle: 0.0 ± 0.0
2.618AlaLys: 2.618 ± 1.235
2.618AlaLeu: 2.618 ± 1.235
0.0AlaMet: 0.0 ± 0.0
1.309AlaAsn: 1.309 ± 0.617
2.618AlaPro: 2.618 ± 1.235
1.309AlaGln: 1.309 ± 0.617
0.0AlaArg: 0.0 ± 0.0
0.0AlaSer: 0.0 ± 0.0
2.618AlaThr: 2.618 ± 3.505
0.0AlaVal: 0.0 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
1.309AlaTyr: 1.309 ± 0.617
0.0AlaXaa: 0.0 ± 0.0
Cys
1.309CysAla: 1.309 ± 0.617
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
2.618CysGlu: 2.618 ± 3.505
0.0CysPhe: 0.0 ± 0.0
2.618CysGly: 2.618 ± 3.505
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.309CysLys: 1.309 ± 0.617
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.618CysPro: 2.618 ± 3.505
0.0CysGln: 0.0 ± 0.0
1.309CysArg: 1.309 ± 0.617
0.0CysSer: 0.0 ± 0.0
2.618CysThr: 2.618 ± 3.505
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.309AspAla: 1.309 ± 4.123
0.0AspCys: 0.0 ± 0.0
2.618AspAsp: 2.618 ± 3.505
5.236AspGlu: 5.236 ± 2.271
2.618AspPhe: 2.618 ± 3.505
1.309AspGly: 1.309 ± 4.123
0.0AspHis: 0.0 ± 0.0
2.618AspIle: 2.618 ± 1.235
5.236AspLys: 5.236 ± 2.469
3.927AspLeu: 3.927 ± 12.368
0.0AspMet: 0.0 ± 0.0
2.618AspAsn: 2.618 ± 1.235
2.618AspPro: 2.618 ± 1.235
1.309AspGln: 1.309 ± 0.617
1.309AspArg: 1.309 ± 0.617
1.309AspSer: 1.309 ± 0.617
9.162AspThr: 9.162 ± 5.159
0.0AspVal: 0.0 ± 0.0
1.309AspTrp: 1.309 ± 0.617
7.853AspTyr: 7.853 ± 3.704
0.0AspXaa: 0.0 ± 0.0
Glu
3.927GluAla: 3.927 ± 1.852
1.309GluCys: 1.309 ± 4.123
2.618GluAsp: 2.618 ± 3.505
7.853GluGlu: 7.853 ± 15.256
1.309GluPhe: 1.309 ± 0.617
1.309GluGly: 1.309 ± 4.123
1.309GluHis: 1.309 ± 0.617
2.618GluIle: 2.618 ± 1.235
10.471GluLys: 10.471 ± 9.282
3.927GluLeu: 3.927 ± 1.852
2.618GluMet: 2.618 ± 1.008
0.0GluAsn: 0.0 ± 0.0
2.618GluPro: 2.618 ± 1.235
2.618GluGln: 2.618 ± 1.235
3.927GluArg: 3.927 ± 2.888
1.309GluSer: 1.309 ± 4.123
7.853GluThr: 7.853 ± 3.704
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
3.927GluTyr: 3.927 ± 1.852
0.0GluXaa: 0.0 ± 0.0
Phe
2.618PheAla: 2.618 ± 1.235
0.0PheCys: 0.0 ± 0.0
5.236PheAsp: 5.236 ± 2.271
0.0PheGlu: 0.0 ± 0.0
0.0PhePhe: 0.0 ± 0.0
2.618PheGly: 2.618 ± 8.245
2.618PheHis: 2.618 ± 1.235
5.236PheIle: 5.236 ± 2.469
2.618PheLys: 2.618 ± 1.235
2.618PheLeu: 2.618 ± 1.235
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
2.618PheGln: 2.618 ± 1.235
1.309PheArg: 1.309 ± 0.617
2.618PheSer: 2.618 ± 1.235
6.545PheThr: 6.545 ± 1.653
0.0PheVal: 0.0 ± 0.0
2.618PheTrp: 2.618 ± 1.235
3.927PheTyr: 3.927 ± 1.852
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
2.618GlyCys: 2.618 ± 3.505
5.236GlyAsp: 5.236 ± 11.751
1.309GlyGlu: 1.309 ± 0.617
5.236GlyPhe: 5.236 ± 2.271
5.236GlyGly: 5.236 ± 2.469
0.0GlyHis: 0.0 ± 0.0
1.309GlyIle: 1.309 ± 0.617
2.618GlyLys: 2.618 ± 1.235
0.0GlyLeu: 0.0 ± 0.0
0.0GlyMet: 0.0 ± 0.0
2.618GlyAsn: 2.618 ± 1.235
1.309GlyPro: 1.309 ± 0.617
2.618GlyGln: 2.618 ± 1.235
1.309GlyArg: 1.309 ± 0.617
1.309GlySer: 1.309 ± 0.617
2.618GlyThr: 2.618 ± 1.235
0.0GlyVal: 0.0 ± 0.0
0.0GlyTrp: 0.0 ± 0.0
1.309GlyTyr: 1.309 ± 0.617
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.309HisCys: 1.309 ± 0.617
1.309HisAsp: 1.309 ± 4.123
0.0HisGlu: 0.0 ± 0.0
1.309HisPhe: 1.309 ± 0.617
1.309HisGly: 1.309 ± 0.617
0.0HisHis: 0.0 ± 0.0
2.618HisIle: 2.618 ± 1.235
2.618HisLys: 2.618 ± 1.235
1.309HisLeu: 1.309 ± 0.617
1.309HisMet: 1.309 ± 0.617
0.0HisAsn: 0.0 ± 0.0
1.309HisPro: 1.309 ± 0.617
0.0HisGln: 0.0 ± 0.0
1.309HisArg: 1.309 ± 0.617
1.309HisSer: 1.309 ± 0.617
1.309HisThr: 1.309 ± 0.617
2.618HisVal: 2.618 ± 3.505
1.309HisTrp: 1.309 ± 0.617
1.309HisTyr: 1.309 ± 0.617
0.0HisXaa: 0.0 ± 0.0
Ile
1.309IleAla: 1.309 ± 0.617
0.0IleCys: 0.0 ± 0.0
6.545IleAsp: 6.545 ± 3.087
1.309IleGlu: 1.309 ± 0.617
3.927IlePhe: 3.927 ± 1.852
2.618IleGly: 2.618 ± 1.235
2.618IleHis: 2.618 ± 3.505
5.236IleIle: 5.236 ± 2.271
9.162IleLys: 9.162 ± 4.321
1.309IleLeu: 1.309 ± 0.617
0.0IleMet: 0.0 ± 0.0
2.618IleAsn: 2.618 ± 1.235
5.236IlePro: 5.236 ± 2.469
2.618IleGln: 2.618 ± 1.235
2.618IleArg: 2.618 ± 1.235
5.236IleSer: 5.236 ± 2.469
6.545IleThr: 6.545 ± 1.653
2.618IleVal: 2.618 ± 1.235
0.0IleTrp: 0.0 ± 0.0
5.236IleTyr: 5.236 ± 2.271
0.0IleXaa: 0.0 ± 0.0
Lys
2.618LysAla: 2.618 ± 1.235
0.0LysCys: 0.0 ± 0.0
5.236LysAsp: 5.236 ± 7.011
3.927LysGlu: 3.927 ± 1.852
2.618LysPhe: 2.618 ± 3.505
2.618LysGly: 2.618 ± 1.235
1.309LysHis: 1.309 ± 0.617
7.853LysIle: 7.853 ± 3.704
6.545LysLys: 6.545 ± 6.393
7.853LysLeu: 7.853 ± 1.036
1.309LysMet: 1.309 ± 0.617
1.309LysAsn: 1.309 ± 0.617
3.927LysPro: 3.927 ± 1.852
9.162LysGln: 9.162 ± 0.419
10.471LysArg: 10.471 ± 4.939
7.853LysSer: 7.853 ± 3.704
5.236LysThr: 5.236 ± 2.469
1.309LysVal: 1.309 ± 0.617
3.927LysTrp: 3.927 ± 2.888
5.236LysTyr: 5.236 ± 2.271
0.0LysXaa: 0.0 ± 0.0
Leu
3.927LeuAla: 3.927 ± 1.852
0.0LeuCys: 0.0 ± 0.0
2.618LeuAsp: 2.618 ± 1.235
5.236LeuGlu: 5.236 ± 7.011
3.927LeuPhe: 3.927 ± 2.888
3.927LeuGly: 3.927 ± 1.852
3.927LeuHis: 3.927 ± 1.852
2.618LeuIle: 2.618 ± 1.235
9.162LeuLys: 9.162 ± 0.419
13.089LeuLeu: 13.089 ± 6.173
1.309LeuMet: 1.309 ± 0.617
3.927LeuAsn: 3.927 ± 1.852
3.927LeuPro: 3.927 ± 1.852
11.78LeuGln: 11.78 ± 8.664
2.618LeuArg: 2.618 ± 1.235
2.618LeuSer: 2.618 ± 1.235
5.236LeuThr: 5.236 ± 7.011
2.618LeuVal: 2.618 ± 3.505
1.309LeuTrp: 1.309 ± 0.617
5.236LeuTyr: 5.236 ± 2.469
0.0LeuXaa: 0.0 ± 0.0
Met
1.309MetAla: 1.309 ± 0.617
0.0MetCys: 0.0 ± 0.0
2.618MetAsp: 2.618 ± 1.235
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.309MetGly: 1.309 ± 0.617
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
2.618MetLeu: 2.618 ± 1.235
1.309MetMet: 1.309 ± 0.617
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
2.618MetGln: 2.618 ± 1.235
0.0MetArg: 0.0 ± 0.0
2.618MetSer: 2.618 ± 3.505
2.618MetThr: 2.618 ± 1.235
0.0MetVal: 0.0 ± 0.0
1.309MetTrp: 1.309 ± 0.617
1.309MetTyr: 1.309 ± 0.617
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.309AsnCys: 1.309 ± 0.617
2.618AsnAsp: 2.618 ± 1.235
3.927AsnGlu: 3.927 ± 1.852
2.618AsnPhe: 2.618 ± 1.235
0.0AsnGly: 0.0 ± 0.0
0.0AsnHis: 0.0 ± 0.0
5.236AsnIle: 5.236 ± 2.469
3.927AsnLys: 3.927 ± 2.888
0.0AsnLeu: 0.0 ± 0.0
0.0AsnMet: 0.0 ± 0.0
5.236AsnAsn: 5.236 ± 2.271
5.236AsnPro: 5.236 ± 2.469
1.309AsnGln: 1.309 ± 4.123
0.0AsnArg: 0.0 ± 0.0
5.236AsnSer: 5.236 ± 2.469
2.618AsnThr: 2.618 ± 3.505
0.0AsnVal: 0.0 ± 0.0
2.618AsnTrp: 2.618 ± 1.235
2.618AsnTyr: 2.618 ± 1.235
0.0AsnXaa: 0.0 ± 0.0
Pro
1.309ProAla: 1.309 ± 0.617
0.0ProCys: 0.0 ± 0.0
2.618ProAsp: 2.618 ± 1.235
5.236ProGlu: 5.236 ± 2.271
2.618ProPhe: 2.618 ± 1.235
2.618ProGly: 2.618 ± 1.235
2.618ProHis: 2.618 ± 1.235
3.927ProIle: 3.927 ± 1.852
5.236ProLys: 5.236 ± 2.271
11.78ProLeu: 11.78 ± 0.816
2.618ProMet: 2.618 ± 1.235
1.309ProAsn: 1.309 ± 0.617
9.162ProPro: 9.162 ± 0.419
2.618ProGln: 2.618 ± 1.235
2.618ProArg: 2.618 ± 1.235
1.309ProSer: 1.309 ± 0.617
2.618ProThr: 2.618 ± 1.235
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
7.853ProTyr: 7.853 ± 3.704
0.0ProXaa: 0.0 ± 0.0
Gln
1.309GlnAla: 1.309 ± 0.617
1.309GlnCys: 1.309 ± 4.123
1.309GlnAsp: 1.309 ± 0.617
6.545GlnGlu: 6.545 ± 1.653
0.0GlnPhe: 0.0 ± 0.0
0.0GlnGly: 0.0 ± 0.0
2.618GlnHis: 2.618 ± 3.505
2.618GlnIle: 2.618 ± 1.235
5.236GlnLys: 5.236 ± 2.271
7.853GlnLeu: 7.853 ± 5.776
1.309GlnMet: 1.309 ± 0.617
7.853GlnAsn: 7.853 ± 3.704
6.545GlnPro: 6.545 ± 3.087
2.618GlnGln: 2.618 ± 1.235
2.618GlnArg: 2.618 ± 1.235
2.618GlnSer: 2.618 ± 1.235
5.236GlnThr: 5.236 ± 2.469
1.309GlnVal: 1.309 ± 0.617
1.309GlnTrp: 1.309 ± 0.617
2.618GlnTyr: 2.618 ± 1.235
0.0GlnXaa: 0.0 ± 0.0
Arg
2.618ArgAla: 2.618 ± 1.235
2.618ArgCys: 2.618 ± 1.235
0.0ArgAsp: 0.0 ± 0.0
0.0ArgGlu: 0.0 ± 0.0
3.927ArgPhe: 3.927 ± 1.852
0.0ArgGly: 0.0 ± 0.0
1.309ArgHis: 1.309 ± 0.617
3.927ArgIle: 3.927 ± 1.852
5.236ArgLys: 5.236 ± 2.469
3.927ArgLeu: 3.927 ± 2.888
0.0ArgMet: 0.0 ± 2.087
0.0ArgAsn: 0.0 ± 0.0
3.927ArgPro: 3.927 ± 1.852
2.618ArgGln: 2.618 ± 3.505
14.398ArgArg: 14.398 ± 6.791
1.309ArgSer: 1.309 ± 0.617
0.0ArgThr: 0.0 ± 0.0
1.309ArgVal: 1.309 ± 0.617
0.0ArgTrp: 0.0 ± 0.0
7.853ArgTyr: 7.853 ± 3.704
0.0ArgXaa: 0.0 ± 0.0
Ser
1.309SerAla: 1.309 ± 0.617
1.309SerCys: 1.309 ± 0.617
0.0SerAsp: 0.0 ± 0.0
5.236SerGlu: 5.236 ± 2.469
0.0SerPhe: 0.0 ± 0.0
0.0SerGly: 0.0 ± 0.0
0.0SerHis: 0.0 ± 0.0
3.927SerIle: 3.927 ± 2.888
2.618SerLys: 2.618 ± 1.235
7.853SerLeu: 7.853 ± 3.704
1.309SerMet: 1.309 ± 0.617
3.927SerAsn: 3.927 ± 1.852
5.236SerPro: 5.236 ± 2.271
2.618SerGln: 2.618 ± 1.235
1.309SerArg: 1.309 ± 4.123
5.236SerSer: 5.236 ± 2.469
7.853SerThr: 7.853 ± 1.036
0.0SerVal: 0.0 ± 0.0
0.0SerTrp: 0.0 ± 0.0
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
1.309ThrAla: 1.309 ± 4.123
1.309ThrCys: 1.309 ± 4.123
3.927ThrAsp: 3.927 ± 1.852
5.236ThrGlu: 5.236 ± 7.011
6.545ThrPhe: 6.545 ± 3.087
6.545ThrGly: 6.545 ± 3.087
1.309ThrHis: 1.309 ± 0.617
10.471ThrIle: 10.471 ± 4.541
2.618ThrLys: 2.618 ± 1.235
7.853ThrLeu: 7.853 ± 1.036
3.927ThrMet: 3.927 ± 1.852
3.927ThrAsn: 3.927 ± 1.852
3.927ThrPro: 3.927 ± 2.888
9.162ThrGln: 9.162 ± 4.321
1.309ThrArg: 1.309 ± 4.123
2.618ThrSer: 2.618 ± 3.505
9.162ThrThr: 9.162 ± 0.419
2.618ThrVal: 2.618 ± 1.235
2.618ThrTrp: 2.618 ± 1.235
1.309ThrTyr: 1.309 ± 0.617
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
0.0ValGlu: 0.0 ± 0.0
1.309ValPhe: 1.309 ± 0.617
0.0ValGly: 0.0 ± 0.0
0.0ValHis: 0.0 ± 0.0
1.309ValIle: 1.309 ± 0.617
2.618ValLys: 2.618 ± 1.235
1.309ValLeu: 1.309 ± 0.617
0.0ValMet: 0.0 ± 0.0
2.618ValAsn: 2.618 ± 3.505
2.618ValPro: 2.618 ± 1.235
0.0ValGln: 0.0 ± 0.0
0.0ValArg: 0.0 ± 0.0
2.618ValSer: 2.618 ± 3.505
0.0ValThr: 0.0 ± 0.0
1.309ValVal: 1.309 ± 0.617
0.0ValTrp: 0.0 ± 0.0
1.309ValTyr: 1.309 ± 0.617
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
2.618TrpGly: 2.618 ± 1.235
1.309TrpHis: 1.309 ± 0.617
0.0TrpIle: 0.0 ± 0.0
1.309TrpLys: 1.309 ± 0.617
2.618TrpLeu: 2.618 ± 3.505
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.309TrpPro: 1.309 ± 0.617
1.309TrpGln: 1.309 ± 0.617
2.618TrpArg: 2.618 ± 1.235
1.309TrpSer: 1.309 ± 0.617
2.618TrpThr: 2.618 ± 1.235
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.309TrpTyr: 1.309 ± 0.617
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.309TyrAla: 1.309 ± 0.617
0.0TyrCys: 0.0 ± 0.0
2.618TyrAsp: 2.618 ± 1.235
2.618TyrGlu: 2.618 ± 1.235
5.236TyrPhe: 5.236 ± 2.469
0.0TyrGly: 0.0 ± 0.0
1.309TyrHis: 1.309 ± 0.617
5.236TyrIle: 5.236 ± 2.469
9.162TyrLys: 9.162 ± 4.321
6.545TyrLeu: 6.545 ± 3.087
1.309TyrMet: 1.309 ± 0.617
5.236TyrAsn: 5.236 ± 7.011
3.927TyrPro: 3.927 ± 1.852
3.927TyrGln: 3.927 ± 1.852
5.236TyrArg: 5.236 ± 2.469
1.309TyrSer: 1.309 ± 0.617
5.236TyrThr: 5.236 ± 2.469
1.309TyrVal: 1.309 ± 0.617
0.0TyrTrp: 0.0 ± 0.0
6.545TyrTyr: 6.545 ± 3.087
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (765 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski