Amino acid dipeptide frequency for Torque teno midi virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
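As an illustration of the idea, here is a minimal Python sketch that counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to count pairs within each protein separately and normalize by the total number of pairs are assumptions made for this example; the page does not state how the database performs the calculation.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and frequencies are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences (hypothetical, not taken from the proteome).
toy = ["MKKA", "AKK"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary, which corresponds to the 0.0 entries in the table.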

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
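The arithmetic behind the expected frequency and the per mille unit can be sketched as follows. The helper names are hypothetical, and the page does not state the exact threshold used for the red/blue marking, so the simple comparison against the uniform expectation is an assumption.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def classify(freq_per_mille, expected=None):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if expected is None:
        expected = expected_per_mille()
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


print(expected_per_mille())       # 20 standard amino acids: 400 dipeptides
print(expected_per_mille(21))     # with Xaa: 441 dipeptides
print(classify(19.979))           # ArgArg value from the table
print(classify(0.0))              # AlaCys value from the table
```

With 20 residue types the expectation is 2.5‰, which is the 0.25% quoted above.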

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.567 ± 7.124
AlaCys: 0.0 ± 0.0
AlaAsp: 2.103 ± 4.06
AlaGlu: 3.155 ± 3.744
AlaPhe: 1.052 ± 0.519
AlaGly: 5.258 ± 2.086
AlaHis: 2.103 ± 1.738
AlaIle: 3.155 ± 1.568
AlaLys: 3.155 ± 2.568
AlaLeu: 3.155 ± 1.004
AlaMet: 0.0 ± 0.0
AlaAsn: 1.052 ± 0.519
AlaPro: 4.206 ± 1.12
AlaGln: 0.0 ± 0.0
AlaArg: 4.206 ± 1.12
AlaSer: 2.103 ± 1.037
AlaThr: 3.155 ± 1.004
AlaVal: 1.052 ± 0.519
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.155 ± 1.004
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.052 ± 1.46
CysCys: 1.052 ± 0.519
CysAsp: 0.0 ± 0.0
CysGlu: 2.103 ± 1.037
CysPhe: 1.052 ± 2.03
CysGly: 3.155 ± 1.938
CysHis: 1.052 ± 2.03
CysIle: 0.0 ± 0.0
CysLys: 1.052 ± 0.519
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.052 ± 0.519
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 2.103 ± 1.738
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.052 ± 0.519
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.052 ± 2.03
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 1.052 ± 2.03
AspPhe: 4.206 ± 1.56
AspGly: 1.052 ± 2.03
AspHis: 0.0 ± 0.0
AspIle: 2.103 ± 4.06
AspLys: 1.052 ± 2.03
AspLeu: 5.258 ± 3.269
AspMet: 1.052 ± 1.46
AspAsn: 2.103 ± 1.037
AspPro: 2.103 ± 1.037
AspGln: 2.103 ± 2.92
AspArg: 3.155 ± 1.568
AspSer: 5.258 ± 1.717
AspThr: 4.206 ± 1.12
AspVal: 0.0 ± 0.0
AspTrp: 1.052 ± 1.46
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.052 ± 0.519
GluCys: 2.103 ± 1.141
GluAsp: 3.155 ± 1.004
GluGlu: 7.361 ± 5.683
GluPhe: 0.0 ± 0.0
GluGly: 3.155 ± 1.004
GluHis: 2.103 ± 1.037
GluIle: 3.155 ± 1.938
GluLys: 2.103 ± 1.141
GluLeu: 4.206 ± 1.56
GluMet: 0.0 ± 0.0
GluAsn: 5.258 ± 3.379
GluPro: 0.0 ± 0.0
GluGln: 2.103 ± 2.44
GluArg: 4.206 ± 5.766
GluSer: 5.258 ± 3.269
GluThr: 6.309 ± 3.111
GluVal: 0.0 ± 0.0
GluTrp: 0.0 ± 0.0
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.155 ± 1.568
PheCys: 1.052 ± 2.03
PheAsp: 0.0 ± 0.0
PheGlu: 1.052 ± 0.519
PhePhe: 3.155 ± 1.556
PheGly: 1.052 ± 0.519
PheHis: 3.155 ± 3.744
PheIle: 3.155 ± 1.556
PheLys: 3.155 ± 1.556
PheLeu: 2.103 ± 1.037
PheMet: 1.052 ± 0.519
PheAsn: 0.0 ± 0.0
PhePro: 3.155 ± 1.938
PheGln: 1.052 ± 0.519
PheArg: 0.0 ± 0.0
PheSer: 3.155 ± 1.004
PheThr: 4.206 ± 2.074
PheVal: 2.103 ± 1.037
PheTrp: 2.103 ± 1.037
PheTyr: 5.258 ± 1.717
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.206 ± 1.12
GlyCys: 2.103 ± 1.141
GlyAsp: 1.052 ± 2.03
GlyGlu: 3.155 ± 1.004
GlyPhe: 4.206 ± 2.074
GlyGly: 8.412 ± 3.12
GlyHis: 1.052 ± 2.03
GlyIle: 1.052 ± 1.46
GlyLys: 2.103 ± 1.037
GlyLeu: 1.052 ± 0.519
GlyMet: 1.052 ± 0.515
GlyAsn: 4.206 ± 1.12
GlyPro: 4.206 ± 1.447
GlyGln: 0.0 ± 0.0
GlyArg: 2.103 ± 1.141
GlySer: 2.103 ± 2.92
GlyThr: 3.155 ± 1.556
GlyVal: 1.052 ± 2.03
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.103 ± 1.037
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.052 ± 2.03
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.052 ± 0.519
HisHis: 2.103 ± 1.141
HisIle: 1.052 ± 2.03
HisLys: 3.155 ± 1.556
HisLeu: 1.052 ± 1.46
HisMet: 1.052 ± 0.519
HisAsn: 2.103 ± 1.141
HisPro: 5.258 ± 2.086
HisGln: 2.103 ± 1.141
HisArg: 3.155 ± 1.938
HisSer: 4.206 ± 3.801
HisThr: 0.0 ± 0.0
HisVal: 1.052 ± 0.519
HisTrp: 0.0 ± 0.0
HisTyr: 1.052 ± 0.519
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.052 ± 2.03
IleCys: 0.0 ± 0.0
IleAsp: 1.052 ± 0.519
IleGlu: 4.206 ± 3.476
IlePhe: 4.206 ± 1.447
IleGly: 1.052 ± 0.519
IleHis: 0.0 ± 0.0
IleIle: 6.309 ± 1.999
IleLys: 3.155 ± 1.556
IleLeu: 5.258 ± 4.244
IleMet: 0.0 ± 0.0
IleAsn: 4.206 ± 1.56
IlePro: 4.206 ± 2.074
IleGln: 4.206 ± 1.12
IleArg: 4.206 ± 3.801
IleSer: 5.258 ± 1.427
IleThr: 5.258 ± 1.427
IleVal: 2.103 ± 1.141
IleTrp: 1.052 ± 0.519
IleTyr: 3.155 ± 1.556
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.206 ± 2.281
LysCys: 1.052 ± 2.03
LysAsp: 2.103 ± 1.141
LysGlu: 4.206 ± 2.958
LysPhe: 2.103 ± 1.037
LysGly: 3.155 ± 1.556
LysHis: 2.103 ± 1.037
LysIle: 5.258 ± 2.593
LysLys: 11.567 ± 3.243
LysLeu: 5.258 ± 2.086
LysMet: 2.103 ± 1.141
LysAsn: 3.155 ± 1.556
LysPro: 7.361 ± 2.363
LysGln: 5.258 ± 2.086
LysArg: 6.309 ± 1.832
LysSer: 3.155 ± 1.556
LysThr: 6.309 ± 1.941
LysVal: 4.206 ± 2.074
LysTrp: 2.103 ± 1.037
LysTyr: 5.258 ± 1.427
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.258 ± 3.379
LeuCys: 2.103 ± 1.037
LeuAsp: 2.103 ± 1.738
LeuGlu: 1.052 ± 1.46
LeuPhe: 1.052 ± 1.46
LeuGly: 4.206 ± 2.281
LeuHis: 3.155 ± 1.556
LeuIle: 6.309 ± 1.941
LeuLys: 5.258 ± 2.448
LeuLeu: 8.412 ± 0.908
LeuMet: 0.0 ± 0.0
LeuAsn: 3.155 ± 1.556
LeuPro: 4.206 ± 1.12
LeuGln: 7.361 ± 2.063
LeuArg: 3.155 ± 1.004
LeuSer: 4.206 ± 2.281
LeuThr: 3.155 ± 1.556
LeuVal: 1.052 ± 0.519
LeuTrp: 3.155 ± 1.556
LeuTyr: 1.052 ± 0.519
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.052 ± 1.46
MetCys: 1.052 ± 0.519
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 1.052 ± 0.519
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 2.103 ± 1.037
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.052 ± 0.519
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 4.206 ± 3.801
MetThr: 1.052 ± 1.46
MetVal: 0.0 ± 0.0
MetTrp: 1.052 ± 2.03
MetTyr: 1.052 ± 0.519
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.103 ± 1.037
AsnCys: 1.052 ± 0.519
AsnAsp: 1.052 ± 0.519
AsnGlu: 2.103 ± 1.037
AsnPhe: 2.103 ± 1.037
AsnGly: 4.206 ± 1.56
AsnHis: 1.052 ± 1.46
AsnIle: 5.258 ± 1.717
AsnLys: 3.155 ± 2.568
AsnLeu: 1.052 ± 0.519
AsnMet: 0.0 ± 1.118
AsnAsn: 3.155 ± 1.556
AsnPro: 4.206 ± 2.074
AsnGln: 3.155 ± 1.938
AsnArg: 1.052 ± 0.519
AsnSer: 3.155 ± 1.004
AsnThr: 2.103 ± 2.44
AsnVal: 0.0 ± 0.0
AsnTrp: 1.052 ± 0.519
AsnTyr: 4.206 ± 1.12
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.103 ± 1.141
ProCys: 0.0 ± 0.0
ProAsp: 4.206 ± 1.56
ProGlu: 2.103 ± 1.141
ProPhe: 3.155 ± 1.556
ProGly: 1.052 ± 1.46
ProHis: 1.052 ± 1.46
ProIle: 4.206 ± 1.12
ProLys: 4.206 ± 1.12
ProLeu: 4.206 ± 1.12
ProMet: 0.0 ± 0.0
ProAsn: 4.206 ± 1.12
ProPro: 9.464 ± 2.512
ProGln: 4.206 ± 2.074
ProArg: 10.515 ± 3.045
ProSer: 7.361 ± 2.363
ProThr: 7.361 ± 1.443
ProVal: 0.0 ± 0.0
ProTrp: 0.0 ± 0.0
ProTyr: 6.309 ± 3.111
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.103 ± 1.037
GlnCys: 0.0 ± 0.0
GlnAsp: 3.155 ± 3.472
GlnGlu: 4.206 ± 1.447
GlnPhe: 2.103 ± 1.037
GlnGly: 2.103 ± 1.037
GlnHis: 2.103 ± 1.141
GlnIle: 3.155 ± 2.568
GlnLys: 4.206 ± 2.281
GlnLeu: 2.103 ± 1.141
GlnMet: 2.103 ± 1.738
GlnAsn: 1.052 ± 0.519
GlnPro: 2.103 ± 1.037
GlnGln: 3.155 ± 2.568
GlnArg: 2.103 ± 2.92
GlnSer: 4.206 ± 2.074
GlnThr: 1.052 ± 0.519
GlnVal: 2.103 ± 1.037
GlnTrp: 1.052 ± 0.519
GlnTyr: 4.206 ± 1.12
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.155 ± 2.568
ArgCys: 2.103 ± 4.06
ArgAsp: 4.206 ± 5.766
ArgGlu: 2.103 ± 4.06
ArgPhe: 1.052 ± 0.519
ArgGly: 1.052 ± 0.519
ArgHis: 6.309 ± 3.422
ArgIle: 2.103 ± 1.037
ArgLys: 10.515 ± 3.744
ArgLeu: 2.103 ± 1.037
ArgMet: 2.103 ± 1.277
ArgAsn: 3.155 ± 4.38
ArgPro: 5.258 ± 2.086
ArgGln: 2.103 ± 1.037
ArgArg: 19.979 ± 3.592
ArgSer: 1.052 ± 0.519
ArgThr: 1.052 ± 0.519
ArgVal: 1.052 ± 0.519
ArgTrp: 0.0 ± 0.0
ArgTyr: 6.309 ± 1.832
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.103 ± 1.738
SerCys: 0.0 ± 0.0
SerAsp: 2.103 ± 1.037
SerGlu: 3.155 ± 1.556
SerPhe: 5.258 ± 3.269
SerGly: 2.103 ± 2.44
SerHis: 0.0 ± 0.0
SerIle: 4.206 ± 1.12
SerLys: 11.567 ± 3.243
SerLeu: 6.309 ± 2.008
SerMet: 0.0 ± 0.0
SerAsn: 3.155 ± 1.004
SerPro: 4.206 ± 1.12
SerGln: 5.258 ± 0.985
SerArg: 2.103 ± 1.037
SerSer: 10.515 ± 3.744
SerThr: 8.412 ± 0.967
SerVal: 4.206 ± 3.476
SerTrp: 1.052 ± 0.519
SerTyr: 1.052 ± 1.46
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.206 ± 2.074
ThrCys: 1.052 ± 0.519
ThrAsp: 5.258 ± 0.985
ThrGlu: 7.361 ± 0.579
ThrPhe: 4.206 ± 1.56
ThrGly: 4.206 ± 1.12
ThrHis: 1.052 ± 1.46
ThrIle: 5.258 ± 0.985
ThrLys: 5.258 ± 1.717
ThrLeu: 3.155 ± 1.004
ThrMet: 0.0 ± 0.0
ThrAsn: 1.052 ± 0.519
ThrPro: 9.464 ± 3.013
ThrGln: 2.103 ± 1.037
ThrArg: 1.052 ± 1.46
ThrSer: 5.258 ± 2.086
ThrThr: 3.155 ± 2.568
ThrVal: 3.155 ± 1.556
ThrTrp: 2.103 ± 1.037
ThrTyr: 2.103 ± 1.037
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 0.0 ± 0.0
ValGlu: 1.052 ± 0.519
ValPhe: 0.0 ± 0.0
ValGly: 0.0 ± 0.0
ValHis: 0.0 ± 0.0
ValIle: 1.052 ± 0.519
ValLys: 2.103 ± 1.037
ValLeu: 3.155 ± 1.938
ValMet: 0.0 ± 0.0
ValAsn: 1.052 ± 0.519
ValPro: 1.052 ± 0.519
ValGln: 2.103 ± 1.037
ValArg: 2.103 ± 1.037
ValSer: 3.155 ± 1.556
ValThr: 3.155 ± 1.568
ValVal: 2.103 ± 1.738
ValTrp: 1.052 ± 0.519
ValTyr: 3.155 ± 1.568
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.052 ± 0.519
TrpCys: 0.0 ± 0.0
TrpAsp: 1.052 ± 0.519
TrpGlu: 1.052 ± 0.519
TrpPhe: 1.052 ± 0.519
TrpGly: 2.103 ± 1.037
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.052 ± 0.519
TrpLeu: 1.052 ± 0.519
TrpMet: 1.052 ± 2.03
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 2.103 ± 1.141
TrpArg: 1.052 ± 0.519
TrpSer: 0.0 ± 0.0
TrpThr: 1.052 ± 0.519
TrpVal: 1.052 ± 0.519
TrpTrp: 2.103 ± 1.037
TrpTyr: 2.103 ± 1.037
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.155 ± 1.556
TyrCys: 0.0 ± 0.0
TyrAsp: 3.155 ± 1.556
TyrGlu: 1.052 ± 0.519
TyrPhe: 2.103 ± 1.037
TyrGly: 0.0 ± 0.0
TyrHis: 1.052 ± 0.519
TyrIle: 3.155 ± 1.556
TyrLys: 8.412 ± 0.908
TyrLeu: 8.412 ± 2.239
TyrMet: 1.052 ± 0.519
TyrAsn: 3.155 ± 1.568
TyrPro: 4.206 ± 1.12
TyrGln: 0.0 ± 0.0
TyrArg: 4.206 ± 2.074
TyrSer: 3.155 ± 1.004
TyrThr: 6.309 ± 1.832
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.103 ± 1.037
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (952 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
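Protein-level bootstrapping can be sketched roughly as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. The function names, the fixed seed, and the use of the population standard deviation as the error measure are assumptions for this example; the page only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled protein sets (assumed estimator)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences (hypothetical, not taken from the proteome).
toy = ["MKKA", "AKKR", "RRRA"]
print(bootstrap_error(toy, "KK"))
```

With only a few proteins, as here, each resample can differ considerably from the original set, which is one plausible reason some errors in the table are larger than the frequencies themselves.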

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski