Amino acid dipeptide frequency for Torque teno virus (isolate Human/China/CT39F/2001) (TTV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
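The calculation can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_frequencies`, the one-letter input sequences, and the choice to count only pairs inside a single protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Hypothetical helper: non-standard or unknown residues are mapped to X (Xaa),
    and pairs are counted only within each protein (an assumption).
    """
    counts = Counter()
    for seq in sequences:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs: positions (0,1), (1,2), (2,3), ...
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the TTV proteome
freqs = dipeptide_frequencies(["MAAGK", "AAB"])
```

In the toy input, "B" is not a standard residue, so the last pair of the second sequence is counted as "AX".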

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
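The expected value and the red/blue marking can be illustrated with a short Python sketch. The function names are hypothetical, and the simple comparison against the uniform expectation is an assumption; the page does not state the exact rule used for colouring. The two observed values are taken from the table (ArgArg and AlaCys).

```python
def expected_uniform_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"    # would be marked in red
    if observed_permille < expected_permille:
        return "under"   # would be marked in blue
    return "expected"

expected = expected_uniform_permille(20)  # 1000 / 400 = 2.5 per mille (0.25%)
labels = {
    "ArgArg": classify(22.695, expected),
    "AlaCys": classify(0.0, expected),
}
```

With Xaa included as a 21st residue, `expected_uniform_permille(21)` gives 1000 / 441, roughly 2.27‰.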

All values are presented in per mille (‰); multiply them by 10^-3 to obtain fractions.
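As a small worked example of the unit conversion, the Python sketch below converts one table value (GlyGly, 18.44‰) to a fraction and to a percentage. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value_permille / 10.0

gly_gly = 18.44                           # GlyGly value from the table, in per mille
fraction = permille_to_fraction(gly_gly)  # about 0.01844
percent = permille_to_percent(gly_gly)    # about 1.844 %
```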

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
13.475AlaAla: 13.475 ± 6.688
0.0AlaCys: 0.0 ± 0.0
5.674AlaAsp: 5.674 ± 2.024
4.965AlaGlu: 4.965 ± 0.921
0.709AlaPhe: 0.709 ± 0.395
6.383AlaGly: 6.383 ± 2.408
0.709AlaHis: 0.709 ± 0.395
0.0AlaIle: 0.0 ± 0.0
3.546AlaLys: 3.546 ± 1.333
7.801AlaLeu: 7.801 ± 3.394
2.128AlaMet: 2.128 ± 0.808
0.709AlaAsn: 0.709 ± 0.816
8.511AlaPro: 8.511 ± 4.465
2.128AlaGln: 2.128 ± 0.808
4.965AlaArg: 4.965 ± 3.035
1.418AlaSer: 1.418 ± 0.664
2.837AlaThr: 2.837 ± 0.961
4.965AlaVal: 4.965 ± 2.325
0.709AlaTrp: 0.709 ± 0.395
0.709AlaTyr: 0.709 ± 0.395
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.709CysAsp: 0.709 ± 0.395
0.709CysGlu: 0.709 ± 0.395
0.709CysPhe: 0.709 ± 0.395
4.965CysGly: 4.965 ± 2.384
0.709CysHis: 0.709 ± 0.395
0.709CysIle: 0.709 ± 0.395
1.418CysLys: 1.418 ± 1.643
2.128CysLeu: 2.128 ± 0.726
0.709CysMet: 0.709 ± 0.395
0.0CysAsn: 0.0 ± 0.0
0.709CysPro: 0.709 ± 0.395
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.709CysSer: 0.709 ± 0.395
0.709CysThr: 0.709 ± 0.395
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.709AspAla: 0.709 ± 0.821
0.0AspCys: 0.0 ± 0.0
4.965AspAsp: 4.965 ± 2.384
2.837AspGlu: 2.837 ± 1.846
3.546AspPhe: 3.546 ± 0.681
4.965AspGly: 4.965 ± 2.384
0.0AspHis: 0.0 ± 0.0
1.418AspIle: 1.418 ± 0.789
2.837AspLys: 2.837 ± 1.579
5.674AspLeu: 5.674 ± 0.806
1.418AspMet: 1.418 ± 0.789
2.128AspAsn: 2.128 ± 0.726
9.22AspPro: 9.22 ± 2.496
0.0AspGln: 0.0 ± 0.0
0.709AspArg: 0.709 ± 0.816
0.709AspSer: 0.709 ± 0.395
1.418AspThr: 1.418 ± 0.789
1.418AspVal: 1.418 ± 0.664
1.418AspTrp: 1.418 ± 0.713
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.837GluAla: 2.837 ± 1.579
0.0GluCys: 0.0 ± 0.0
1.418GluAsp: 1.418 ± 0.789
7.801GluGlu: 7.801 ± 1.794
1.418GluPhe: 1.418 ± 0.789
1.418GluGly: 1.418 ± 0.664
2.837GluHis: 2.837 ± 1.012
0.709GluIle: 0.709 ± 0.395
0.709GluLys: 0.709 ± 0.395
1.418GluLeu: 1.418 ± 0.664
0.709GluMet: 0.709 ± 0.818
3.546GluAsn: 3.546 ± 1.278
0.0GluPro: 0.0 ± 0.0
4.255GluGln: 4.255 ± 1.719
6.383GluArg: 6.383 ± 3.467
10.638GluSer: 10.638 ± 3.675
3.546GluThr: 3.546 ± 1.472
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
2.128GluTyr: 2.128 ± 0.651
0.0GluXaa: 0.0 ± 0.0
Phe
2.128PheAla: 2.128 ± 1.377
0.709PheCys: 0.709 ± 0.395
0.709PheAsp: 0.709 ± 0.395
1.418PheGlu: 1.418 ± 0.789
0.709PhePhe: 0.709 ± 0.395
0.709PheGly: 0.709 ± 0.395
0.0PheHis: 0.0 ± 0.0
2.128PheIle: 2.128 ± 1.184
0.709PheLys: 0.709 ± 0.395
2.837PheLeu: 2.837 ± 1.579
0.0PheMet: 0.0 ± 0.0
2.128PheAsn: 2.128 ± 0.726
3.546PhePro: 3.546 ± 1.278
2.837PheGln: 2.837 ± 1.053
2.837PheArg: 2.837 ± 1.579
2.837PheSer: 2.837 ± 1.579
2.128PheThr: 2.128 ± 1.184
0.709PheVal: 0.709 ± 0.395
0.0PheTrp: 0.0 ± 0.0
2.128PheTyr: 2.128 ± 0.651
0.0PheXaa: 0.0 ± 0.0
Gly
6.383GlyAla: 6.383 ± 1.679
2.837GlyCys: 2.837 ± 1.012
5.674GlyAsp: 5.674 ± 2.776
2.837GlyGlu: 2.837 ± 1.012
0.0GlyPhe: 0.0 ± 0.0
18.44GlyGly: 18.44 ± 6.013
1.418GlyHis: 1.418 ± 0.789
2.837GlyIle: 2.837 ± 1.012
2.837GlyLys: 2.837 ± 1.579
7.801GlyLeu: 7.801 ± 1.705
0.0GlyMet: 0.0 ± 0.0
9.22GlyAsn: 9.22 ± 2.687
6.383GlyPro: 6.383 ± 1.437
0.709GlyGln: 0.709 ± 0.395
3.546GlyArg: 3.546 ± 1.278
6.383GlySer: 6.383 ± 1.618
0.709GlyThr: 0.709 ± 0.395
3.546GlyVal: 3.546 ± 0.681
1.418GlyTrp: 1.418 ± 0.789
1.418GlyTyr: 1.418 ± 0.789
0.0GlyXaa: 0.0 ± 0.0
His
2.837HisAla: 2.837 ± 1.012
0.709HisCys: 0.709 ± 0.395
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
2.128HisGly: 2.128 ± 1.377
0.709HisHis: 0.709 ± 0.395
0.709HisIle: 0.709 ± 0.395
0.709HisLys: 0.709 ± 0.395
6.383HisLeu: 6.383 ± 1.679
0.709HisMet: 0.709 ± 0.395
4.965HisAsn: 4.965 ± 2.384
3.546HisPro: 3.546 ± 0.681
0.709HisGln: 0.709 ± 0.395
2.128HisArg: 2.128 ± 1.377
0.709HisSer: 0.709 ± 0.395
0.0HisThr: 0.0 ± 0.0
0.709HisVal: 0.709 ± 0.395
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
1.418IleCys: 1.418 ± 0.789
1.418IleAsp: 1.418 ± 0.789
0.709IleGlu: 0.709 ± 0.395
0.709IlePhe: 0.709 ± 0.395
0.709IleGly: 0.709 ± 0.395
0.0IleHis: 0.0 ± 0.0
0.709IleIle: 0.709 ± 0.395
1.418IleLys: 1.418 ± 0.789
3.546IleLeu: 3.546 ± 1.973
1.418IleMet: 1.418 ± 0.713
1.418IleAsn: 1.418 ± 0.789
4.965IlePro: 4.965 ± 0.526
1.418IleGln: 1.418 ± 0.789
4.965IleArg: 4.965 ± 0.921
2.128IleSer: 2.128 ± 0.726
2.837IleThr: 2.837 ± 1.579
3.546IleVal: 3.546 ± 1.973
2.837IleTrp: 2.837 ± 1.012
2.128IleTyr: 2.128 ± 1.184
0.0IleXaa: 0.0 ± 0.0
Lys
1.418LysAla: 1.418 ± 0.664
0.709LysCys: 0.709 ± 0.395
2.128LysAsp: 2.128 ± 1.184
2.128LysGlu: 2.128 ± 1.434
1.418LysPhe: 1.418 ± 0.789
3.546LysGly: 3.546 ± 1.973
0.709LysHis: 0.709 ± 0.395
4.965LysIle: 4.965 ± 2.085
3.546LysLys: 3.546 ± 1.333
0.0LysLeu: 0.0 ± 0.0
0.0LysMet: 0.0 ± 0.684
2.837LysAsn: 2.837 ± 1.053
4.965LysPro: 4.965 ± 1.007
0.709LysGln: 0.709 ± 0.816
9.22LysArg: 9.22 ± 1.264
3.546LysSer: 3.546 ± 2.08
2.128LysThr: 2.128 ± 1.434
2.128LysVal: 2.128 ± 1.184
1.418LysTrp: 1.418 ± 0.789
2.837LysTyr: 2.837 ± 0.961
0.0LysXaa: 0.0 ± 0.0
Leu
5.674LeuAla: 5.674 ± 1.677
0.709LeuCys: 0.709 ± 0.395
4.965LeuAsp: 4.965 ± 2.384
3.546LeuGlu: 3.546 ± 0.681
2.837LeuPhe: 2.837 ± 1.579
2.837LeuGly: 2.837 ± 1.053
3.546LeuHis: 3.546 ± 0.681
1.418LeuIle: 1.418 ± 0.789
4.965LeuLys: 4.965 ± 1.977
4.255LeuLeu: 4.255 ± 0.864
1.418LeuMet: 1.418 ± 0.713
4.255LeuAsn: 4.255 ± 1.719
4.255LeuPro: 4.255 ± 2.179
4.255LeuGln: 4.255 ± 1.719
3.546LeuArg: 3.546 ± 1.37
2.128LeuSer: 2.128 ± 0.726
7.092LeuThr: 7.092 ± 0.224
3.546LeuVal: 3.546 ± 0.681
2.837LeuTrp: 2.837 ± 1.012
3.546LeuTyr: 3.546 ± 1.973
0.0LeuXaa: 0.0 ± 0.0
Met
1.418MetAla: 1.418 ± 0.789
0.709MetCys: 0.709 ± 0.821
0.0MetAsp: 0.0 ± 0.0
1.418MetGlu: 1.418 ± 0.713
0.709MetPhe: 0.709 ± 0.395
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.709MetLys: 0.709 ± 0.395
1.418MetLeu: 1.418 ± 0.789
0.0MetMet: 0.0 ± 0.0
0.709MetAsn: 0.709 ± 0.821
2.128MetPro: 2.128 ± 1.184
0.709MetGln: 0.709 ± 0.395
0.0MetArg: 0.0 ± 0.0
2.837MetSer: 2.837 ± 1.012
0.709MetThr: 0.709 ± 0.395
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.709AsnAla: 0.709 ± 0.816
0.0AsnCys: 0.0 ± 0.0
0.709AsnAsp: 0.709 ± 0.395
2.837AsnGlu: 2.837 ± 1.012
1.418AsnPhe: 1.418 ± 0.789
1.418AsnGly: 1.418 ± 0.789
2.128AsnHis: 2.128 ± 1.377
4.965AsnIle: 4.965 ± 2.763
3.546AsnLys: 3.546 ± 1.333
0.709AsnLeu: 0.709 ± 0.395
1.418AsnMet: 1.418 ± 0.789
3.546AsnAsn: 3.546 ± 0.512
6.383AsnPro: 6.383 ± 0.74
2.837AsnGln: 2.837 ± 1.012
1.418AsnArg: 1.418 ± 0.986
0.709AsnSer: 0.709 ± 0.816
2.128AsnThr: 2.128 ± 1.184
4.255AsnVal: 4.255 ± 1.153
0.709AsnTrp: 0.709 ± 0.395
4.255AsnTyr: 4.255 ± 1.616
0.0AsnXaa: 0.0 ± 0.0
Pro
12.057ProAla: 12.057 ± 5.035
1.418ProCys: 1.418 ± 0.789
4.965ProAsp: 4.965 ± 0.526
2.128ProGlu: 2.128 ± 1.487
2.837ProPhe: 2.837 ± 1.053
9.22ProGly: 9.22 ± 3.475
0.709ProHis: 0.709 ± 0.395
2.128ProIle: 2.128 ± 1.184
4.965ProLys: 4.965 ± 1.007
5.674ProLeu: 5.674 ± 1.988
0.0ProMet: 0.0 ± 0.0
1.418ProAsn: 1.418 ± 0.789
14.184ProPro: 14.184 ± 4.722
3.546ProGln: 3.546 ± 0.681
10.638ProArg: 10.638 ± 4.997
3.546ProSer: 3.546 ± 1.333
2.837ProThr: 2.837 ± 1.426
5.674ProVal: 5.674 ± 2.862
3.546ProTrp: 3.546 ± 0.681
3.546ProTyr: 3.546 ± 1.37
0.0ProXaa: 0.0 ± 0.0
Gln
0.709GlnAla: 0.709 ± 0.395
0.0GlnCys: 0.0 ± 0.0
0.709GlnAsp: 0.709 ± 0.395
3.546GlnGlu: 3.546 ± 1.973
0.709GlnPhe: 0.709 ± 0.395
0.709GlnGly: 0.709 ± 0.395
0.0GlnHis: 0.0 ± 0.0
3.546GlnIle: 3.546 ± 0.681
2.837GlnLys: 2.837 ± 1.327
5.674GlnLeu: 5.674 ± 2.106
0.709GlnMet: 0.709 ± 0.395
1.418GlnAsn: 1.418 ± 0.986
3.546GlnPro: 3.546 ± 0.681
5.674GlnGln: 5.674 ± 3.157
2.128GlnArg: 2.128 ± 1.184
0.709GlnSer: 0.709 ± 0.395
2.128GlnThr: 2.128 ± 1.184
2.837GlnVal: 2.837 ± 1.053
1.418GlnTrp: 1.418 ± 0.713
1.418GlnTyr: 1.418 ± 0.713
0.0GlnXaa: 0.0 ± 0.0
Arg
7.801ArgAla: 7.801 ± 4.199
0.709ArgCys: 0.709 ± 0.395
2.837ArgAsp: 2.837 ± 1.293
4.255ArgGlu: 4.255 ± 1.616
2.128ArgPhe: 2.128 ± 1.184
4.965ArgGly: 4.965 ± 1.996
4.965ArgHis: 4.965 ± 2.384
1.418ArgIle: 1.418 ± 0.789
5.674ArgLys: 5.674 ± 1.179
4.255ArgLeu: 4.255 ± 0.805
0.0ArgMet: 0.0 ± 0.0
0.709ArgAsn: 0.709 ± 0.395
5.674ArgPro: 5.674 ± 0.705
2.837ArgGln: 2.837 ± 0.961
22.695ArgArg: 22.695 ± 5.165
10.638ArgSer: 10.638 ± 3.75
2.128ArgThr: 2.128 ± 0.808
3.546ArgVal: 3.546 ± 0.512
2.837ArgTrp: 2.837 ± 1.579
2.837ArgTyr: 2.837 ± 1.012
0.0ArgXaa: 0.0 ± 0.0
Ser
0.709SerAla: 0.709 ± 0.816
0.0SerCys: 0.0 ± 0.0
2.837SerAsp: 2.837 ± 2.237
5.674SerGlu: 5.674 ± 1.923
4.255SerPhe: 4.255 ± 1.451
9.22SerGly: 9.22 ± 3.381
3.546SerHis: 3.546 ± 0.681
4.255SerIle: 4.255 ± 2.269
2.837SerLys: 2.837 ± 1.327
4.965SerLeu: 4.965 ± 1.161
0.709SerMet: 0.709 ± 0.69
2.128SerAsn: 2.128 ± 1.184
5.674SerPro: 5.674 ± 0.865
2.128SerGln: 2.128 ± 1.621
4.255SerArg: 4.255 ± 1.8
12.057SerSer: 12.057 ± 11.093
4.255SerThr: 4.255 ± 0.864
2.837SerVal: 2.837 ± 1.012
0.0SerTrp: 0.0 ± 0.0
0.709SerTyr: 0.709 ± 0.395
0.0SerXaa: 0.0 ± 0.0
Thr
3.546ThrAla: 3.546 ± 0.681
1.418ThrCys: 1.418 ± 0.664
3.546ThrAsp: 3.546 ± 1.37
1.418ThrGlu: 1.418 ± 0.713
5.674ThrPhe: 5.674 ± 1.535
2.128ThrGly: 2.128 ± 0.808
0.0ThrHis: 0.0 ± 0.0
2.128ThrIle: 2.128 ± 1.184
3.546ThrLys: 3.546 ± 1.37
2.128ThrLeu: 2.128 ± 1.184
0.0ThrMet: 0.0 ± 0.0
1.418ThrAsn: 1.418 ± 0.789
2.837ThrPro: 2.837 ± 0.433
3.546ThrGln: 3.546 ± 1.973
2.128ThrArg: 2.128 ± 1.184
5.674ThrSer: 5.674 ± 2.39
2.128ThrThr: 2.128 ± 0.808
2.128ThrVal: 2.128 ± 0.651
1.418ThrTrp: 1.418 ± 0.789
0.709ThrTyr: 0.709 ± 0.395
0.0ThrXaa: 0.0 ± 0.0
Val
7.092ValAla: 7.092 ± 2.136
2.837ValCys: 2.837 ± 1.012
1.418ValAsp: 1.418 ± 0.713
1.418ValGlu: 1.418 ± 0.789
0.0ValPhe: 0.0 ± 0.0
2.837ValGly: 2.837 ± 1.846
3.546ValHis: 3.546 ± 0.681
2.128ValIle: 2.128 ± 1.184
2.128ValLys: 2.128 ± 1.184
2.837ValLeu: 2.837 ± 0.961
1.418ValMet: 1.418 ± 0.755
0.709ValAsn: 0.709 ± 0.395
4.965ValPro: 4.965 ± 0.87
0.709ValGln: 0.709 ± 0.395
4.255ValArg: 4.255 ± 0.463
2.837ValSer: 2.837 ± 1.325
2.837ValThr: 2.837 ± 1.293
1.418ValVal: 1.418 ± 0.789
0.0ValTrp: 0.0 ± 0.0
0.709ValTyr: 0.709 ± 0.395
0.0ValXaa: 0.0 ± 0.0
Trp
1.418TrpAla: 1.418 ± 0.713
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
3.546TrpGly: 3.546 ± 1.973
2.128TrpHis: 2.128 ± 1.377
0.0TrpIle: 0.0 ± 0.0
0.709TrpLys: 0.709 ± 0.395
1.418TrpLeu: 1.418 ± 0.713
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.709TrpPro: 0.709 ± 0.395
0.709TrpGln: 0.709 ± 0.395
4.255TrpArg: 4.255 ± 0.463
0.0TrpSer: 0.0 ± 0.0
2.837TrpThr: 2.837 ± 1.579
0.0TrpVal: 0.0 ± 0.0
2.128TrpTrp: 2.128 ± 1.184
3.546TrpTyr: 3.546 ± 0.681
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.418TyrAla: 1.418 ± 0.789
0.709TyrCys: 0.709 ± 0.821
0.709TyrAsp: 0.709 ± 0.395
2.128TyrGlu: 2.128 ± 1.377
1.418TyrPhe: 1.418 ± 0.789
4.255TyrGly: 4.255 ± 0.463
0.0TyrHis: 0.0 ± 0.0
2.128TyrIle: 2.128 ± 1.184
1.418TyrLys: 1.418 ± 0.986
0.709TyrLeu: 0.709 ± 0.821
0.0TyrMet: 0.0 ± 0.0
2.837TyrAsn: 2.837 ± 1.053
2.837TyrPro: 2.837 ± 1.325
0.709TyrGln: 0.709 ± 0.395
2.837TyrArg: 2.837 ± 1.579
2.837TyrSer: 2.837 ± 0.961
2.128TyrThr: 2.128 ± 1.184
2.837TyrVal: 2.837 ± 1.579
0.709TyrTrp: 0.709 ± 0.395
1.418TyrTyr: 1.418 ± 0.789
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1411 amino acids)

Note: The error was estimated by bootstrapping (100 times) at the protein level.
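A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the database's implementation: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation of the resampled values is reported as the error. The function names, the fixed seed, the toy sequences, and the use of the standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(sequences, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            counts[a + b] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Hypothetical helper; requires n_resamples >= 2.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences in one-letter code, not the TTV proteome
proteins = ["MAAGK", "AAPR", "GGAA", "MKRR"]
error = bootstrap_error(proteins, "AA")
```

With only a handful of proteins, as here, each resample can differ a lot from the original set, which is one reason the reported errors can be large relative to the values.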

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski