Amino acid dipeptide frequency for Lesser panda anellovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
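The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_per_mille` and `classify` are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein, that pairs never span two proteins, and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything that is not a standard residue becomes X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs, counted separately for each protein.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


freqs = dipeptide_per_mille(["AAAA", "ACB"])  # toy input, B is non-standard
```

With the toy input, the five counted pairs are AA (three times), AC, and CX, so AA is labelled "over" and any pair that never occurs is labelled "under".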

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
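As a worked example of the unit conversion, the short Python sketch below converts one value from the table (ArgArg, 29.805‰) into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names are illustrative only.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


arg_arg = 29.805  # ArgArg value taken from the table, in per mille
fraction = per_mille_to_fraction(arg_arg)  # about 0.0298
percent = per_mille_to_percent(arg_arg)    # about 2.98 %
fold = fold_vs_uniform(arg_arg)            # about 11.9 times the uniform value
```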

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.055AlaAla: 2.055 ± 0.723
2.055AlaCys: 2.055 ± 1.996
2.055AlaAsp: 2.055 ± 1.996
0.0AlaGlu: 0.0 ± 0.0
3.083AlaPhe: 3.083 ± 1.827
5.139AlaGly: 5.139 ± 1.473
2.055AlaHis: 2.055 ± 1.218
0.0AlaIle: 0.0 ± 0.0
7.194AlaLys: 7.194 ± 2.92
1.028AlaLeu: 1.028 ± 0.609
1.028AlaMet: 1.028 ± 0.609
0.0AlaAsn: 0.0 ± 0.0
2.055AlaPro: 2.055 ± 0.723
2.055AlaGln: 2.055 ± 1.218
2.055AlaArg: 2.055 ± 2.141
3.083AlaSer: 3.083 ± 3.212
1.028AlaThr: 1.028 ± 0.609
6.166AlaVal: 6.166 ± 2.878
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.028CysGlu: 1.028 ± 0.609
0.0CysPhe: 0.0 ± 0.0
2.055CysGly: 2.055 ± 0.723
0.0CysHis: 0.0 ± 0.0
1.028CysIle: 1.028 ± 0.609
4.111CysLys: 4.111 ± 3.991
2.055CysLeu: 2.055 ± 1.996
0.0CysMet: 0.0 ± 0.0
2.055CysAsn: 2.055 ± 1.996
4.111CysPro: 4.111 ± 2.082
1.028CysGln: 1.028 ± 0.609
2.055CysArg: 2.055 ± 1.996
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
2.055CysVal: 2.055 ± 1.218
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.111AspAla: 4.111 ± 3.991
0.0AspCys: 0.0 ± 0.0
5.139AspAsp: 5.139 ± 0.686
3.083AspGlu: 3.083 ± 1.439
1.028AspPhe: 1.028 ± 0.609
3.083AspGly: 3.083 ± 1.827
0.0AspHis: 0.0 ± 0.0
2.055AspIle: 2.055 ± 1.218
1.028AspLys: 1.028 ± 0.609
6.166AspLeu: 6.166 ± 0.885
1.028AspMet: 1.028 ± 0.609
3.083AspAsn: 3.083 ± 1.827
3.083AspPro: 3.083 ± 1.827
1.028AspGln: 1.028 ± 0.609
3.083AspArg: 3.083 ± 2.691
5.139AspSer: 5.139 ± 1.473
3.083AspThr: 3.083 ± 1.439
0.0AspVal: 0.0 ± 0.0
4.111AspTrp: 4.111 ± 0.949
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.055GluAla: 2.055 ± 0.723
0.0GluCys: 0.0 ± 0.0
3.083GluAsp: 3.083 ± 1.439
8.222GluGlu: 8.222 ± 4.164
2.055GluPhe: 2.055 ± 1.996
6.166GluGly: 6.166 ± 2.878
2.055GluHis: 2.055 ± 1.996
0.0GluIle: 0.0 ± 0.0
2.055GluLys: 2.055 ± 1.218
1.028GluLeu: 1.028 ± 0.609
0.0GluMet: 0.0 ± 0.0
0.0GluAsn: 0.0 ± 0.0
0.0GluPro: 0.0 ± 0.0
1.028GluGln: 1.028 ± 0.609
5.139GluArg: 5.139 ± 0.686
8.222GluSer: 8.222 ± 4.164
6.166GluThr: 6.166 ± 3.967
4.111GluVal: 4.111 ± 0.949
2.055GluTrp: 2.055 ± 1.218
4.111GluTyr: 4.111 ± 2.082
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
5.139PheAsp: 5.139 ± 3.426
0.0PheGlu: 0.0 ± 0.0
2.055PhePhe: 2.055 ± 1.218
2.055PheGly: 2.055 ± 1.218
0.0PheHis: 0.0 ± 0.0
2.055PheIle: 2.055 ± 1.996
0.0PheLys: 0.0 ± 0.0
3.083PheLeu: 3.083 ± 0.801
1.028PheMet: 1.028 ± 0.609
0.0PheAsn: 0.0 ± 0.0
2.055PhePro: 2.055 ± 1.218
0.0PheGln: 0.0 ± 0.0
5.139PheArg: 5.139 ± 3.044
4.111PheSer: 4.111 ± 0.949
2.055PheThr: 2.055 ± 1.218
4.111PheVal: 4.111 ± 2.435
3.083PheTrp: 3.083 ± 1.827
2.055PheTyr: 2.055 ± 1.218
0.0PheXaa: 0.0 ± 0.0
Gly
6.166GlyAla: 6.166 ± 0.864
3.083GlyCys: 3.083 ± 1.439
4.111GlyAsp: 4.111 ± 0.949
9.25GlyGlu: 9.25 ± 4.317
4.111GlyPhe: 4.111 ± 0.949
11.305GlyGly: 11.305 ± 6.298
3.083GlyHis: 3.083 ± 1.439
4.111GlyIle: 4.111 ± 2.435
2.055GlyLys: 2.055 ± 1.996
5.139GlyLeu: 5.139 ± 0.686
0.0GlyMet: 0.0 ± 0.886
1.028GlyAsn: 1.028 ± 0.609
1.028GlyPro: 1.028 ± 0.609
5.139GlyGln: 5.139 ± 3.426
3.083GlyArg: 3.083 ± 0.801
6.166GlySer: 6.166 ± 5.547
6.166GlyThr: 6.166 ± 1.602
4.111GlyVal: 4.111 ± 0.949
2.055GlyTrp: 2.055 ± 1.218
1.028GlyTyr: 1.028 ± 0.609
0.0GlyXaa: 0.0 ± 0.0
His
1.028HisAla: 1.028 ± 0.609
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
5.139HisPhe: 5.139 ± 0.686
1.028HisGly: 1.028 ± 1.071
0.0HisHis: 0.0 ± 0.0
2.055HisIle: 2.055 ± 1.218
1.028HisLys: 1.028 ± 0.609
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
6.166HisPro: 6.166 ± 2.333
2.055HisGln: 2.055 ± 1.218
0.0HisArg: 0.0 ± 0.0
5.139HisSer: 5.139 ± 3.426
0.0HisThr: 0.0 ± 0.0
3.083HisVal: 3.083 ± 1.439
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.028IleAla: 1.028 ± 0.609
0.0IleCys: 0.0 ± 0.0
1.028IleAsp: 1.028 ± 0.609
1.028IleGlu: 1.028 ± 0.609
5.139IlePhe: 5.139 ± 3.044
3.083IleGly: 3.083 ± 1.827
1.028IleHis: 1.028 ± 0.609
3.083IleIle: 3.083 ± 1.827
0.0IleLys: 0.0 ± 0.0
3.083IleLeu: 3.083 ± 2.691
1.028IleMet: 1.028 ± 0.609
5.139IleAsn: 5.139 ± 3.044
2.055IlePro: 2.055 ± 1.218
4.111IleGln: 4.111 ± 2.435
2.055IleArg: 2.055 ± 1.218
1.028IleSer: 1.028 ± 0.609
2.055IleThr: 2.055 ± 1.218
1.028IleVal: 1.028 ± 0.609
1.028IleTrp: 1.028 ± 0.609
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
0.0LysAla: 0.0 ± 0.0
3.083LysCys: 3.083 ± 1.439
0.0LysAsp: 0.0 ± 0.0
1.028LysGlu: 1.028 ± 0.609
1.028LysPhe: 1.028 ± 1.071
2.055LysGly: 2.055 ± 1.996
1.028LysHis: 1.028 ± 0.609
2.055LysIle: 2.055 ± 1.218
5.139LysLys: 5.139 ± 1.4
3.083LysLeu: 3.083 ± 1.827
1.028LysMet: 1.028 ± 0.609
4.111LysAsn: 4.111 ± 2.435
1.028LysPro: 1.028 ± 1.071
2.055LysGln: 2.055 ± 1.218
8.222LysArg: 8.222 ± 1.281
1.028LysSer: 1.028 ± 0.609
5.139LysThr: 5.139 ± 1.473
4.111LysVal: 4.111 ± 1.225
3.083LysTrp: 3.083 ± 1.827
2.055LysTyr: 2.055 ± 1.218
0.0LysXaa: 0.0 ± 0.0
Leu
4.111LeuAla: 4.111 ± 1.225
0.0LeuCys: 0.0 ± 0.0
1.028LeuAsp: 1.028 ± 0.609
3.083LeuGlu: 3.083 ± 1.439
1.028LeuPhe: 1.028 ± 0.609
3.083LeuGly: 3.083 ± 1.439
2.055LeuHis: 2.055 ± 1.218
1.028LeuIle: 1.028 ± 0.609
3.083LeuLys: 3.083 ± 1.827
6.166LeuLeu: 6.166 ± 0.864
1.028LeuMet: 1.028 ± 0.609
3.083LeuAsn: 3.083 ± 0.801
6.166LeuPro: 6.166 ± 0.864
1.028LeuGln: 1.028 ± 1.071
3.083LeuArg: 3.083 ± 0.801
5.139LeuSer: 5.139 ± 1.473
7.194LeuThr: 7.194 ± 0.256
1.028LeuVal: 1.028 ± 0.609
5.139LeuTrp: 5.139 ± 3.426
2.055LeuTyr: 2.055 ± 1.218
0.0LeuXaa: 0.0 ± 0.0
Met
1.028MetAla: 1.028 ± 0.609
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.028MetGlu: 1.028 ± 0.609
0.0MetPhe: 0.0 ± 0.0
2.055MetGly: 2.055 ± 1.996
2.055MetHis: 2.055 ± 1.218
0.0MetIle: 0.0 ± 0.0
1.028MetLys: 1.028 ± 0.609
3.083MetLeu: 3.083 ± 0.801
0.0MetMet: 0.0 ± 0.0
1.028MetAsn: 1.028 ± 0.609
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.028MetArg: 1.028 ± 0.609
0.0MetSer: 0.0 ± 0.0
0.0MetThr: 0.0 ± 0.0
3.083MetVal: 3.083 ± 1.439
1.028MetTrp: 1.028 ± 0.609
1.028MetTyr: 1.028 ± 0.609
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
5.139AsnAsp: 5.139 ± 0.686
1.028AsnGlu: 1.028 ± 0.609
1.028AsnPhe: 1.028 ± 0.609
1.028AsnGly: 1.028 ± 0.609
2.055AsnHis: 2.055 ± 1.218
4.111AsnIle: 4.111 ± 2.435
3.083AsnLys: 3.083 ± 0.801
2.055AsnLeu: 2.055 ± 1.218
0.0AsnMet: 0.0 ± 0.0
2.055AsnAsn: 2.055 ± 1.218
2.055AsnPro: 2.055 ± 1.218
0.0AsnGln: 0.0 ± 0.0
1.028AsnArg: 1.028 ± 0.609
4.111AsnSer: 4.111 ± 1.447
1.028AsnThr: 1.028 ± 0.609
3.083AsnVal: 3.083 ± 1.827
0.0AsnTrp: 0.0 ± 0.0
1.028AsnTyr: 1.028 ± 0.609
0.0AsnXaa: 0.0 ± 0.0
Pro
6.166ProAla: 6.166 ± 2.333
1.028ProCys: 1.028 ± 0.609
5.139ProAsp: 5.139 ± 0.686
6.166ProGlu: 6.166 ± 2.878
2.055ProPhe: 2.055 ± 1.218
1.028ProGly: 1.028 ± 1.071
2.055ProHis: 2.055 ± 1.218
4.111ProIle: 4.111 ± 2.435
4.111ProLys: 4.111 ± 1.225
0.0ProLeu: 0.0 ± 0.0
1.028ProMet: 1.028 ± 0.609
3.083ProAsn: 3.083 ± 0.801
8.222ProPro: 8.222 ± 3.515
3.083ProGln: 3.083 ± 1.827
4.111ProArg: 4.111 ± 2.435
7.194ProSer: 7.194 ± 2.079
2.055ProThr: 2.055 ± 0.723
2.055ProVal: 2.055 ± 0.723
1.028ProTrp: 1.028 ± 0.609
5.139ProTyr: 5.139 ± 1.473
1.028ProXaa: 1.028 ± 0.609
Gln
1.028GlnAla: 1.028 ± 0.609
0.0GlnCys: 0.0 ± 0.0
1.028GlnAsp: 1.028 ± 1.071
0.0GlnGlu: 0.0 ± 0.0
1.028GlnPhe: 1.028 ± 0.609
3.083GlnGly: 3.083 ± 1.827
1.028GlnHis: 1.028 ± 0.609
1.028GlnIle: 1.028 ± 0.609
2.055GlnLys: 2.055 ± 0.723
1.028GlnLeu: 1.028 ± 0.609
4.111GlnMet: 4.111 ± 0.949
1.028GlnAsn: 1.028 ± 0.609
3.083GlnPro: 3.083 ± 1.827
0.0GlnGln: 0.0 ± 0.0
2.055GlnArg: 2.055 ± 1.218
6.166GlnSer: 6.166 ± 2.878
2.055GlnThr: 2.055 ± 0.723
1.028GlnVal: 1.028 ± 0.609
1.028GlnTrp: 1.028 ± 0.609
3.083GlnTyr: 3.083 ± 1.827
0.0GlnXaa: 0.0 ± 0.0
Arg
2.055ArgAla: 2.055 ± 0.723
1.028ArgCys: 1.028 ± 1.071
3.083ArgAsp: 3.083 ± 0.801
6.166ArgGlu: 6.166 ± 2.394
2.055ArgPhe: 2.055 ± 1.996
5.139ArgGly: 5.139 ± 0.686
1.028ArgHis: 1.028 ± 0.609
3.083ArgIle: 3.083 ± 1.827
7.194ArgLys: 7.194 ± 1.355
1.028ArgLeu: 1.028 ± 1.071
2.055ArgMet: 2.055 ± 1.62
2.055ArgAsn: 2.055 ± 1.218
6.166ArgPro: 6.166 ± 2.333
4.111ArgGln: 4.111 ± 2.082
29.805ArgArg: 29.805 ± 4.04
12.333ArgSer: 12.333 ± 3.658
8.222ArgThr: 8.222 ± 5.988
3.083ArgVal: 3.083 ± 1.827
4.111ArgTrp: 4.111 ± 2.435
2.055ArgTyr: 2.055 ± 0.723
0.0ArgXaa: 0.0 ± 0.0
Ser
4.111SerAla: 4.111 ± 2.082
6.166SerCys: 6.166 ± 4.014
1.028SerAsp: 1.028 ± 0.609
12.333SerGlu: 12.333 ± 6.359
4.111SerPhe: 4.111 ± 2.435
7.194SerGly: 7.194 ± 4.773
0.0SerHis: 0.0 ± 0.0
1.028SerIle: 1.028 ± 0.609
1.028SerLys: 1.028 ± 0.609
7.194SerLeu: 7.194 ± 1.821
0.0SerMet: 0.0 ± 0.0
2.055SerAsn: 2.055 ± 0.723
12.333SerPro: 12.333 ± 1.396
1.028SerGln: 1.028 ± 0.609
11.305SerArg: 11.305 ± 5.471
6.166SerSer: 6.166 ± 2.394
10.277SerThr: 10.277 ± 2.946
1.028SerVal: 1.028 ± 1.071
1.028SerTrp: 1.028 ± 1.071
1.028SerTyr: 1.028 ± 0.609
0.0SerXaa: 0.0 ± 0.0
Thr
1.028ThrAla: 1.028 ± 1.071
0.0ThrCys: 0.0 ± 0.0
6.166ThrAsp: 6.166 ± 0.885
1.028ThrGlu: 1.028 ± 0.609
2.055ThrPhe: 2.055 ± 1.218
11.305ThrGly: 11.305 ± 5.471
4.111ThrHis: 4.111 ± 3.576
1.028ThrIle: 1.028 ± 1.071
1.028ThrLys: 1.028 ± 0.609
7.194ThrLeu: 7.194 ± 0.256
2.055ThrMet: 2.055 ± 1.046
1.028ThrAsn: 1.028 ± 0.609
1.028ThrPro: 1.028 ± 0.609
2.055ThrGln: 2.055 ± 0.723
4.111ThrArg: 4.111 ± 2.082
7.194ThrSer: 7.194 ± 3.404
2.055ThrThr: 2.055 ± 1.218
1.028ThrVal: 1.028 ± 0.609
4.111ThrTrp: 4.111 ± 0.949
3.083ThrTyr: 3.083 ± 0.801
0.0ThrXaa: 0.0 ± 0.0
Val
4.111ValAla: 4.111 ± 0.949
0.0ValCys: 0.0 ± 0.0
2.055ValAsp: 2.055 ± 1.218
2.055ValGlu: 2.055 ± 1.996
0.0ValPhe: 0.0 ± 0.0
5.139ValGly: 5.139 ± 0.686
3.083ValHis: 3.083 ± 1.439
1.028ValIle: 1.028 ± 0.609
1.028ValLys: 1.028 ± 1.071
2.055ValLeu: 2.055 ± 1.218
0.0ValMet: 0.0 ± 0.0
1.028ValAsn: 1.028 ± 0.609
7.194ValPro: 7.194 ± 1.355
1.028ValGln: 1.028 ± 0.609
5.139ValArg: 5.139 ± 0.686
4.111ValSer: 4.111 ± 1.225
3.083ValThr: 3.083 ± 0.801
0.0ValVal: 0.0 ± 0.0
2.055ValTrp: 2.055 ± 1.218
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.028TrpAla: 1.028 ± 0.609
4.111TrpCys: 4.111 ± 0.949
3.083TrpAsp: 3.083 ± 1.827
1.028TrpGlu: 1.028 ± 0.609
1.028TrpPhe: 1.028 ± 0.609
3.083TrpGly: 3.083 ± 1.827
0.0TrpHis: 0.0 ± 0.0
4.111TrpIle: 4.111 ± 2.435
0.0TrpLys: 0.0 ± 0.0
5.139TrpLeu: 5.139 ± 0.686
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.028TrpPro: 1.028 ± 1.071
2.055TrpGln: 2.055 ± 1.218
7.194TrpArg: 7.194 ± 2.36
3.083TrpSer: 3.083 ± 1.827
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
5.139TrpTrp: 5.139 ± 3.044
2.055TrpTyr: 2.055 ± 1.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.028TyrAla: 1.028 ± 1.071
1.028TyrCys: 1.028 ± 0.609
1.028TyrAsp: 1.028 ± 0.609
0.0TyrGlu: 0.0 ± 0.0
0.0TyrPhe: 0.0 ± 0.0
4.111TyrGly: 4.111 ± 2.082
0.0TyrHis: 0.0 ± 0.0
1.028TyrIle: 1.028 ± 0.609
4.111TyrLys: 4.111 ± 1.225
0.0TyrLeu: 0.0 ± 0.0
1.028TyrMet: 1.028 ± 0.609
2.055TyrAsn: 2.055 ± 1.218
1.028TyrPro: 1.028 ± 0.609
2.055TyrGln: 2.055 ± 1.218
6.166TyrArg: 6.166 ± 0.864
1.028TyrSer: 1.028 ± 0.609
1.028TyrThr: 1.028 ± 0.609
0.0TyrVal: 0.0 ± 0.0
3.083TyrTrp: 3.083 ± 1.827
1.028TyrTyr: 1.028 ± 0.609
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
1.028XaaGly: 1.028 ± 0.609
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (974 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
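One plausible reading of "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported as the error. The exact procedure used by Proteome-pI is not described here, so the function name `bootstrap_error`, the use of the population standard deviation, and the fixed random seed are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: each replicate draws len(sequences) proteins with
    replacement, and the error is the standard deviation over replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy sequences, not the proteome analysed on this page.
toy = ["AAAA", "CCCC", "AACC"]
err = bootstrap_error(toy, "AA")
```

With only three proteins, as in this proteome, the number of distinct resamples is small, so the estimated errors can take only a limited set of values.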

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski