Amino acid dipeptide frequency for Tupaia glis polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
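As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and expresses them per mille (‰). The function name `dipeptide_permille`, the one-letter toy sequences, and the choice not to count pairs that span two proteins are assumptions made for this example; the exact counting convention used for the table is not stated on this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences in one-letter code, not taken from this proteome.
toy_proteins = ["MALLK", "MKLL"]
freqs = dipeptide_permille(toy_proteins)
print(round(freqs["LL"], 1))  # 2 of the 7 pairs are LL
```

The frequencies of all observed dipeptides sum to 1000‰ by construction.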

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue, in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
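The arithmetic above can be sketched as follows. The example converts per-mille values to fractions and compares a few values copied from the table with the uniform expectation of 1/400. The helper names `permille_to_fraction` and `classify` are hypothetical, and a simple above/below threshold is an assumption; the original page may colour cells with a different cut-off or a gradient.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if permille > expected:
        return "over"
    if permille < expected:
        return "under"
    return "as expected"


# A few values copied from the table below (per mille).
table_values = {"LeuLeu": 16.206, "AlaAla": 3.953, "CysCys": 0.395, "GlnGln": 0.0}
labels = {name: classify(value) for name, value in table_values.items()}

for name, value in table_values.items():
    print(name, permille_to_fraction(value), labels[name])
```

With 441 possibilities (Xaa included) the uniform expectation would instead be 1000/441, roughly 2.27‰.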

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.953 ± 1.488
AlaCys: 1.976 ± 0.825
AlaAsp: 1.976 ± 0.843
AlaGlu: 2.767 ± 1.85
AlaPhe: 1.581 ± 0.612
AlaGly: 2.372 ± 0.923
AlaHis: 2.372 ± 0.766
AlaIle: 1.581 ± 0.418
AlaLys: 3.557 ± 1.65
AlaLeu: 7.115 ± 3.534
AlaMet: 0.395 ± 0.288
AlaAsn: 2.372 ± 1.015
AlaPro: 1.976 ± 1.156
AlaGln: 0.395 ± 0.487
AlaArg: 4.743 ± 1.479
AlaSer: 2.372 ± 0.967
AlaThr: 1.976 ± 0.8
AlaVal: 5.929 ± 1.115
AlaTrp: 0.395 ± 0.288
AlaTyr: 3.162 ± 1.223
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.186 ± 0.617
CysCys: 0.395 ± 0.288
CysAsp: 1.186 ± 0.662
CysGlu: 0.395 ± 0.336
CysPhe: 2.767 ± 1.302
CysGly: 1.581 ± 0.707
CysHis: 1.186 ± 0.718
CysIle: 2.372 ± 1.436
CysLys: 1.976 ± 0.857
CysLeu: 5.534 ± 2.078
CysMet: 0.791 ± 0.501
CysAsn: 1.186 ± 0.677
CysPro: 2.767 ± 1.041
CysGln: 1.186 ± 0.731
CysArg: 0.395 ± 0.288
CysSer: 0.791 ± 0.567
CysThr: 0.395 ± 0.288
CysVal: 0.395 ± 0.336
CysTrp: 1.581 ± 0.612
CysTyr: 1.186 ± 0.718
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.395 ± 0.336
AspCys: 0.0 ± 0.0
AspAsp: 1.976 ± 0.631
AspGlu: 2.372 ± 0.705
AspPhe: 1.581 ± 0.791
AspGly: 3.953 ± 0.563
AspHis: 0.0 ± 0.0
AspIle: 3.557 ± 0.577
AspLys: 3.953 ± 1.292
AspLeu: 6.324 ± 1.448
AspMet: 0.791 ± 0.346
AspAsn: 0.791 ± 0.344
AspPro: 3.953 ± 0.531
AspGln: 1.581 ± 0.707
AspArg: 2.767 ± 0.971
AspSer: 1.976 ± 0.428
AspThr: 1.186 ± 0.863
AspVal: 1.186 ± 0.538
AspTrp: 0.791 ± 0.719
AspTyr: 0.791 ± 0.575
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.719 ± 2.529
GluCys: 1.186 ± 0.718
GluAsp: 3.953 ± 1.291
GluGlu: 9.486 ± 2.606
GluPhe: 1.186 ± 0.863
GluGly: 3.162 ± 0.753
GluHis: 0.395 ± 0.288
GluIle: 5.138 ± 0.762
GluLys: 5.138 ± 0.957
GluLeu: 5.929 ± 0.961
GluMet: 0.395 ± 0.336
GluAsn: 5.138 ± 0.948
GluPro: 1.186 ± 0.538
GluGln: 2.372 ± 1.324
GluArg: 2.372 ± 0.591
GluSer: 3.557 ± 0.809
GluThr: 2.372 ± 0.983
GluVal: 7.115 ± 1.451
GluTrp: 3.162 ± 1.215
GluTyr: 0.395 ± 0.288
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.372 ± 1.34
PheCys: 3.953 ± 2.012
PheAsp: 2.767 ± 0.971
PheGlu: 4.348 ± 1.374
PhePhe: 1.186 ± 0.863
PheGly: 3.953 ± 1.136
PheHis: 0.791 ± 0.575
PheIle: 0.395 ± 0.288
PheLys: 1.186 ± 0.538
PheLeu: 5.138 ± 1.561
PheMet: 0.791 ± 0.593
PheAsn: 0.0 ± 0.0
PhePro: 0.791 ± 0.575
PheGln: 2.767 ± 0.695
PheArg: 0.791 ± 0.575
PheSer: 2.372 ± 1.338
PheThr: 2.767 ± 1.275
PheVal: 2.372 ± 0.644
PheTrp: 0.0 ± 0.0
PheTyr: 0.395 ± 0.336
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.372 ± 0.591
GlyCys: 3.162 ± 1.223
GlyAsp: 1.581 ± 0.655
GlyGlu: 3.953 ± 0.988
GlyPhe: 1.581 ± 0.689
GlyGly: 5.534 ± 1.242
GlyHis: 1.581 ± 0.629
GlyIle: 5.534 ± 1.856
GlyLys: 1.976 ± 0.94
GlyLeu: 7.905 ± 2.538
GlyMet: 0.791 ± 0.561
GlyAsn: 1.976 ± 0.632
GlyPro: 2.372 ± 1.033
GlyGln: 1.581 ± 0.689
GlyArg: 2.767 ± 0.908
GlySer: 6.719 ± 1.112
GlyThr: 5.138 ± 1.185
GlyVal: 1.581 ± 0.931
GlyTrp: 0.395 ± 0.487
GlyTyr: 1.581 ± 0.695
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.395 ± 0.288
HisCys: 1.581 ± 0.612
HisAsp: 0.0 ± 0.0
HisGlu: 2.767 ± 0.971
HisPhe: 2.372 ± 0.926
HisGly: 0.0 ± 0.0
HisHis: 1.186 ± 0.506
HisIle: 0.395 ± 0.371
HisLys: 0.0 ± 0.0
HisLeu: 3.162 ± 1.172
HisMet: 0.395 ± 0.336
HisAsn: 0.0 ± 0.0
HisPro: 1.976 ± 0.631
HisGln: 0.395 ± 0.288
HisArg: 0.395 ± 0.288
HisSer: 2.767 ± 0.815
HisThr: 0.791 ± 0.579
HisVal: 1.186 ± 0.662
HisTrp: 1.186 ± 0.718
HisTyr: 0.791 ± 0.575
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.743 ± 1.398
IleCys: 1.186 ± 0.538
IleAsp: 1.186 ± 0.787
IleGlu: 5.929 ± 1.01
IlePhe: 1.581 ± 0.647
IleGly: 1.581 ± 0.931
IleHis: 0.791 ± 0.561
IleIle: 1.976 ± 0.79
IleLys: 5.138 ± 1.013
IleLeu: 2.767 ± 0.797
IleMet: 1.976 ± 0.428
IleAsn: 1.186 ± 1.008
IlePro: 6.324 ± 1.744
IleGln: 2.372 ± 1.431
IleArg: 3.557 ± 1.244
IleSer: 2.372 ± 1.53
IleThr: 3.162 ± 1.166
IleVal: 3.557 ± 0.875
IleTrp: 0.0 ± 0.0
IleTyr: 0.395 ± 0.288
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.162 ± 0.914
LysCys: 1.581 ± 0.689
LysAsp: 1.976 ± 0.503
LysGlu: 4.743 ± 0.792
LysPhe: 2.372 ± 0.766
LysGly: 4.743 ± 0.741
LysHis: 0.791 ± 0.575
LysIle: 0.791 ± 0.672
LysLys: 8.696 ± 2.222
LysLeu: 8.3 ± 1.35
LysMet: 0.395 ± 0.393
LysAsn: 1.581 ± 0.791
LysPro: 3.162 ± 0.786
LysGln: 1.976 ± 0.503
LysArg: 4.348 ± 0.723
LysSer: 4.348 ± 1.067
LysThr: 5.534 ± 1.378
LysVal: 4.348 ± 0.666
LysTrp: 0.791 ± 0.561
LysTyr: 1.976 ± 0.531
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.138 ± 1.808
LeuCys: 2.767 ± 0.777
LeuAsp: 5.929 ± 1.41
LeuGlu: 10.277 ± 2.13
LeuPhe: 4.743 ± 2.019
LeuGly: 4.348 ± 1.259
LeuHis: 2.372 ± 0.478
LeuIle: 9.881 ± 1.451
LeuLys: 3.557 ± 0.561
LeuLeu: 16.206 ± 1.666
LeuMet: 1.581 ± 0.602
LeuAsn: 5.138 ± 0.797
LeuPro: 5.929 ± 2.097
LeuGln: 9.091 ± 1.101
LeuArg: 5.929 ± 1.59
LeuSer: 4.743 ± 1.881
LeuThr: 4.743 ± 1.471
LeuVal: 3.162 ± 1.215
LeuTrp: 3.557 ± 1.209
LeuTyr: 4.348 ± 0.953
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.581 ± 0.655
MetCys: 0.0 ± 0.0
MetAsp: 3.557 ± 1.297
MetGlu: 0.791 ± 0.672
MetPhe: 1.581 ± 0.579
MetGly: 0.791 ± 0.561
MetHis: 0.0 ± 0.0
MetIle: 0.791 ± 0.501
MetLys: 3.557 ± 1.297
MetLeu: 1.186 ± 0.538
MetMet: 1.581 ± 0.612
MetAsn: 0.395 ± 0.288
MetPro: 1.186 ± 0.617
MetGln: 0.791 ± 0.501
MetArg: 0.791 ± 0.505
MetSer: 2.767 ± 1.064
MetThr: 1.976 ± 0.967
MetVal: 0.0 ± 0.0
MetTrp: 0.395 ± 0.336
MetTyr: 0.791 ± 0.501
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.791 ± 0.575
AsnCys: 0.395 ± 0.288
AsnAsp: 0.791 ± 0.672
AsnGlu: 1.581 ± 0.931
AsnPhe: 2.767 ± 1.302
AsnGly: 3.557 ± 1.144
AsnHis: 0.395 ± 0.371
AsnIle: 4.348 ± 0.863
AsnLys: 1.976 ± 1.438
AsnLeu: 3.557 ± 0.738
AsnMet: 1.186 ± 0.49
AsnAsn: 1.186 ± 0.718
AsnPro: 1.976 ± 0.94
AsnGln: 1.581 ± 0.629
AsnArg: 0.395 ± 0.487
AsnSer: 5.929 ± 1.475
AsnThr: 1.581 ± 1.072
AsnVal: 1.581 ± 1.151
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.186 ± 0.751
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.138 ± 1.724
ProCys: 1.581 ± 0.612
ProAsp: 5.534 ± 0.894
ProGlu: 1.186 ± 0.538
ProPhe: 1.186 ± 0.863
ProGly: 2.767 ± 1.007
ProHis: 0.791 ± 0.501
ProIle: 1.976 ± 0.579
ProLys: 7.51 ± 1.475
ProLeu: 7.905 ± 2.052
ProMet: 0.791 ± 0.672
ProAsn: 0.791 ± 0.575
ProPro: 7.905 ± 2.592
ProGln: 1.581 ± 0.612
ProArg: 3.162 ± 1.037
ProSer: 5.929 ± 1.422
ProThr: 3.557 ± 0.572
ProVal: 7.51 ± 1.674
ProTrp: 0.395 ± 0.523
ProTyr: 1.581 ± 0.791
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.581 ± 0.956
GlnCys: 1.581 ± 0.629
GlnAsp: 1.976 ± 0.579
GlnGlu: 5.138 ± 0.618
GlnPhe: 1.186 ± 0.787
GlnGly: 3.162 ± 1.221
GlnHis: 1.186 ± 0.718
GlnIle: 1.186 ± 0.538
GlnLys: 2.767 ± 0.76
GlnLeu: 3.953 ± 1.24
GlnMet: 1.581 ± 0.427
GlnAsn: 1.976 ± 1.045
GlnPro: 1.581 ± 0.791
GlnGln: 0.0 ± 0.0
GlnArg: 3.162 ± 0.622
GlnSer: 3.162 ± 1.121
GlnThr: 5.929 ± 1.097
GlnVal: 1.581 ± 1.001
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.581 ± 0.612
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.767 ± 1.134
ArgCys: 1.581 ± 0.707
ArgAsp: 1.186 ± 0.863
ArgGlu: 1.976 ± 0.503
ArgPhe: 1.186 ± 0.863
ArgGly: 3.557 ± 0.721
ArgHis: 4.348 ± 1.525
ArgIle: 1.976 ± 0.955
ArgLys: 1.581 ± 0.629
ArgLeu: 1.581 ± 1.063
ArgMet: 2.372 ± 1.024
ArgAsn: 2.767 ± 0.803
ArgPro: 2.372 ± 0.849
ArgGln: 1.976 ± 0.955
ArgArg: 1.976 ± 0.949
ArgSer: 5.534 ± 1.815
ArgThr: 1.186 ± 0.731
ArgVal: 5.534 ± 0.589
ArgTrp: 0.791 ± 0.719
ArgTyr: 1.581 ± 1.344
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.348 ± 1.104
SerCys: 1.976 ± 0.977
SerAsp: 1.186 ± 0.538
SerGlu: 5.138 ± 0.389
SerPhe: 3.557 ± 1.03
SerGly: 4.743 ± 1.162
SerHis: 1.186 ± 0.717
SerIle: 3.557 ± 1.01
SerLys: 3.557 ± 0.792
SerLeu: 8.696 ± 2.152
SerMet: 1.186 ± 0.5
SerAsn: 5.534 ± 1.297
SerPro: 3.953 ± 1.144
SerGln: 4.743 ± 1.529
SerArg: 3.557 ± 1.083
SerSer: 9.091 ± 2.609
SerThr: 5.138 ± 1.076
SerVal: 2.767 ± 1.13
SerTrp: 0.0 ± 0.0
SerTyr: 2.767 ± 0.892
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.372 ± 1.452
ThrCys: 1.186 ± 0.538
ThrAsp: 1.186 ± 0.538
ThrGlu: 3.557 ± 1.279
ThrPhe: 0.791 ± 0.505
ThrGly: 4.348 ± 1.422
ThrHis: 0.0 ± 0.0
ThrIle: 1.581 ± 0.782
ThrLys: 2.372 ± 1.187
ThrLeu: 7.51 ± 1.172
ThrMet: 1.186 ± 0.603
ThrAsn: 1.581 ± 0.931
ThrPro: 8.696 ± 1.61
ThrGln: 3.162 ± 0.753
ThrArg: 3.162 ± 0.836
ThrSer: 3.953 ± 1.07
ThrThr: 4.348 ± 1.48
ThrVal: 3.162 ± 1.262
ThrTrp: 1.186 ± 0.512
ThrTyr: 0.791 ± 0.501
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.581 ± 0.956
ValCys: 0.0 ± 0.0
ValAsp: 1.186 ± 0.538
ValGlu: 2.372 ± 1.233
ValPhe: 0.791 ± 0.505
ValGly: 3.557 ± 1.044
ValHis: 0.395 ± 0.288
ValIle: 3.162 ± 1.827
ValLys: 1.976 ± 0.531
ValLeu: 8.696 ± 1.394
ValMet: 4.743 ± 1.615
ValAsn: 2.767 ± 1.539
ValPro: 7.905 ± 1.753
ValGln: 2.767 ± 0.666
ValArg: 1.976 ± 0.94
ValSer: 5.534 ± 0.891
ValThr: 3.557 ± 0.687
ValVal: 0.791 ± 0.344
ValTrp: 1.186 ± 0.718
ValTyr: 0.395 ± 0.336
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.186 ± 0.506
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.395 ± 0.336
TrpPhe: 2.372 ± 1.436
TrpGly: 1.186 ± 0.718
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.581 ± 0.612
TrpLeu: 1.186 ± 0.944
TrpMet: 0.791 ± 0.501
TrpAsn: 0.0 ± 0.0
TrpPro: 0.395 ± 0.336
TrpGln: 2.767 ± 1.302
TrpArg: 1.186 ± 0.718
TrpSer: 0.395 ± 0.487
TrpThr: 0.395 ± 0.288
TrpVal: 0.791 ± 0.719
TrpTrp: 2.372 ± 1.436
TrpTyr: 1.186 ± 0.538
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.372 ± 0.478
TyrCys: 1.976 ± 0.631
TyrAsp: 0.791 ± 0.575
TyrGlu: 1.186 ± 0.751
TyrPhe: 3.162 ± 0.834
TyrGly: 1.186 ± 0.506
TyrHis: 1.976 ± 0.531
TyrIle: 1.581 ± 0.533
TyrLys: 3.557 ± 1.095
TyrLeu: 0.791 ± 0.575
TyrMet: 0.395 ± 0.486
TyrAsn: 0.395 ± 0.288
TyrPro: 2.372 ± 0.759
TyrGln: 1.186 ± 0.49
TyrArg: 0.395 ± 0.336
TyrSer: 2.372 ± 1.033
TyrThr: 0.395 ± 0.288
TyrVal: 0.791 ± 0.344
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.395 ± 0.336
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2531 amino acids)

Note: The errors were estimated by bootstrapping (100 resamples) at the protein level.
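A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions of this example; the page does not state which estimator was actually used.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Frequencies (per mille) of overlapping residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Made-up sequences in one-letter code, not taken from this proteome.
toy_proteins = ["MALLKLL", "MKLLA", "MAAGK"]
print(bootstrap_error(toy_proteins, "LL"))
print(bootstrap_error(toy_proteins, "WW"))  # absent dipeptide, so no spread
```

A dipeptide that never occurs has a frequency of 0 in every resample, which is consistent with the "0.0 ± 0.0" entries in the table.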

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski