Amino acid dipepetide frequency for Circulifer tenellus virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.426AlaAla: 13.426 ± 5.444
1.627AlaCys: 1.627 ± 0.468
4.475AlaAsp: 4.475 ± 0.038
5.289AlaGlu: 5.289 ± 1.673
0.814AlaPhe: 0.814 ± 0.044
8.137AlaGly: 8.137 ± 2.104
2.441AlaHis: 2.441 ± 0.131
6.916AlaIle: 6.916 ± 1.574
4.068AlaLys: 4.068 ± 0.893
13.019AlaLeu: 13.019 ± 0.698
2.441AlaMet: 2.441 ± 0.425
4.882AlaAsn: 4.882 ± 0.294
8.137AlaPro: 8.137 ± 3.215
5.289AlaGln: 5.289 ± 1.117
4.882AlaArg: 4.882 ± 0.85
7.73AlaSer: 7.73 ± 0.975
4.475AlaThr: 4.475 ± 0.038
6.916AlaVal: 6.916 ± 1.76
2.034AlaTrp: 2.034 ± 0.169
2.848AlaTyr: 2.848 ± 1.237
0.0AlaXaa: 0.0 ± 0.0
Cys
1.221CysAla: 1.221 ± 0.768
0.0CysCys: 0.0 ± 0.0
1.221CysAsp: 1.221 ± 0.768
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.407CysGly: 0.407 ± 0.256
0.407CysHis: 0.407 ± 0.256
0.814CysIle: 0.814 ± 0.512
0.0CysLys: 0.0 ± 0.0
1.221CysLeu: 1.221 ± 0.212
0.0CysMet: 0.0 ± 0.0
0.814CysAsn: 0.814 ± 0.044
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
2.034CysSer: 2.034 ± 0.725
0.814CysThr: 0.814 ± 0.044
0.814CysVal: 0.814 ± 0.512
0.0CysTrp: 0.0 ± 0.0
0.814CysTyr: 0.814 ± 0.512
0.0CysXaa: 0.0 ± 0.0
Asp
4.882AspAla: 4.882 ± 0.262
0.0AspCys: 0.0 ± 0.0
2.034AspAsp: 2.034 ± 0.169
1.221AspGlu: 1.221 ± 0.212
1.221AspPhe: 1.221 ± 0.343
3.662AspGly: 3.662 ± 0.637
2.441AspHis: 2.441 ± 0.131
2.848AspIle: 2.848 ± 0.681
1.221AspLys: 1.221 ± 0.343
4.882AspLeu: 4.882 ± 0.85
1.627AspMet: 1.627 ± 0.468
2.848AspAsn: 2.848 ± 0.431
3.662AspPro: 3.662 ± 1.193
2.441AspGln: 2.441 ± 1.242
1.221AspArg: 1.221 ± 0.768
1.221AspSer: 1.221 ± 0.768
1.627AspThr: 1.627 ± 0.468
4.882AspVal: 4.882 ± 0.85
1.221AspTrp: 1.221 ± 0.768
2.441AspTyr: 2.441 ± 0.687
0.0AspXaa: 0.0 ± 0.0
Glu
5.289GluAla: 5.289 ± 2.785
1.221GluCys: 1.221 ± 0.212
2.034GluAsp: 2.034 ± 0.169
2.034GluGlu: 2.034 ± 0.387
0.0GluPhe: 0.0 ± 0.0
2.034GluGly: 2.034 ± 0.387
0.814GluHis: 0.814 ± 0.599
0.407GluIle: 0.407 ± 0.3
2.441GluLys: 2.441 ± 0.131
4.068GluLeu: 4.068 ± 0.338
1.627GluMet: 1.627 ± 0.087
1.221GluAsn: 1.221 ± 0.212
2.441GluPro: 2.441 ± 1.242
2.441GluGln: 2.441 ± 0.425
2.441GluArg: 2.441 ± 0.425
2.441GluSer: 2.441 ± 0.131
0.814GluThr: 0.814 ± 0.044
2.034GluVal: 2.034 ± 0.725
2.441GluTrp: 2.441 ± 0.131
2.441GluTyr: 2.441 ± 0.981
0.0GluXaa: 0.0 ± 0.0
Phe
0.814PheAla: 0.814 ± 0.044
0.0PheCys: 0.0 ± 0.0
1.627PheAsp: 1.627 ± 0.468
1.221PheGlu: 1.221 ± 0.343
0.814PhePhe: 0.814 ± 0.044
2.848PheGly: 2.848 ± 0.431
0.407PheHis: 0.407 ± 0.256
1.221PheIle: 1.221 ± 0.343
1.627PheLys: 1.627 ± 1.024
2.441PheLeu: 2.441 ± 1.242
0.407PheMet: 0.407 ± 0.3
0.407PheAsn: 0.407 ± 0.256
2.848PhePro: 2.848 ± 0.986
2.848PheGln: 2.848 ± 0.125
1.221PheArg: 1.221 ± 0.343
2.034PheSer: 2.034 ± 0.169
2.848PheThr: 2.848 ± 0.125
0.0PheVal: 0.0 ± 0.0
0.407PheTrp: 0.407 ± 0.256
2.034PheTyr: 2.034 ± 0.387
0.0PheXaa: 0.0 ± 0.0
Gly
8.95GlyAla: 8.95 ± 2.703
1.221GlyCys: 1.221 ± 0.768
4.475GlyAsp: 4.475 ± 1.149
5.696GlyGlu: 5.696 ± 0.861
2.441GlyPhe: 2.441 ± 0.425
8.137GlyGly: 8.137 ± 0.119
1.627GlyHis: 1.627 ± 1.024
2.034GlyIle: 2.034 ± 0.943
1.221GlyLys: 1.221 ± 0.212
5.289GlyLeu: 5.289 ± 2.217
1.221GlyMet: 1.221 ± 0.343
2.848GlyAsn: 2.848 ± 0.986
3.662GlyPro: 3.662 ± 1.586
3.255GlyGln: 3.255 ± 0.73
3.662GlyArg: 3.662 ± 0.637
4.882GlySer: 4.882 ± 0.294
3.662GlyThr: 3.662 ± 1.03
4.882GlyVal: 4.882 ± 1.373
2.441GlyTrp: 2.441 ± 0.425
2.034GlyTyr: 2.034 ± 0.725
0.0GlyXaa: 0.0 ± 0.0
His
4.475HisAla: 4.475 ± 0.518
0.407HisCys: 0.407 ± 0.256
1.221HisAsp: 1.221 ± 0.343
2.034HisGlu: 2.034 ± 0.169
1.221HisPhe: 1.221 ± 0.212
3.255HisGly: 3.255 ± 1.493
0.814HisHis: 0.814 ± 0.044
2.034HisIle: 2.034 ± 0.725
0.814HisLys: 0.814 ± 0.512
2.848HisLeu: 2.848 ± 0.681
0.407HisMet: 0.407 ± 0.256
3.255HisAsn: 3.255 ± 1.286
2.034HisPro: 2.034 ± 0.943
1.627HisGln: 1.627 ± 0.468
1.221HisArg: 1.221 ± 0.343
1.627HisSer: 1.627 ± 0.643
1.221HisThr: 1.221 ± 0.212
0.407HisVal: 0.407 ± 0.256
0.0HisTrp: 0.0 ± 0.0
0.407HisTyr: 0.407 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
3.255IleAla: 3.255 ± 1.493
0.814IleCys: 0.814 ± 0.044
2.441IleAsp: 2.441 ± 0.425
2.848IleGlu: 2.848 ± 0.681
1.627IlePhe: 1.627 ± 0.643
4.068IleGly: 4.068 ± 2.441
2.848IleHis: 2.848 ± 0.431
2.034IleIle: 2.034 ± 0.387
0.814IleLys: 0.814 ± 0.512
4.068IleLeu: 4.068 ± 0.893
0.0IleMet: 0.0 ± 0.0
2.034IleAsn: 2.034 ± 0.387
2.441IlePro: 2.441 ± 0.687
2.848IleGln: 2.848 ± 0.125
2.848IleArg: 2.848 ± 1.237
3.662IleSer: 3.662 ± 0.637
1.627IleThr: 1.627 ± 0.468
1.627IleVal: 1.627 ± 0.087
0.814IleTrp: 0.814 ± 0.044
2.034IleTyr: 2.034 ± 0.169
0.0IleXaa: 0.0 ± 0.0
Lys
3.662LysAla: 3.662 ± 0.082
1.221LysCys: 1.221 ± 0.768
1.627LysAsp: 1.627 ± 0.087
0.0LysGlu: 0.0 ± 0.0
1.221LysPhe: 1.221 ± 0.212
2.441LysGly: 2.441 ± 0.981
0.407LysHis: 0.407 ± 0.256
0.814LysIle: 0.814 ± 0.044
0.0LysLys: 0.0 ± 0.0
1.221LysLeu: 1.221 ± 0.343
0.814LysMet: 0.814 ± 0.512
0.814LysAsn: 0.814 ± 0.044
1.627LysPro: 1.627 ± 0.087
0.814LysGln: 0.814 ± 0.044
2.848LysArg: 2.848 ± 1.792
2.848LysSer: 2.848 ± 0.681
1.221LysThr: 1.221 ± 0.768
2.441LysVal: 2.441 ± 0.425
0.407LysTrp: 0.407 ± 0.256
1.627LysTyr: 1.627 ± 0.087
0.0LysXaa: 0.0 ± 0.0
Leu
10.578LeuAla: 10.578 ± 1.1
0.407LeuCys: 0.407 ± 0.256
4.475LeuAsp: 4.475 ± 0.594
2.848LeuGlu: 2.848 ± 0.125
3.255LeuPhe: 3.255 ± 0.175
7.73LeuGly: 7.73 ± 0.975
2.441LeuHis: 2.441 ± 0.981
4.475LeuIle: 4.475 ± 0.594
2.441LeuLys: 2.441 ± 0.425
10.171LeuLeu: 10.171 ± 0.844
1.221LeuMet: 1.221 ± 0.899
6.916LeuAsn: 6.916 ± 0.463
9.357LeuPro: 9.357 ± 0.888
4.068LeuGln: 4.068 ± 0.774
8.544LeuArg: 8.544 ± 0.931
5.289LeuSer: 5.289 ± 1.106
3.662LeuThr: 3.662 ± 0.474
5.696LeuVal: 5.696 ± 0.305
1.627LeuTrp: 1.627 ± 0.087
3.255LeuTyr: 3.255 ± 0.381
0.0LeuXaa: 0.0 ± 0.0
Met
4.068MetAla: 4.068 ± 1.449
0.407MetCys: 0.407 ± 0.256
0.0MetAsp: 0.0 ± 0.0
0.814MetGlu: 0.814 ± 0.044
1.627MetPhe: 1.627 ± 0.468
1.221MetGly: 1.221 ± 0.343
0.407MetHis: 0.407 ± 0.256
0.407MetIle: 0.407 ± 0.256
0.407MetLys: 0.407 ± 0.3
0.814MetLeu: 0.814 ± 0.512
0.407MetMet: 0.407 ± 0.256
0.0MetAsn: 0.0 ± 0.0
1.627MetPro: 1.627 ± 0.468
0.407MetGln: 0.407 ± 0.256
2.034MetArg: 2.034 ± 0.943
1.221MetSer: 1.221 ± 0.343
0.814MetThr: 0.814 ± 0.044
1.221MetVal: 1.221 ± 0.212
0.0MetTrp: 0.0 ± 0.0
1.221MetTyr: 1.221 ± 0.212
0.0MetXaa: 0.0 ± 0.0
Asn
4.882AsnAla: 4.882 ± 0.294
0.0AsnCys: 0.0 ± 0.0
2.441AsnAsp: 2.441 ± 0.131
1.627AsnGlu: 1.627 ± 0.643
2.034AsnPhe: 2.034 ± 0.943
2.441AsnGly: 2.441 ± 0.687
0.407AsnHis: 0.407 ± 0.256
3.255AsnIle: 3.255 ± 1.286
0.0AsnLys: 0.0 ± 0.0
6.103AsnLeu: 6.103 ± 1.161
0.407AsnMet: 0.407 ± 0.256
1.627AsnAsn: 1.627 ± 0.087
4.475AsnPro: 4.475 ± 1.074
2.034AsnGln: 2.034 ± 0.943
2.441AsnArg: 2.441 ± 0.425
0.814AsnSer: 0.814 ± 0.599
2.441AsnThr: 2.441 ± 1.536
4.475AsnVal: 4.475 ± 0.518
1.221AsnTrp: 1.221 ± 0.212
0.407AsnTyr: 0.407 ± 0.256
0.0AsnXaa: 0.0 ± 0.0
Pro
6.509ProAla: 6.509 ± 4.239
0.407ProCys: 0.407 ± 0.3
5.289ProAsp: 5.289 ± 0.55
2.441ProGlu: 2.441 ± 0.131
1.221ProPhe: 1.221 ± 0.899
2.848ProGly: 2.848 ± 0.986
3.662ProHis: 3.662 ± 1.03
2.034ProIle: 2.034 ± 0.725
2.441ProLys: 2.441 ± 0.425
8.137ProLeu: 8.137 ± 1.548
1.221ProMet: 1.221 ± 0.122
3.255ProAsn: 3.255 ± 1.842
10.985ProPro: 10.985 ± 6.135
6.103ProGln: 6.103 ± 2.828
3.255ProArg: 3.255 ± 0.937
7.73ProSer: 7.73 ± 1.804
7.73ProThr: 7.73 ± 0.692
4.882ProVal: 4.882 ± 0.85
0.407ProTrp: 0.407 ± 0.3
1.221ProTyr: 1.221 ± 0.343
0.0ProXaa: 0.0 ± 0.0
Gln
5.289GlnAla: 5.289 ± 0.55
0.0GlnCys: 0.0 ± 0.0
2.441GlnAsp: 2.441 ± 1.242
2.034GlnGlu: 2.034 ± 0.169
0.407GlnPhe: 0.407 ± 0.3
4.068GlnGly: 4.068 ± 0.218
1.627GlnHis: 1.627 ± 0.087
2.034GlnIle: 2.034 ± 0.387
0.814GlnLys: 0.814 ± 0.512
4.068GlnLeu: 4.068 ± 0.893
1.221GlnMet: 1.221 ± 0.212
1.221GlnAsn: 1.221 ± 0.343
5.696GlnPro: 5.696 ± 3.64
1.221GlnGln: 1.221 ± 0.343
2.034GlnArg: 2.034 ± 0.387
3.662GlnSer: 3.662 ± 1.03
3.255GlnThr: 3.255 ± 0.73
2.441GlnVal: 2.441 ± 1.798
2.034GlnTrp: 2.034 ± 0.387
1.221GlnTyr: 1.221 ± 0.212
0.0GlnXaa: 0.0 ± 0.0
Arg
8.544ArgAla: 8.544 ± 2.043
0.814ArgCys: 0.814 ± 0.512
0.814ArgAsp: 0.814 ± 0.044
2.441ArgGlu: 2.441 ± 0.425
2.441ArgPhe: 2.441 ± 0.131
4.475ArgGly: 4.475 ± 1.705
5.289ArgHis: 5.289 ± 1.106
1.627ArgIle: 1.627 ± 0.087
2.034ArgLys: 2.034 ± 0.169
4.475ArgLeu: 4.475 ± 0.518
0.0ArgMet: 0.0 ± 0.0
1.627ArgAsn: 1.627 ± 0.468
3.662ArgPro: 3.662 ± 0.474
2.848ArgGln: 2.848 ± 0.431
5.289ArgArg: 5.289 ± 1.106
5.696ArgSer: 5.696 ± 1.362
0.407ArgThr: 0.407 ± 0.256
5.696ArgVal: 5.696 ± 0.806
0.814ArgTrp: 0.814 ± 0.044
2.034ArgTyr: 2.034 ± 0.169
0.0ArgXaa: 0.0 ± 0.0
Ser
5.696SerAla: 5.696 ± 1.973
0.407SerCys: 0.407 ± 0.256
3.255SerAsp: 3.255 ± 0.937
3.255SerGlu: 3.255 ± 0.175
1.221SerPhe: 1.221 ± 0.212
6.509SerGly: 6.509 ± 0.762
0.407SerHis: 0.407 ± 0.3
3.662SerIle: 3.662 ± 0.082
2.848SerLys: 2.848 ± 1.237
6.509SerLeu: 6.509 ± 2.43
1.221SerMet: 1.221 ± 0.768
4.068SerAsn: 4.068 ± 0.893
5.696SerPro: 5.696 ± 1.417
3.662SerGln: 3.662 ± 0.474
6.916SerArg: 6.916 ± 0.649
6.509SerSer: 6.509 ± 1.318
4.068SerThr: 4.068 ± 1.449
2.034SerVal: 2.034 ± 0.725
0.814SerTrp: 0.814 ± 0.044
1.627SerTyr: 1.627 ± 0.087
0.0SerXaa: 0.0 ± 0.0
Thr
3.662ThrAla: 3.662 ± 0.082
0.0ThrCys: 0.0 ± 0.0
2.848ThrAsp: 2.848 ± 1.237
0.814ThrGlu: 0.814 ± 0.044
2.034ThrPhe: 2.034 ± 0.169
2.034ThrGly: 2.034 ± 0.943
0.407ThrHis: 0.407 ± 0.3
2.441ThrIle: 2.441 ± 0.425
2.441ThrLys: 2.441 ± 0.981
5.696ThrLeu: 5.696 ± 0.25
2.441ThrMet: 2.441 ± 0.981
1.627ThrAsn: 1.627 ± 0.468
6.103ThrPro: 6.103 ± 1.161
2.848ThrGln: 2.848 ± 0.986
3.255ThrArg: 3.255 ± 1.493
3.255ThrSer: 3.255 ± 0.937
3.662ThrThr: 3.662 ± 0.474
2.441ThrVal: 2.441 ± 0.131
0.814ThrTrp: 0.814 ± 0.044
1.221ThrTyr: 1.221 ± 0.212
0.0ThrXaa: 0.0 ± 0.0
Val
7.73ValAla: 7.73 ± 0.137
0.814ValCys: 0.814 ± 0.512
1.627ValAsp: 1.627 ± 0.087
2.441ValGlu: 2.441 ± 0.425
3.662ValPhe: 3.662 ± 0.082
3.662ValGly: 3.662 ± 0.082
2.848ValHis: 2.848 ± 0.986
2.441ValIle: 2.441 ± 0.131
1.627ValLys: 1.627 ± 0.643
6.103ValLeu: 6.103 ± 2.174
1.627ValMet: 1.627 ± 0.087
2.848ValAsn: 2.848 ± 1.542
4.882ValPro: 4.882 ± 0.818
0.814ValGln: 0.814 ± 0.044
4.882ValArg: 4.882 ± 0.262
4.882ValSer: 4.882 ± 1.405
2.034ValThr: 2.034 ± 0.169
3.255ValVal: 3.255 ± 0.73
1.627ValTrp: 1.627 ± 0.468
2.034ValTyr: 2.034 ± 0.169
0.0ValXaa: 0.0 ± 0.0
Trp
4.068TrpAla: 4.068 ± 0.893
0.407TrpCys: 0.407 ± 0.256
0.407TrpAsp: 0.407 ± 0.256
0.814TrpGlu: 0.814 ± 0.599
0.407TrpPhe: 0.407 ± 0.3
1.221TrpGly: 1.221 ± 0.899
0.407TrpHis: 0.407 ± 0.256
0.814TrpIle: 0.814 ± 0.599
0.407TrpLys: 0.407 ± 0.256
3.662TrpLeu: 3.662 ± 0.082
0.407TrpMet: 0.407 ± 0.256
0.0TrpAsn: 0.0 ± 0.0
0.407TrpPro: 0.407 ± 0.256
0.407TrpGln: 0.407 ± 0.256
0.814TrpArg: 0.814 ± 0.512
1.627TrpSer: 1.627 ± 0.087
1.221TrpThr: 1.221 ± 0.343
2.441TrpVal: 2.441 ± 0.981
0.0TrpTrp: 0.0 ± 0.0
0.407TrpTyr: 0.407 ± 0.3
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.848TyrAla: 2.848 ± 0.125
0.407TyrCys: 0.407 ± 0.256
2.848TyrAsp: 2.848 ± 0.431
0.814TyrGlu: 0.814 ± 0.044
0.407TyrPhe: 0.407 ± 0.256
2.441TyrGly: 2.441 ± 0.131
1.221TyrHis: 1.221 ± 0.768
2.441TyrIle: 2.441 ± 0.425
0.407TyrLys: 0.407 ± 0.256
3.662TyrLeu: 3.662 ± 0.082
0.0TyrMet: 0.0 ± 0.163
1.221TyrAsn: 1.221 ± 0.899
2.034TyrPro: 2.034 ± 0.864
0.814TyrGln: 0.814 ± 0.044
1.221TyrArg: 1.221 ± 0.212
1.221TyrSer: 1.221 ± 0.212
2.441TyrThr: 2.441 ± 1.536
3.255TyrVal: 3.255 ± 0.381
1.221TyrTrp: 1.221 ± 0.343
1.221TyrTyr: 1.221 ± 0.343
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2459 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski