Amino acid dipepetide frequency for Simian T-lymphotropic virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.101AlaAla: 6.101 ± 0.04
0.555AlaCys: 0.555 ± 0.322
1.109AlaAsp: 1.109 ± 0.408
1.109AlaGlu: 1.109 ± 0.608
4.437AlaPhe: 4.437 ± 1.157
2.773AlaGly: 2.773 ± 1.323
2.219AlaHis: 2.219 ± 0.824
6.656AlaIle: 6.656 ± 0.746
1.109AlaLys: 1.109 ± 0.608
12.757AlaLeu: 12.757 ± 1.844
0.0AlaMet: 0.0 ± 0.0
2.773AlaAsn: 2.773 ± 1.898
6.101AlaPro: 6.101 ± 2.673
3.328AlaGln: 3.328 ± 1.105
2.773AlaArg: 2.773 ± 0.828
4.437AlaSer: 4.437 ± 1.431
3.882AlaThr: 3.882 ± 1.642
1.664AlaVal: 1.664 ± 0.925
1.109AlaTrp: 1.109 ± 0.408
1.109AlaTyr: 1.109 ± 0.608
0.0AlaXaa: 0.0 ± 0.0
Cys
1.109CysAla: 1.109 ± 0.816
0.555CysCys: 0.555 ± 0.56
0.0CysAsp: 0.0 ± 0.0
0.555CysGlu: 0.555 ± 0.322
1.109CysPhe: 1.109 ± 0.408
0.555CysGly: 0.555 ± 0.56
0.0CysHis: 0.0 ± 0.0
1.664CysIle: 1.664 ± 1.679
1.109CysLys: 1.109 ± 0.408
2.219CysLeu: 2.219 ± 0.824
0.555CysMet: 0.555 ± 0.56
1.664CysAsn: 1.664 ± 0.477
2.773CysPro: 2.773 ± 1.316
1.664CysGln: 1.664 ± 0.649
0.555CysArg: 0.555 ± 0.322
1.664CysSer: 1.664 ± 1.197
1.664CysThr: 1.664 ± 0.477
0.555CysVal: 0.555 ± 0.322
0.0CysTrp: 0.0 ± 0.0
0.555CysTyr: 0.555 ± 0.56
0.0CysXaa: 0.0 ± 0.0
Asp
1.109AspAla: 1.109 ± 0.408
0.555AspCys: 0.555 ± 0.725
0.555AspAsp: 0.555 ± 0.322
0.0AspGlu: 0.0 ± 0.0
1.664AspPhe: 1.664 ± 1.299
1.109AspGly: 1.109 ± 0.644
1.109AspHis: 1.109 ± 0.644
1.664AspIle: 1.664 ± 0.477
1.109AspLys: 1.109 ± 0.408
7.21AspLeu: 7.21 ± 1.555
0.0AspMet: 0.0 ± 0.0
3.328AspAsn: 3.328 ± 1.348
3.328AspPro: 3.328 ± 1.021
1.664AspGln: 1.664 ± 0.477
0.555AspArg: 0.555 ± 0.56
3.328AspSer: 3.328 ± 0.954
1.664AspThr: 1.664 ± 0.965
1.109AspVal: 1.109 ± 0.408
0.0AspTrp: 0.0 ± 0.0
0.555AspTyr: 0.555 ± 0.725
0.0AspXaa: 0.0 ± 0.0
Glu
1.664GluAla: 1.664 ± 0.51
0.0GluCys: 0.0 ± 0.0
1.109GluAsp: 1.109 ± 0.644
0.555GluGlu: 0.555 ± 0.725
0.555GluPhe: 0.555 ± 0.322
1.664GluGly: 1.664 ± 1.299
1.109GluHis: 1.109 ± 0.408
1.109GluIle: 1.109 ± 0.608
0.555GluLys: 0.555 ± 0.56
2.219GluLeu: 2.219 ± 1.287
0.0GluMet: 0.0 ± 0.0
0.0GluAsn: 0.0 ± 0.0
3.882GluPro: 3.882 ± 3.34
2.219GluGln: 2.219 ± 0.817
2.773GluArg: 2.773 ± 0.265
0.555GluSer: 0.555 ± 0.322
2.773GluThr: 2.773 ± 0.265
1.664GluVal: 1.664 ± 1.197
0.0GluTrp: 0.0 ± 0.0
1.109GluTyr: 1.109 ± 0.608
0.0GluXaa: 0.0 ± 0.0
Phe
0.555PheAla: 0.555 ± 0.322
0.555PheCys: 0.555 ± 0.322
1.109PheAsp: 1.109 ± 1.451
1.109PheGlu: 1.109 ± 0.408
1.109PhePhe: 1.109 ± 0.644
0.555PheGly: 0.555 ± 0.56
2.219PheHis: 2.219 ± 1.474
1.109PheIle: 1.109 ± 0.408
1.664PheLys: 1.664 ± 0.965
4.437PheLeu: 4.437 ± 0.497
0.555PheMet: 0.555 ± 0.725
0.0PheAsn: 0.0 ± 0.0
2.773PhePro: 2.773 ± 0.985
2.773PheGln: 2.773 ± 1.07
1.109PheArg: 1.109 ± 0.644
2.773PheSer: 2.773 ± 1.386
1.664PheThr: 1.664 ± 0.477
0.555PheVal: 0.555 ± 0.322
0.555PheTrp: 0.555 ± 0.56
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.219GlyAla: 2.219 ± 0.824
0.0GlyCys: 0.0 ± 0.0
1.109GlyAsp: 1.109 ± 0.408
1.664GlyGlu: 1.664 ± 0.965
0.555GlyPhe: 0.555 ± 0.322
2.219GlyGly: 2.219 ± 1.474
1.109GlyHis: 1.109 ± 0.644
2.773GlyIle: 2.773 ± 1.316
1.664GlyLys: 1.664 ± 0.965
10.538GlyLeu: 10.538 ± 1.457
1.109GlyMet: 1.109 ± 0.676
1.664GlyAsn: 1.664 ± 0.51
7.21GlyPro: 7.21 ± 1.162
2.219GlyGln: 2.219 ± 0.865
2.219GlyArg: 2.219 ± 0.824
5.546GlySer: 5.546 ± 0.362
3.882GlyThr: 3.882 ± 0.23
0.555GlyVal: 0.555 ± 0.56
0.555GlyTrp: 0.555 ± 0.56
2.773GlyTyr: 2.773 ± 1.323
0.0GlyXaa: 0.0 ± 0.0
His
2.219HisAla: 2.219 ± 1.287
1.664HisCys: 1.664 ± 0.51
1.664HisAsp: 1.664 ± 0.965
0.0HisGlu: 0.0 ± 0.0
0.555HisPhe: 0.555 ± 0.322
2.773HisGly: 2.773 ± 1.07
3.882HisHis: 3.882 ± 1.448
3.328HisIle: 3.328 ± 1.286
0.555HisLys: 0.555 ± 0.56
4.992HisLeu: 4.992 ± 1.431
1.109HisMet: 1.109 ± 0.644
2.219HisAsn: 2.219 ± 0.704
2.773HisPro: 2.773 ± 0.265
2.219HisGln: 2.219 ± 0.249
2.219HisArg: 2.219 ± 0.249
2.219HisSer: 2.219 ± 0.817
2.219HisThr: 2.219 ± 1.217
2.773HisVal: 2.773 ± 1.316
1.664HisTrp: 1.664 ± 1.197
1.109HisTyr: 1.109 ± 0.644
0.0HisXaa: 0.0 ± 0.0
Ile
1.664IleAla: 1.664 ± 0.925
0.0IleCys: 0.0 ± 0.0
3.328IleAsp: 3.328 ± 0.295
1.109IleGlu: 1.109 ± 0.608
0.555IlePhe: 0.555 ± 0.322
1.109IleGly: 1.109 ± 0.644
1.109IleHis: 1.109 ± 0.644
3.882IleIle: 3.882 ± 0.23
1.109IleLys: 1.109 ± 0.608
11.093IleLeu: 11.093 ± 2.004
0.0IleMet: 0.0 ± 0.0
1.664IleAsn: 1.664 ± 0.51
4.992IlePro: 4.992 ± 1.089
3.328IleGln: 3.328 ± 0.295
3.328IleArg: 3.328 ± 1.348
5.546IleSer: 5.546 ± 2.042
6.101IleThr: 6.101 ± 1.592
1.109IleVal: 1.109 ± 0.408
1.109IleTrp: 1.109 ± 0.608
0.555IleTyr: 0.555 ± 0.322
0.0IleXaa: 0.0 ± 0.0
Lys
4.992LysAla: 4.992 ± 1.475
0.0LysCys: 0.0 ± 0.0
2.219LysAsp: 2.219 ± 2.128
1.109LysGlu: 1.109 ± 0.608
0.0LysPhe: 0.0 ± 0.0
1.664LysGly: 1.664 ± 0.649
0.555LysHis: 0.555 ± 0.322
0.0LysIle: 0.0 ± 0.0
2.219LysLys: 2.219 ± 0.704
2.219LysLeu: 2.219 ± 1.217
0.555LysMet: 0.555 ± 0.322
1.664LysAsn: 1.664 ± 0.477
2.219LysPro: 2.219 ± 0.704
2.219LysGln: 2.219 ± 0.249
1.109LysArg: 1.109 ± 0.408
2.219LysSer: 2.219 ± 0.704
2.219LysThr: 2.219 ± 0.824
1.109LysVal: 1.109 ± 0.644
1.109LysTrp: 1.109 ± 0.644
2.773LysTyr: 2.773 ± 1.609
0.0LysXaa: 0.0 ± 0.0
Leu
11.093LeuAla: 11.093 ± 2.17
2.773LeuCys: 2.773 ± 0.265
4.437LeuAsp: 4.437 ± 0.475
4.437LeuGlu: 4.437 ± 0.497
4.437LeuPhe: 4.437 ± 2.239
6.101LeuGly: 6.101 ± 1.857
9.983LeuHis: 9.983 ± 2.415
6.656LeuIle: 6.656 ± 0.746
1.664LeuLys: 1.664 ± 1.299
16.084LeuLeu: 16.084 ± 2.41
0.0LeuMet: 0.0 ± 0.449
6.656LeuAsn: 6.656 ± 1.037
15.53LeuPro: 15.53 ± 1.85
14.42LeuGln: 14.42 ± 2.11
7.765LeuArg: 7.765 ± 1.312
8.319LeuSer: 8.319 ± 1.247
8.874LeuThr: 8.874 ± 3.267
4.992LeuVal: 4.992 ± 1.211
2.219LeuTrp: 2.219 ± 0.249
3.328LeuTyr: 3.328 ± 1.225
0.0LeuXaa: 0.0 ± 0.0
Met
0.555MetAla: 0.555 ± 0.725
0.0MetCys: 0.0 ± 0.0
0.555MetAsp: 0.555 ± 0.322
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.109MetGly: 1.109 ± 0.816
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.109MetLys: 1.109 ± 0.608
0.555MetLeu: 0.555 ± 0.56
0.0MetMet: 0.0 ± 0.0
0.555MetAsn: 0.555 ± 0.322
0.555MetPro: 0.555 ± 0.322
1.109MetGln: 1.109 ± 1.451
0.0MetArg: 0.0 ± 0.0
1.664MetSer: 1.664 ± 0.965
1.664MetThr: 1.664 ± 0.51
0.555MetVal: 0.555 ± 0.56
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.219AsnAla: 2.219 ± 0.824
1.109AsnCys: 1.109 ± 0.408
0.0AsnAsp: 0.0 ± 0.0
0.555AsnGlu: 0.555 ± 0.322
1.664AsnPhe: 1.664 ± 1.197
2.219AsnGly: 2.219 ± 0.824
1.664AsnHis: 1.664 ± 0.925
2.773AsnIle: 2.773 ± 2.028
2.219AsnLys: 2.219 ± 0.824
3.328AsnLeu: 3.328 ± 0.535
0.555AsnMet: 0.555 ± 0.322
1.664AsnAsn: 1.664 ± 0.51
3.882AsnPro: 3.882 ± 2.497
2.773AsnGln: 2.773 ± 0.573
1.664AsnArg: 1.664 ± 1.679
3.882AsnSer: 3.882 ± 1.594
2.773AsnThr: 2.773 ± 0.974
1.664AsnVal: 1.664 ± 1.299
1.109AsnTrp: 1.109 ± 0.408
1.109AsnTyr: 1.109 ± 0.408
0.0AsnXaa: 0.0 ± 0.0
Pro
6.101ProAla: 6.101 ± 2.01
3.328ProCys: 3.328 ± 1.851
2.219ProAsp: 2.219 ± 1.287
2.773ProGlu: 2.773 ± 1.895
2.773ProPhe: 2.773 ± 1.316
6.656ProGly: 6.656 ± 1.677
4.437ProHis: 4.437 ± 1.006
6.101ProIle: 6.101 ± 2.673
2.773ProLys: 2.773 ± 1.895
8.874ProLeu: 8.874 ± 1.569
1.109ProMet: 1.109 ± 0.608
3.328ProAsn: 3.328 ± 1.225
12.202ProPro: 12.202 ± 3.83
5.546ProGln: 5.546 ± 1.794
5.546ProArg: 5.546 ± 1.794
12.202ProSer: 12.202 ± 1.8
5.546ProThr: 5.546 ± 0.53
5.546ProVal: 5.546 ± 1.627
2.219ProTrp: 2.219 ± 0.824
4.992ProTyr: 4.992 ± 1.222
0.0ProXaa: 0.0 ± 0.0
Gln
6.656GlnAla: 6.656 ± 1.67
2.219GlnCys: 2.219 ± 0.817
1.664GlnAsp: 1.664 ± 0.649
4.437GlnGlu: 4.437 ± 1.763
2.773GlnPhe: 2.773 ± 1.216
4.437GlnGly: 4.437 ± 0.475
2.219GlnHis: 2.219 ± 1.287
2.219GlnIle: 2.219 ± 1.287
1.664GlnLys: 1.664 ± 0.649
4.992GlnLeu: 4.992 ± 0.401
0.555GlnMet: 0.555 ± 0.725
2.219GlnAsn: 2.219 ± 1.681
6.101GlnPro: 6.101 ± 0.853
4.992GlnGln: 4.992 ± 2.411
2.219GlnArg: 2.219 ± 0.704
4.437GlnSer: 4.437 ± 1.634
4.437GlnThr: 4.437 ± 0.903
1.109GlnVal: 1.109 ± 0.644
2.219GlnTrp: 2.219 ± 0.704
2.219GlnTyr: 2.219 ± 0.881
0.0GlnXaa: 0.0 ± 0.0
Arg
2.773ArgAla: 2.773 ± 0.985
1.109ArgCys: 1.109 ± 1.119
1.664ArgAsp: 1.664 ± 0.51
1.664ArgGlu: 1.664 ± 0.51
2.773ArgPhe: 2.773 ± 0.265
3.882ArgGly: 3.882 ± 0.808
2.219ArgHis: 2.219 ± 0.704
0.0ArgIle: 0.0 ± 0.0
2.219ArgLys: 2.219 ± 1.287
8.319ArgLeu: 8.319 ± 0.795
1.664ArgMet: 1.664 ± 0.51
0.555ArgAsn: 0.555 ± 0.725
3.882ArgPro: 3.882 ± 0.23
1.664ArgGln: 1.664 ± 0.925
2.773ArgArg: 2.773 ± 1.323
3.328ArgSer: 3.328 ± 0.295
1.664ArgThr: 1.664 ± 0.51
2.219ArgVal: 2.219 ± 0.817
1.664ArgTrp: 1.664 ± 0.965
1.109ArgTyr: 1.109 ± 0.644
0.0ArgXaa: 0.0 ± 0.0
Ser
5.546SerAla: 5.546 ± 1.947
1.664SerCys: 1.664 ± 0.477
2.219SerAsp: 2.219 ± 0.881
2.219SerGlu: 2.219 ± 0.249
0.0SerPhe: 0.0 ± 0.0
4.437SerGly: 4.437 ± 1.945
1.109SerHis: 1.109 ± 0.408
2.773SerIle: 2.773 ± 0.985
3.328SerLys: 3.328 ± 1.931
17.194SerLeu: 17.194 ± 3.952
0.555SerMet: 0.555 ± 0.56
3.328SerAsn: 3.328 ± 0.295
13.311SerPro: 13.311 ± 1.333
5.546SerGln: 5.546 ± 1.97
4.992SerArg: 4.992 ± 1.613
8.874SerSer: 8.874 ± 3.607
3.882SerThr: 3.882 ± 0.842
2.219SerVal: 2.219 ± 0.704
4.437SerTrp: 4.437 ± 2.21
3.328SerTyr: 3.328 ± 1.921
0.0SerXaa: 0.0 ± 0.0
Thr
2.773ThrAla: 2.773 ± 0.573
1.664ThrCys: 1.664 ± 0.925
2.773ThrAsp: 2.773 ± 1.609
0.555ThrGlu: 0.555 ± 0.725
0.555ThrPhe: 0.555 ± 0.322
4.437ThrGly: 4.437 ± 2.21
5.546ThrHis: 5.546 ± 0.53
4.992ThrIle: 4.992 ± 1.682
1.109ThrLys: 1.109 ± 0.408
9.429ThrLeu: 9.429 ± 1.879
0.0ThrMet: 0.0 ± 0.0
2.773ThrAsn: 2.773 ± 0.985
9.429ThrPro: 9.429 ± 1.28
1.664ThrGln: 1.664 ± 1.679
2.219ThrArg: 2.219 ± 1.186
7.765ThrSer: 7.765 ± 1.683
4.437ThrThr: 4.437 ± 1.408
3.882ThrVal: 3.882 ± 0.842
0.0ThrTrp: 0.0 ± 0.0
1.109ThrTyr: 1.109 ± 0.608
0.0ThrXaa: 0.0 ± 0.0
Val
2.773ValAla: 2.773 ± 0.573
1.664ValCys: 1.664 ± 0.925
1.109ValAsp: 1.109 ± 0.408
0.555ValGlu: 0.555 ± 0.322
0.555ValPhe: 0.555 ± 0.322
1.109ValGly: 1.109 ± 0.408
0.0ValHis: 0.0 ± 0.0
2.773ValIle: 2.773 ± 0.828
2.773ValLys: 2.773 ± 0.828
6.101ValLeu: 6.101 ± 1.41
0.555ValMet: 0.555 ± 0.322
1.664ValAsn: 1.664 ± 1.197
1.109ValPro: 1.109 ± 0.408
0.555ValGln: 0.555 ± 0.322
1.109ValArg: 1.109 ± 0.816
6.656ValSer: 6.656 ± 1.052
3.328ValThr: 3.328 ± 0.954
1.109ValVal: 1.109 ± 0.608
1.664ValTrp: 1.664 ± 0.477
0.555ValTyr: 0.555 ± 0.322
0.0ValXaa: 0.0 ± 0.0
Trp
2.219TrpAla: 2.219 ± 0.881
0.0TrpCys: 0.0 ± 0.0
2.219TrpAsp: 2.219 ± 1.474
0.555TrpGlu: 0.555 ± 0.56
0.555TrpPhe: 0.555 ± 0.56
1.109TrpGly: 1.109 ± 0.408
1.109TrpHis: 1.109 ± 0.408
0.555TrpIle: 0.555 ± 0.322
1.109TrpLys: 1.109 ± 0.644
2.773TrpLeu: 2.773 ± 1.898
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.109TrpPro: 1.109 ± 0.644
1.664TrpGln: 1.664 ± 0.649
1.664TrpArg: 1.664 ± 0.477
0.0TrpSer: 0.0 ± 0.0
2.773TrpThr: 2.773 ± 0.828
1.664TrpVal: 1.664 ± 0.477
0.0TrpTrp: 0.0 ± 0.0
0.555TrpTyr: 0.555 ± 0.322
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.664TyrAla: 1.664 ± 0.477
1.109TyrCys: 1.109 ± 0.608
0.555TyrAsp: 0.555 ± 0.56
0.555TyrGlu: 0.555 ± 0.322
0.555TyrPhe: 0.555 ± 0.322
1.664TyrGly: 1.664 ± 0.649
0.555TyrHis: 0.555 ± 0.56
1.664TyrIle: 1.664 ± 0.51
1.109TyrLys: 1.109 ± 0.644
4.992TyrLeu: 4.992 ± 0.778
0.555TyrMet: 0.555 ± 0.443
1.109TyrAsn: 1.109 ± 0.608
1.109TyrPro: 1.109 ± 0.816
2.219TyrGln: 2.219 ± 0.503
0.555TyrArg: 0.555 ± 0.725
5.546TyrSer: 5.546 ± 1.957
1.664TyrThr: 1.664 ± 0.925
1.664TyrVal: 1.664 ± 0.965
0.0TyrTrp: 0.0 ± 0.0
0.555TyrTyr: 0.555 ± 0.322
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1804 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski