Difference between revisions of "Data Challenges"

From CMB-S4 wiki
 
(69 intermediate revisions by 4 users not shown)
Line 1: Line 1:
At the Chicago meeting it was agreed that it would be useful to provide common inputs for all science forecasting, including extragalactic foreground, Galactic foreground, and noise maps at a range of frequencies. It was further agreed that, in order to support the widest range of uses, these would be provided as all-sky HEALPix maps at Nside = 8192.
+
We separate "experiment definitions" (bands, resolutions, sensitivity level) from "sky models" (LCDM+dust+sync+).
 +
Each of these is designated by a number. The combination of a given experiment definition XX acting on a given sky model YY is designated XX.YY.
  
Possible sky-model inputs are:
+
Experiment models, sky models, and the datasets obtained by applying the former to the latter are stored at NERSC at <br />
* Extragalactic: Alvarez/Battaglia/Bond, PSM
+
'''/project/projectdirs/cmbs4/{expt_xx, sky_yy, data_xx.yy}'''
* Galactic: pySM, PSM
 
Please add to these lists as appropriate.
 
  
Inputs will be hosted at [http://www.nersc.gov NERSC], where everyone is welcome to [http://crd.lbl.gov/cmb sign up for an account] under the "Data Analysis for Post-Planck CMB Experiments" allocation (PI Borrill). The files will then be located in the shared CMB-S4 file space (/project/projectdirs/cmbs4). In order to manage this space, all files stored there should be owned by the cmbs4 project account but accessible to the cmbs4 group. To do this:
+
== Experiment Definitions ==
* log in to the cmbs4 project account following the instructions [http://www.nersc.gov/users/accounts/user-accounts/production-project-accounts here]
 
* sync the files to an appropriate subdirectory in the project space
 
* ensure that the permissions are set appropriately (g+rX,o-rwx)
 
Remember to include a README and to post the details on this wiki page.
 
  
For any NERSC issues, including access to the filegroup and/or project account, please contact [mailto:jdborrill@lbl.gov Julian Borrill].
+
'''[[Experiment Definitions]]'''
  
== Data Challenge 1 ==  
+
== Input Sky Components ==
Intended as a simple test to develop our practice, compare spectral- and map-based forecasting, contrast different map-based codes and methods, and coordinate r and non-r science forecasting.
 
  
'''[http://bicep.rc.fas.harvard.edu/cbischoff/20170210_data_challenge_1/index.html Maps pager for data challenge 1]''' (version 3, posted 2017-02-10)
+
'''[[Sky Components]]'''
  
* All maps are in HEALPix format with nside = 512 and &#x2113;<sub>max</sub> = 1024.
+
== Sky Models ==
* Maps have been filtered to remove all signal and noise below &#x2113; = 30.
 
  
Combined noise + lensed-&Lambda;CDM + dust + sync maps are posted on NERSC in directory:
+
'''[[Sky Models]]'''
  /project/projectdirs/cmbs4/data_challenges/phase1/sky/comb/map/
 
  
=== Instrument specification ===
 
{| class="wikitable" style="text-align: right; margin-left: 51px;"
 
! Frequency (GHz) !!  30 !!  40 !!  85 !!  95 !! 145 !! 155 !! 220 !! 270
 
|-
 
| Bandwidth (GHz) || 9.0 || 12.0 || 20.4 || 22.8 || 31.9 || 34.1 || 48.4 || 59.4
 
|-
 
| Beam FWHM (arcmin) || 76.6 || 57.5 || 27.0 || 24.2 || 15.9 || 14.8 || 10.7 || 8.5
 
|-
 
|}
 
  
* Tophat bandpasses [[Tophat bands for Data Challenge | described in this posting]], added to above table --[[User:Cbischoff|Cbischoff]] ([[User talk:Cbischoff|talk]]) 15:51, 8 November 2016 (UTC)
+
== Sim Data Sets ==
* Beam widths copied from [http://users.physics.harvard.edu/~buza/20160531_fisher/ Victor's posting here] --[[User:Cbischoff|Cbischoff]] ([[User talk:Cbischoff|talk]]) 19:20, 14 December 2016 (UTC)
 
  
Beam window functions for each frequency can be found on NERSC in:
+
Many of the possible xx.yy combinations have been generated and are available on NERSC.
  /project/projectdirs/cmbs4/data_challenges/phase1/sky/wfunc/
 
In addition to beam smoothing at high ell, these window functions drop sharply to zero below ell=30 (maps for the data challenge contain no information below ell=30).
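The shape of such a window function can be sketched as follows. This is a minimal illustration, assuming Gaussian beams with the FWHM values from the instrument table and a hard cut below &#x2113; = 30; the function name <tt>beam_window</tt> and the exact form of the low-&#x2113; cut are assumptions, and the files on NERSC remain the reference.

```python
import numpy as np

# Beam FWHM in arcmin per band (GHz), copied from the instrument table above.
FWHM_ARCMIN = {30: 76.6, 40: 57.5, 85: 27.0, 95: 24.2,
               145: 15.9, 155: 14.8, 220: 10.7, 270: 8.5}


def beam_window(fwhm_arcmin, lmax=1024, lmin=30):
    """Gaussian beam window B_ell, set to zero below lmin.

    The hard step at lmin is an assumption; the text only says the
    window functions "drop sharply to zero" below ell = 30.
    """
    ell = np.arange(lmax + 1)
    # Convert FWHM (arcmin) to the Gaussian sigma in radians.
    sigma = np.radians(fwhm_arcmin / 60.0) / np.sqrt(8.0 * np.log(2.0))
    b_ell = np.exp(-0.5 * ell * (ell + 1) * sigma**2)
    b_ell[ell < lmin] = 0.0
    return ell, b_ell


# One window function per frequency band.
windows = {freq: beam_window(fwhm)[1] for freq, fwhm in FWHM_ARCMIN.items()}
```

The power spectrum of a smoothed map is suppressed by the square of this window, so a forecast that compares against the posted maps would multiply its model spectra by <tt>windows[freq]**2</tt>.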
 
  
=== Scalar, tensor & non-Gaussian CMB: Borrill ===
+
Most of 02.00 through 02.09 exist - these are the "Science Book config".
  
'''Lensed scalar CMB''', consistent with Planck 2015 and low &tau; parameters
+
Most of 04.00 through 04.06 exist - these are the "CDT report config".
* Lensed CMB maps are posted on NERSC in directory:
 
  /project/projectdirs/cmbs4/data_challenges/phase1/sky/llcdm/map/
 
  
'''Unlensed CMB?'''
+
For experiment config 06, only foreground models 00, 07, and 09 were generated - these are the "DSR report config".
  
'''Corresponding Kappa Maps?'''
+
== NERSC Info ==
  
=== Extragalactic foregrounds: Alvarez / Battaglia / Bond / Stein ===
+
Shared space is available on [http://www.nersc.gov NERSC], where everyone is welcome to [http://crd.lbl.gov/cmb sign up for an account] under the "Data Analysis for Post-Planck CMB Experiments" allocation (PI Borrill). The files are located in the shared CMB-S4 file space (<code>/global/cfs/cdirs/cmbs4</code>). In order to manage this space, all files stored there should be owned by the cmbs4 project account but accessible to the cmbs4 group. To do this:
 +
* log in to the cmbs4 project account following the instructions [http://www.nersc.gov/users/accounts/user-accounts/production-project-accounts here]
 +
* sync the files to an appropriate subdirectory in the project space
 +
* ensure that the permissions are set appropriately (g+rX,o-rwx)
 +
Remember to include a README and to post the details on this wiki page.
  
Not included in Data Challenge 1
+
For any NERSC issues, including access to the filegroup and/or project account, please contact [mailto:jdborrill@lbl.gov Julian Borrill].
 
 
=== Galactic foregrounds ===
 
 
 
'''Gaussian dust'''
 
* Amplitude (in BB spectrum) is 4.25 &mu;K<sup>2</sup> at &nu; = 353 GHz, &#x2113; = 80. This is the value that was used for Science Book forecasts and is listed in [http://users.physics.harvard.edu/~buza/20160531_fisher/ Victor's 2016-05-31 posting] (Section 2). It also corresponds to the best-fit dust amplitude from the [http://adsabs.harvard.edu/abs/2016PhRvL.116c1302B BK14 result].
 
* Dust amplitude in EE is 2x larger than BB. Dust amplitude in TT is 10x larger than EE (20x larger than BB). There is no TE, TB, or EB correlation.
 
* Dust scaling in frequency follows a greybody spectrum with &beta;<sub>dust</sub> = 1.6 and T<sub>dust</sub> = 19.6 K.
 
* Dust D<sub>&#x2113;</sub> scaling follows a power law in ell with exponent &alpha;<sub>dust</sub> = -0.4.
 
* Dust maps are posted on NERSC in directory:
 
  /project/projectdirs/cmbs4/data_challenges/phase1/sky/gdust/map/
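The dust parameters listed above can be turned into a model BB spectrum at any band. The following is a sketch only: it assumes the 4.25 &mu;K<sup>2</sup> amplitude is in CMB thermodynamic units, takes T<sub>CMB</sub> = 2.7255 K (not stated above), and uses hypothetical function names. It is not the code that generated the maps.

```python
import numpy as np

H_PLANCK = 6.62607015e-34  # Planck constant, J s
K_BOLTZ = 1.380649e-23     # Boltzmann constant, J / K
T_CMB = 2.7255             # K; assumed value, not given in the text


def greybody_intensity(nu_ghz, beta=1.6, t_dust=19.6):
    """nu^beta * B_nu(T_dust), up to a frequency-independent factor."""
    x = H_PLANCK * nu_ghz * 1e9 / (K_BOLTZ * t_dust)
    return nu_ghz**beta * nu_ghz**3 / np.expm1(x)


def cmb_unit_factor(nu_ghz):
    """dB_nu/dT at T_CMB, up to a frequency-independent factor."""
    x = H_PLANCK * nu_ghz * 1e9 / (K_BOLTZ * T_CMB)
    return nu_ghz**4 * np.exp(x) / np.expm1(x) ** 2


def dust_map_scaling(nu_ghz, nu_ref_ghz=353.0):
    """Map-level dust amplitude at nu relative to nu_ref, in CMB units."""
    num = greybody_intensity(nu_ghz) / cmb_unit_factor(nu_ghz)
    den = greybody_intensity(nu_ref_ghz) / cmb_unit_factor(nu_ref_ghz)
    return num / den


def dust_bb_dl(ell, nu_ghz, amp=4.25, alpha=-0.4, ell_pivot=80.0):
    """Model dust BB D_ell (uK^2): power law in ell, greybody in frequency."""
    # The spectrum scales as the square of the map-level amplitude.
    return amp * (ell / ell_pivot) ** alpha * dust_map_scaling(nu_ghz) ** 2
```

By construction <tt>dust_bb_dl(80, 353.0)</tt> returns the quoted 4.25 &mu;K<sup>2</sup>; the EE and TT spectra follow by multiplying by 2 and 20 respectively.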
 
 
 
'''Gaussian synchrotron'''
 
* Amplitude (in BB spectrum) is 3.8 &mu;K<sup>2</sup> at &nu; = 23 GHz, &#x2113; = 80. This is the value that was used for Science Book forecasts and is listed in [http://users.physics.harvard.edu/~buza/20160531_fisher/ Victor's 2016-05-31 posting] (Section 2). It also corresponds to the 95% upper limit from the [http://adsabs.harvard.edu/abs/2016PhRvL.116c1302B BK14 result].
 
* Sync amplitude in EE is 2x larger than BB. Sync amplitude in TT is 10x larger than EE (20x larger than BB). There is no TE, TB, or EB correlation.
 
* Sync scaling in frequency follows a power-law spectrum with &beta;<sub>sync</sub> = -3.1.
 
* Sync D<sub>&#x2113;</sub> scaling follows a power law in ell with exponent &alpha;<sub>sync</sub> = -0.6.
 
* Sync maps are posted on NERSC in directory:
 
  /project/projectdirs/cmbs4/data_challenges/phase1/sky/gsync/map/
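As a sketch of how the synchrotron bullets combine, the function below evaluates the model D<sub>&#x2113;</sub> for TT, EE, and BB. It assumes that the EE and TT ratios quoted in this list apply to synchrotron and that the frequency power law acts on the map amplitude in brightness-temperature units with no unit conversion; the name <tt>sync_dl</tt> is hypothetical.

```python
# Spectrum amplitudes relative to BB, from the bullets above:
# EE = 2 x BB and TT = 10 x EE = 20 x BB.
RATIO_TO_BB = {"BB": 1.0, "EE": 2.0, "TT": 20.0}


def sync_dl(ell, nu_ghz=23.0, spectrum="BB", amp_bb=3.8,
            alpha=-0.6, beta=-3.1, ell_pivot=80.0, nu_ref_ghz=23.0):
    """Model synchrotron D_ell (uK^2) at multipole ell and frequency nu.

    The frequency scaling is applied to the map amplitude, so the power
    spectrum picks up the factor squared. Treating the amplitude as
    brightness temperature is an assumption of this sketch.
    """
    freq_factor = (nu_ghz / nu_ref_ghz) ** (2.0 * beta)
    ell_factor = (ell / ell_pivot) ** alpha
    return RATIO_TO_BB[spectrum] * amp_bb * ell_factor * freq_factor
```

At the reference point, <tt>sync_dl(80)</tt> returns the quoted BB amplitude of 3.8 &mu;K<sup>2</sup>. The TE, TB, and EB spectra are zero in this model, so they need no function.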
 
 
 
=== Noise: homogeneous isotropic matched to N_ell: Victor / Clem / Colin / John K ===
 
 
 
'''White + 1/&#x2113; noise'''
 
* N<sub>&#x2113;</sub> functional form and parameters listed in [http://users.physics.harvard.edu/~buza/20161220_chkS4/ Victor's 2016-12-20 posting] (Table 2).
 
* Noise a<sub>&#x2113;m</sub> are calculated out to &#x2113; = 4096, but maps are rendered at nside = 512 with &#x2113;<sub>max</sub> = 2048. No beam smoothing is applied.
 
* Full sky noise maps are generated from N<sub>&#x2113;</sub>, then scaled by the square root of variance map to produce noise that is lowest in the center of the observed field and blows up towards the edge.
 
* Noise maps are posted on NERSC in directory:
 
  /project/projectdirs/cmbs4/data_challenges/phase1/sky/noise/map/
 
* Maps with filename like <tt>r0000_f030_n512.fits</tt> are full sky noise maps. Filenames like <tt>r0000_f030_n512_b00p0_mfsky03.fits</tt> have been multiplied by the square root of variance map.
 
 
 
=== Masks: Clem ===
 
 
 
The noise inverse variance map is defined as a flat circular region with radius = 12&deg;, surrounded by a tapered region that falls to zero with a cosine profile over an additional 15&deg;. The map ranges from 0 (in unobserved pixels) to 1 (in the flat 12&deg; circular region). Noise simulations are generated over the full sky from a C_l spectrum, then multiplied by 1 / sqrt(inverse variance) so that the noise amplitude blows up near the edge of the field.
 
 
 
* (Sum of inverse variance map) / (# of pixels in full sky) = 0.0293, so total noise power should be equivalent to fsky = 3%.
 
* Fraction of pixels that are observed is 5.4%.
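The two sky fractions above can be checked with a short radial integration. This is a sketch under one stated assumption: the "cos profile" is taken to be the half-cosine taper 0.5 (1 + cos(&pi; x)), with x running from 0 to 1 across the 15&deg; taper. The posted mask file is the reference; the function names here are hypothetical.

```python
import numpy as np


def inverse_variance_profile(theta_deg, flat_deg=12.0, taper_deg=15.0):
    """Inverse variance versus angular distance from the field center.

    Equals 1 inside flat_deg, falls to 0 over the next taper_deg, and is 0
    beyond. The half-cosine taper shape is an assumption of this sketch.
    """
    theta = np.asarray(theta_deg, dtype=float)
    x = np.clip((theta - flat_deg) / taper_deg, 0.0, 1.0)
    return 0.5 * (1.0 + np.cos(np.pi * x))


def sky_average(values, theta_deg):
    """Full-sky average of an azimuthally symmetric function of theta."""
    theta = np.radians(theta_deg)
    integrand = 0.5 * values * np.sin(theta)
    # Trapezoid rule over theta in [0, pi].
    return np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(theta))


theta_deg = np.linspace(0.0, 180.0, 180001)
weight = inverse_variance_profile(theta_deg)

fsky_observed = sky_average((weight > 0).astype(float), theta_deg)
fsky_effective = sky_average(weight, theta_deg)

# Noise scaling factor 1 / sqrt(inverse variance); infinite where unobserved.
noise_scale = np.full_like(weight, np.inf)
noise_scale[weight > 0] = 1.0 / np.sqrt(weight[weight > 0])
```

With this assumed taper, <tt>fsky_observed</tt> comes out near 0.054 and <tt>fsky_effective</tt> near 0.029, in line with the 5.4% and 0.0293 quoted above; small differences from the pixelized map are expected.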
 
  
File located on NERSC at:
+
=== HPSS Archive ===
  /project/projectdirs/cmbs4/data_challenges/phase1/sky/amask/map/fsky03_fall_n512.fits
 
  
=== Analysis ===
+
Older data challenge maps are moved to the [https://docs.nersc.gov/filesystems/archive/ High Performance Storage System] (HPSS). These are stored in the <code>low_ell_BB</code> directory in the <code>cmbs4</code> user home directory of HPSS. For each Data Challenge, the maps are split across several archives so that each archive file is a few hundred GB in size.
  
=== Schedule ===
+
List the data challenge sets that have been archived here:
* Dec 7: Beta version of Phase 1 maps
+
02.00,  02.01, 02.02, 02.03, 02.04, 02.05, 02.06, 02.09
* Dec 14: Telecon
+
02b.00, 02b.03
* Dec 17: Noise power (N_ell) prescriptions to Clem from Victor and Colin
+
02c.00, 02c.03
* Dec 21: A few realizations of Phase 1 maps delivered by Clem
+
03.00,  03.03
* Dec 21: Telecon (note phase shift)
+
03b.00, 03b.03
* Jan 4: Telecon
+
03c.00, 03c.03
* Jan 12: 100 realizations of Phase 1 maps posted to NERSC
 
* Jan 18: Telecon
 
* Feb 1: Telecon
 
* Feb 15: Telecon
 
* Feb 27, 28, Mar 1: S4 general meeting at SLAC
 
* Mar 2, 3: S4 CDT meeting at SLAC
 

Latest revision as of 11:39, 25 February 2020

We separate "experiment definitions" (bands, resolutions, sensitivity level) from "sky models" (LCDM+dust+sync+). Each of these is designated by a number. The combination of a given experiment definition XX acting on a given sky model YY is designated XX.YY.

Experiment models, sky models, and the datasets obtained by applying the former to the latter are stored at NERSC at
/project/projectdirs/cmbs4/{expt_xx, sky_yy, data_xx.yy}
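The naming convention can be illustrated with a small helper. This sketch reads the brace template above literally (directories named <tt>expt_XX</tt>, <tt>sky_YY</tt>, and <tt>data_XX.YY</tt>), which is an assumption about the on-disk layout; the function name <tt>dataset_paths</tt> is hypothetical.

```python
from pathlib import PurePosixPath

# Root taken from the path template above.
ROOT = PurePosixPath("/project/projectdirs/cmbs4")


def dataset_paths(expt, sky):
    """Return the three directories implied by an XX.YY designation.

    Identifiers are kept as strings because some experiment
    definitions carry a letter suffix (for example "02b").
    """
    return {
        "experiment": ROOT / f"expt_{expt}",
        "sky": ROOT / f"sky_{sky}",
        "data": ROOT / f"data_{expt}.{sky}",
    }


paths = dataset_paths("02", "03")
```

Here <tt>paths["data"]</tt> is <tt>/project/projectdirs/cmbs4/data_02.03</tt>, the dataset obtained by applying experiment definition 02 to sky model 03.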

Experiment Definitions

Experiment Definitions

Input Sky Components

Sky Components

Sky Models

Sky Models


Sim Data Sets

Many of the possible xx.yy combinations have been generated and are available on NERSC.

Most of 02.00 through 02.09 exist - these are the "Science Book config".

Most of 04.00 through 04.06 exist - these are the "CDT report config".

For experiment config 06, only foreground models 00, 07, and 09 were generated - these are the "DSR report config".

NERSC Info

Shared space is available on NERSC, where everyone is welcome to sign up for an account under the "Data Analysis for Post-Planck CMB Experiments" allocation (PI Borrill). The files are located in the shared CMB-S4 file space (/global/cfs/cdirs/cmbs4). In order to manage this space, all files stored there should be owned by the cmbs4 project account but accessible to the cmbs4 group. To do this:

  • log in to the cmbs4 project account following the instructions here
  • sync the files to an appropriate subdirectory in the project space
  • ensure that the permissions are set appropriately (g+rX,o-rwx)
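The permission rule in the last step (g+rX,o-rwx) can be sketched as follows. This mirrors the usual chmod semantics, where X grants execute/search only to directories and to files that already have an execute bit; the function names are hypothetical, and a plain recursive chmod on the command line achieves the same result.

```python
import os
import stat


def shared_mode(mode, is_dir):
    """Return the permission bits after applying g+rX,o-rwx to mode."""
    new_mode = mode | stat.S_IRGRP  # g+r
    any_exec = mode & (stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
    if is_dir or any_exec:
        new_mode |= stat.S_IXGRP  # g+X
    # o-rwx: clear all permissions for "other".
    new_mode &= ~(stat.S_IROTH | stat.S_IWOTH | stat.S_IXOTH)
    return new_mode


def apply_shared_permissions(root):
    """Apply shared_mode to root and everything below it, skipping symlinks."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for path in [dirpath] + [os.path.join(dirpath, f) for f in filenames]:
            if os.path.islink(path):
                continue
            current = stat.S_IMODE(os.stat(path).st_mode)
            os.chmod(path, shared_mode(current, os.path.isdir(path)))
```

A call such as <tt>apply_shared_permissions("my_subdirectory")</tt>, run from the project account after syncing, would leave owner permissions untouched while opening read access to the cmbs4 group and closing it to everyone else.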

Remember to include a README and to post the details on this wiki page.

For any NERSC issues, including access to the filegroup and/or project account, please contact Julian Borrill.

HPSS Archive

Older data challenge maps are moved to the High Performance Storage System (HPSS). These are stored in the low_ell_BB directory in the cmbs4 user home directory of HPSS. For each Data Challenge, the maps are split across several archives so that each archive file is a few hundred GB in size.

List the data challenge sets that have been archived here:

02.00,  02.01, 02.02, 02.03, 02.04, 02.05, 02.06, 02.09
02b.00, 02b.03
02c.00, 02c.03
03.00,  03.03
03b.00, 03b.03
03c.00, 03c.03