<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://pwiki.pic.es/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Sbogaart</id>
	<title>Public PIC Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://pwiki.pic.es/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Sbogaart"/>
	<link rel="alternate" type="text/html" href="https://pwiki.pic.es/index.php?title=Special:Contributions/Sbogaart"/>
	<updated>2026-05-25T07:08:39Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.35.14</generator>
	<entry>
		<id>https://pwiki.pic.es/index.php?title=HDFS_Access_via_VOSpace&amp;diff=1238</id>
		<title>HDFS Access via VOSpace</title>
		<link rel="alternate" type="text/html" href="https://pwiki.pic.es/index.php?title=HDFS_Access_via_VOSpace&amp;diff=1238"/>
		<updated>2025-07-23T11:30:50Z</updated>

		<summary type="html">&lt;p&gt;Sbogaart: Created page with &amp;quot;= Introduction =  PIC provides access to the distributed file system HDFS through a '''VOSpace server''', following the IVOA standard described in [https://www.ivoa.net/docume...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Introduction =&lt;br /&gt;
&lt;br /&gt;
PIC provides access to the distributed file system HDFS through a '''VOSpace server''', following the IVOA standard described in [https://www.ivoa.net/documents/VOSpace/20180502/PR-VOSpace-2.1-20180502.html VOSpace 2.1].&lt;br /&gt;
&lt;br /&gt;
This service is an alternative to WebDAV access, allowing users to manage their data programmatically and in a structured way using tools compatible with the Virtual Observatory (VO) ecosystem. It is especially aimed at users who require data management operations such as reading, writing, moving, and metadata querying within a standardized environment.&lt;br /&gt;
&lt;br /&gt;
= How to connect to the service =&lt;br /&gt;
&lt;br /&gt;
== VOSpace Endpoint ==&lt;br /&gt;
&lt;br /&gt;
The VOSpace server is available at the following URL:&lt;br /&gt;
&lt;br /&gt;
    https://vospace.pic.es/vospace&lt;br /&gt;
&lt;br /&gt;
== Compatible clients ==&lt;br /&gt;
&lt;br /&gt;
You can access the service using tools compatible with the VOSpace 2.1 standard:&lt;br /&gt;
&lt;br /&gt;
* [https://pypi.org/project/vos/ '''vos''' client by CADC]&lt;br /&gt;
: Install with pip:&lt;br /&gt;
    pip install vos&amp;gt;=3.6.3&lt;br /&gt;
&lt;br /&gt;
* '''curl''':&lt;br /&gt;
: Can be used to perform HTTP operations following the examples defined in the [https://www.ivoa.net/documents/VOSpace/20180502/PR-VOSpace-2.1-20180502.html VOSpace 2.1 standard], including node creation, file transfers, and property queries.&lt;br /&gt;
&lt;br /&gt;
== Authentication ==&lt;br /&gt;
&lt;br /&gt;
The VOSpace server allows both anonymous and authenticated access.&lt;br /&gt;
To access restricted data or personal space, users must specify their '''PIC username and password'''.&lt;br /&gt;
&lt;br /&gt;
= Usage Guidelines =&lt;br /&gt;
&lt;br /&gt;
Once connected to the service, users can perform a variety of operations defined by the VOSpace standard. Supported functionalities include:&lt;br /&gt;
&lt;br /&gt;
* '''getProtocols''': Query supported transfer protocols.&lt;br /&gt;
* '''getViews''': View available data formats (views).&lt;br /&gt;
* '''getProperties''': Obtain node properties.&lt;br /&gt;
* '''createNode''': Create files or folders.&lt;br /&gt;
* '''getNode''': Retrieve node information.&lt;br /&gt;
* '''deleteNode''': Delete files or folders.&lt;br /&gt;
* '''moveNode''': Move nodes inside the VOSpace.&lt;br /&gt;
* '''put / get''': Upload or download files.&lt;br /&gt;
* '''pushToVoSpace / pullFromVoSpace''': Transfer files via external URLs.&lt;br /&gt;
&lt;br /&gt;
= Best Practices =&lt;br /&gt;
&lt;br /&gt;
* Avoid using the '''!''' character in file or folder names, as it may cause compatibility issues with the CADC '''vos''' client.&lt;br /&gt;
* Use tools that comply with the VOSpace standard to ensure compatibility and avoid transfer errors.&lt;br /&gt;
&lt;br /&gt;
= Troubleshooting =&lt;br /&gt;
&lt;br /&gt;
'''Cannot access the server:'''&lt;br /&gt;
* Verify the URL is correct: &amp;lt;code&amp;gt;https://vospace.pic.es/vospace&amp;lt;/code&amp;gt;&lt;br /&gt;
* Make sure PIC credentials are set correctly.&lt;br /&gt;
* Check client configuration (e.g., '''vos''', '''curl''').&lt;br /&gt;
&lt;br /&gt;
'''Problems with properties:'''&lt;br /&gt;
* The '''setProperties''' operation is '''not allowed''' due to permission restrictions on HDFS.&lt;br /&gt;
&lt;br /&gt;
= Security and Data Management =&lt;br /&gt;
&lt;br /&gt;
* Protect PIC credentials. Do not share or store them in plain text.&lt;br /&gt;
* For automation, consider using secure credential managers or temporary storage.&lt;br /&gt;
&lt;br /&gt;
= Usage Examples =&lt;br /&gt;
&lt;br /&gt;
== Using the CADC '''vos''' client (Python) ==&lt;br /&gt;
&lt;br /&gt;
This server supports [https://pypi.org/project/vos/ the CADC vos client] and operations like '''listdir''', '''mkdir''', '''copy''', '''move''', and '''delete'''. Minimum version: '''vos &amp;gt;= 3.6.3'''.&lt;br /&gt;
&lt;br /&gt;
=== Configuration ===&lt;br /&gt;
&lt;br /&gt;
* Create a config file (e.g. &amp;lt;code&amp;gt;vos-config.ini&amp;lt;/code&amp;gt;):&lt;br /&gt;
    [vos]&lt;br /&gt;
    resourceID = https://vospace.pic.es/vospace http&lt;br /&gt;
&lt;br /&gt;
* Export the file path as an environment variable:&lt;br /&gt;
&lt;br /&gt;
    export VOSPACE_CONFIG_FILE=/path/to/vos-config.ini&lt;br /&gt;
&lt;br /&gt;
* Create &amp;lt;code&amp;gt;~/.netrc&amp;lt;/code&amp;gt; for authentication:&lt;br /&gt;
&lt;br /&gt;
    machine https://vospace.pic.es/vospace&lt;br /&gt;
      login USERNAME&lt;br /&gt;
      password PASSWORD&lt;br /&gt;
&lt;br /&gt;
* Set permissions:&lt;br /&gt;
&lt;br /&gt;
    chmod 600 ~/.netrc&lt;br /&gt;
=== Python usage examples ===&lt;br /&gt;
&lt;br /&gt;
    import vos&lt;br /&gt;
&lt;br /&gt;
    client = vos.Client()&lt;br /&gt;
&lt;br /&gt;
    # List contents&lt;br /&gt;
    files = client.listdir(&amp;quot;https://vospace.pic.es/vospace/nodes/user/my_user/&amp;quot;)&lt;br /&gt;
    print(&amp;quot;Contents:&amp;quot;, files)&lt;br /&gt;
&lt;br /&gt;
    # Create directory&lt;br /&gt;
    client.mkdir(&amp;quot;https://vospace.pic.es/vospace/nodes/user/my_user/mydir&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
    # Upload file&lt;br /&gt;
    client.copy(&amp;quot;localfile.txt&amp;quot;, &amp;quot;https://vospace.pic.es/vospace/user/my_user/mydir/localfile.txt&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
    # Download file&lt;br /&gt;
    client.copy(&amp;quot;https://vospace.pic.es/vospace/user/my_user/mydir/localfile.txt&amp;quot;, &amp;quot;downloaded.txt&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
    # Move file&lt;br /&gt;
    client.move(&lt;br /&gt;
        &amp;quot;https://vospace.pic.es/vospace/user/my_user/mydir/localfile.txt&amp;quot;,&lt;br /&gt;
        &amp;quot;https://vospace.pic.es/vospace/user/my_user/mydir/renamedfile.txt&amp;quot;&lt;br /&gt;
    )&lt;br /&gt;
&lt;br /&gt;
    # Get node info&lt;br /&gt;
    node = client.get_node(&amp;quot;https://vospace.pic.es/vospace/nodes/user/my_user/mydir/renamedfile.txt&amp;quot;)&lt;br /&gt;
    print(&amp;quot;Properties:&amp;quot;, node.props)&lt;br /&gt;
&lt;br /&gt;
    # Delete resource&lt;br /&gt;
    client.delete(&amp;quot;https://vospace.pic.es/vospace/nodes/user/my_user/mydir&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
'''Note''': Ensure configuration is properly set.&lt;br /&gt;
&lt;br /&gt;
== Using curl ==&lt;br /&gt;
&lt;br /&gt;
You can also interact with the server using '''curl''' or other HTTP tools.&lt;br /&gt;
&lt;br /&gt;
These must follow the [https://www.ivoa.net/documents/VOSpace/20180502/PR-VOSpace-2.1-20180502.html VOSpace 2.1] specification.&lt;br /&gt;
&lt;br /&gt;
'''Important:'''&lt;br /&gt;
* Avoid using '''!''' in URLs.&lt;br /&gt;
* Use full HTTP paths like &amp;lt;code&amp;gt;/vospace/nodes/...&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
= VOSpace Server Endpoints =&lt;br /&gt;
&lt;br /&gt;
The server provides the following REST endpoints per the VOSpace 2.1 specification:&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
! Method !! Path !! Description&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/protocols&amp;lt;/code&amp;gt; || Retrieves supported transfer protocols&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/views&amp;lt;/code&amp;gt; || Retrieves available data views&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/properties&amp;lt;/code&amp;gt; || Retrieves node properties&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/capabilities&amp;lt;/code&amp;gt; || Retrieves server capabilities&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/{job_id}&amp;lt;/code&amp;gt; || Retrieves job information&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/{job_id}/phase&amp;lt;/code&amp;gt; || Retrieves job phase&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/{job_id}/error&amp;lt;/code&amp;gt; || Retrieves job error info&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/{job_id}/results/transferDetails&amp;lt;/code&amp;gt; || Retrieves transfer details&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/{path}&amp;lt;/code&amp;gt; || Retrieves a file/folder node&lt;br /&gt;
|-&lt;br /&gt;
| GET || &amp;lt;code&amp;gt;/vospace/{path}&amp;lt;/code&amp;gt; || Downloads a file (streaming)&lt;br /&gt;
|-&lt;br /&gt;
| PUT || &amp;lt;code&amp;gt;/vospace/{path}&amp;lt;/code&amp;gt; || Creates a file or folder&lt;br /&gt;
|-&lt;br /&gt;
| PUT || &amp;lt;code&amp;gt;/vospace/{path}&amp;lt;/code&amp;gt; || Uploads a file&lt;br /&gt;
|-&lt;br /&gt;
| POST || &amp;lt;code&amp;gt;/vospace/&amp;lt;/code&amp;gt; || Moves or copies a node (returns 303 redirect)&lt;br /&gt;
|-&lt;br /&gt;
| POST || &amp;lt;code&amp;gt;/vospace/synctrans&amp;lt;/code&amp;gt; || Push/pull transfer&lt;br /&gt;
|-&lt;br /&gt;
| POST || &amp;lt;code&amp;gt;/vospace/{path}&amp;lt;/code&amp;gt; || Set properties (''not supported'')&lt;br /&gt;
|-&lt;br /&gt;
| POST || &amp;lt;code&amp;gt;/vospace/{job_id}/phase&amp;lt;/code&amp;gt; || Change job phase&lt;br /&gt;
|-&lt;br /&gt;
| DELETE || &amp;lt;code&amp;gt;/vospace/{path}&amp;lt;/code&amp;gt; || Deletes a file or folder&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
* &amp;lt;code&amp;gt;{path}&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;{job_id}&amp;lt;/code&amp;gt; are dynamic parameters.&lt;br /&gt;
* The server uses standard HTTP error codes: 400, 403, 404, 409, 500.&lt;br /&gt;
* '''PUT''' supports uploads and node creation.&lt;br /&gt;
* '''POST /vospace/''' moves or copies nodes (responds with '''303 See Other''').&lt;br /&gt;
* '''setProperties''' is disabled due to HDFS permissions.&lt;br /&gt;
* File downloads return streamed content with '''GET'''.&lt;br /&gt;
* Avoid using '''!''' in paths for full compatibility with the CADC client.&lt;/div&gt;</summary>
		<author><name>Sbogaart</name></author>
	</entry>
	<entry>
		<id>https://pwiki.pic.es/index.php?title=Main_Page&amp;diff=1237</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://pwiki.pic.es/index.php?title=Main_Page&amp;diff=1237"/>
		<updated>2025-07-23T10:50:15Z</updated>

		<summary type="html">&lt;p&gt;Sbogaart: /* Storage */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Getting started ==&lt;br /&gt;
* [[PIC description|PIC in an image]]&lt;br /&gt;
* [[PIC account|Get a PIC account]]&lt;br /&gt;
* [[PIC_User_Manual | User manual]]&lt;br /&gt;
* [[faq| Frequently asked questions]]&lt;br /&gt;
&lt;br /&gt;
== Services ==&lt;br /&gt;
=== Distributed computing ===&lt;br /&gt;
* [[HTCondor]]&lt;br /&gt;
* [[Dask]]&lt;br /&gt;
* Spark:&lt;br /&gt;
** [[Spark on Hadoop|on Hadoop]]&lt;br /&gt;
** [[Spark_on_farm|on HTCondor]]&lt;br /&gt;
&lt;br /&gt;
=== Storage ===&lt;br /&gt;
* [[Storage]]&lt;br /&gt;
* [[Hadoop Distributed File System (HDFS)]]&lt;br /&gt;
* [[HDFS Access via VOSpace]]&lt;br /&gt;
* [[Transferring data to/from PIC]]&lt;br /&gt;
&lt;br /&gt;
=== User interfaces ===&lt;br /&gt;
* [[Login machines]]&lt;br /&gt;
* [[JupyterHub]]&lt;br /&gt;
&lt;br /&gt;
=== Other services ===&lt;br /&gt;
* [[Gitlab]]&lt;br /&gt;
* [[CosmoHub]]&lt;br /&gt;
&lt;br /&gt;
== Experiments ==&lt;br /&gt;
&lt;br /&gt;
* [[Euclid]]&lt;br /&gt;
* [[AGN ICE]]&lt;br /&gt;
* [[ICFO]]&lt;br /&gt;
&lt;br /&gt;
== More technical information ==&lt;br /&gt;
&lt;br /&gt;
* [[Storage Department]]&lt;/div&gt;</summary>
		<author><name>Sbogaart</name></author>
	</entry>
	<entry>
		<id>https://pwiki.pic.es/index.php?title=Hadoop_Distributed_File_System_(HDFS)&amp;diff=1225</id>
		<title>Hadoop Distributed File System (HDFS)</title>
		<link rel="alternate" type="text/html" href="https://pwiki.pic.es/index.php?title=Hadoop_Distributed_File_System_(HDFS)&amp;diff=1225"/>
		<updated>2025-05-19T08:41:00Z</updated>

		<summary type="html">&lt;p&gt;Sbogaart: /* How to connect to the service */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Introduction =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
PIC offers a WebDAV service over WebHDFS for seamless file transfer and management. This service is designed for users who need to upload, download, and manage large files on HDFS through a familiar file management interface. The WebDAV protocol allows users to access the HDFS in a way that mimics the experience of working with local file systems, offering the flexibility and ease of use of tools like Finder, File Explorer, and rclone, among others.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
WebDAV-compatible clients allow for the management of large datasets and offer multithreaded operations for enhanced performance. This makes the WebDAV service ideal for handling large-scale data uploads and downloads, with support for efficient file transfers.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The service also provides a simple and effective way to interface with HDFS, especially for users who prefer a file-system-like experience for managing their data, rather than relying on more technical methods like command-line tools.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= How to connect to the service =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To connect to the WebDAV service, follow the steps below:&lt;br /&gt;
&lt;br /&gt;
# '''Use a WebDAV-compatible client:'''&lt;br /&gt;
#*Finder (macOS)&lt;br /&gt;
#*File Explorer (Windows)&lt;br /&gt;
#*Linux File System (Linux)&lt;br /&gt;
#*rclone (CLI)&lt;br /&gt;
#*Cyberduck&lt;br /&gt;
#*CrossFTP&lt;br /&gt;
#*curl&lt;br /&gt;
# '''Mount the WebDAV endpoint:'''&lt;br /&gt;
#: The WebDAV server is accessible via the following URL: https://webdav-hdfs.pic.es/ [https://webdav.pic.es/]&lt;br /&gt;
# '''Authenticate with your PIC credentials:'''&lt;br /&gt;
#: When prompted, enter your PIC user credentials to authenticate your session. This ensures that only authorized users have access to the HDFS.&lt;br /&gt;
# '''Browse, upload, and download files:'''&lt;br /&gt;
#: Once connected, you will be able to manage your files in the same way you would with any local file system. You can drag and drop files, create directories, and manage large datasets directly on the HDFS.&lt;br /&gt;
**''Tip'': Using a multithreaded client like rclone or Cyberduck will help improve upload and download speeds for large files.&lt;br /&gt;
&lt;br /&gt;
= Usage Guidelines=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* File Uploads:&lt;br /&gt;
&lt;br /&gt;
** The WebDAV service allows you to upload files of any size to HDFS.&lt;br /&gt;
** For large datasets, it is recommended to use a multithreaded client, as this will optimize the upload process and make it faster and more efficient.&lt;br /&gt;
&lt;br /&gt;
* File Downloads:&lt;br /&gt;
&lt;br /&gt;
** You can also download files from the HDFS to your local machine.&lt;br /&gt;
** Multithreading is supported, allowing you to download large files quickly.&lt;br /&gt;
&lt;br /&gt;
* File Management:&lt;br /&gt;
&lt;br /&gt;
** Files and directories can be created, deleted, or renamed directly within your WebDAV-compatible client.&lt;br /&gt;
&lt;br /&gt;
** You can also move files around within your HDFS storage, making file management easier.&lt;br /&gt;
&lt;br /&gt;
=Best Practices=&lt;br /&gt;
'''Optimize large file transfers:'''&lt;br /&gt;
&lt;br /&gt;
: To improve performance, use tools like rclone or Cyberduck for multithreaded file transfers. This helps manage the upload and download of large files or large quantities of files more efficiently.&lt;br /&gt;
&lt;br /&gt;
=Troubleshooting=&lt;br /&gt;
&lt;br /&gt;
'''Cannot Connect to the WebDAV Service:'''&lt;br /&gt;
:If you're unable to connect to the WebDAV server, ensure that you're using the correct URL and that your PIC credentials are entered properly. Also, verify that your WebDAV-compatible client is configured correctly.&lt;br /&gt;
&lt;br /&gt;
'''Slow Uploads or Downloads:'''&lt;br /&gt;
:If you experience slow file transfer speeds, check if your client supports multithreaded transfers and consider using a tool like rclone or Cyberduck, which support multiple threads for faster transfers.&lt;br /&gt;
&lt;br /&gt;
'''File Uploads Fail'''&lt;br /&gt;
:If file uploads fail, try splitting large files into smaller chunks or use an alternative tool that supports better error handling and retry capabilities, such as rclone.&lt;br /&gt;
&lt;br /&gt;
=Security and Data Management=&lt;br /&gt;
&lt;br /&gt;
'''PIC Credentials:'''&lt;br /&gt;
: Always ensure that your PIC credentials are secure. Do not share your credentials with others or store them in insecure locations.&lt;/div&gt;</summary>
		<author><name>Sbogaart</name></author>
	</entry>
	<entry>
		<id>https://pwiki.pic.es/index.php?title=Hadoop_Distributed_File_System_(HDFS)&amp;diff=1224</id>
		<title>Hadoop Distributed File System (HDFS)</title>
		<link rel="alternate" type="text/html" href="https://pwiki.pic.es/index.php?title=Hadoop_Distributed_File_System_(HDFS)&amp;diff=1224"/>
		<updated>2025-05-14T13:14:30Z</updated>

		<summary type="html">&lt;p&gt;Sbogaart: /* How to connect to the service */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Introduction =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
PIC offers a WebDAV service over WebHDFS for seamless file transfer and management. This service is designed for users who need to upload, download, and manage large files on HDFS through a familiar file management interface. The WebDAV protocol allows users to access the HDFS in a way that mimics the experience of working with local file systems, offering the flexibility and ease of use of tools like Finder, File Explorer, and rclone, among others.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
WebDAV-compatible clients allow for the management of large datasets and offer multithreaded operations for enhanced performance. This makes the WebDAV service ideal for handling large-scale data uploads and downloads, with support for efficient file transfers.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The service also provides a simple and effective way to interface with HDFS, especially for users who prefer a file-system-like experience for managing their data, rather than relying on more technical methods like command-line tools.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= How to connect to the service =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To connect to the WebDAV service, follow the steps below:&lt;br /&gt;
&lt;br /&gt;
# '''Use a WebDAV-compatible client:'''&lt;br /&gt;
#*Finder (macOS)&lt;br /&gt;
#*File Explorer (Windows)&lt;br /&gt;
#*Linux File System (Linux)&lt;br /&gt;
#*rclone (CLI)&lt;br /&gt;
#*Cyberduck&lt;br /&gt;
#*CrossFTP&lt;br /&gt;
#*curl&lt;br /&gt;
# '''Mount the WebDAV endpoint:'''&lt;br /&gt;
#: The WebDAV server is accessible via the following URL: https://webdav.pic.es/ [https://webdav.pic.es/]&lt;br /&gt;
# '''Authenticate with your PIC credentials:'''&lt;br /&gt;
#: When prompted, enter your PIC user credentials to authenticate your session. This ensures that only authorized users have access to the HDFS.&lt;br /&gt;
# '''Browse, upload, and download files:'''&lt;br /&gt;
#: Once connected, you will be able to manage your files in the same way you would with any local file system. You can drag and drop files, create directories, and manage large datasets directly on the HDFS.&lt;br /&gt;
**''Tip'': Using a multithreaded client like rclone or Cyberduck will help improve upload and download speeds for large files.&lt;br /&gt;
&lt;br /&gt;
= Usage Guidelines=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* File Uploads:&lt;br /&gt;
&lt;br /&gt;
** The WebDAV service allows you to upload files of any size to HDFS.&lt;br /&gt;
** For large datasets, it is recommended to use a multithreaded client, as this will optimize the upload process and make it faster and more efficient.&lt;br /&gt;
&lt;br /&gt;
* File Downloads:&lt;br /&gt;
&lt;br /&gt;
** You can also download files from the HDFS to your local machine.&lt;br /&gt;
** Multithreading is supported, allowing you to download large files quickly.&lt;br /&gt;
&lt;br /&gt;
* File Management:&lt;br /&gt;
&lt;br /&gt;
** Files and directories can be created, deleted, or renamed directly within your WebDAV-compatible client.&lt;br /&gt;
&lt;br /&gt;
** You can also move files around within your HDFS storage, making file management easier.&lt;br /&gt;
&lt;br /&gt;
=Best Practices=&lt;br /&gt;
'''Optimize large file transfers:'''&lt;br /&gt;
&lt;br /&gt;
: To improve performance, use tools like rclone or Cyberduck for multithreaded file transfers. This helps manage the upload and download of large files or large quantities of files more efficiently.&lt;br /&gt;
&lt;br /&gt;
=Troubleshooting=&lt;br /&gt;
&lt;br /&gt;
'''Cannot Connect to the WebDAV Service:'''&lt;br /&gt;
:If you're unable to connect to the WebDAV server, ensure that you're using the correct URL and that your PIC credentials are entered properly. Also, verify that your WebDAV-compatible client is configured correctly.&lt;br /&gt;
&lt;br /&gt;
'''Slow Uploads or Downloads:'''&lt;br /&gt;
:If you experience slow file transfer speeds, check if your client supports multithreaded transfers and consider using a tool like rclone or Cyberduck, which support multiple threads for faster transfers.&lt;br /&gt;
&lt;br /&gt;
'''File Uploads Fail'''&lt;br /&gt;
:If file uploads fail, try splitting large files into smaller chunks or use an alternative tool that supports better error handling and retry capabilities, such as rclone.&lt;br /&gt;
&lt;br /&gt;
=Security and Data Management=&lt;br /&gt;
&lt;br /&gt;
'''PIC Credentials:'''&lt;br /&gt;
: Always ensure that your PIC credentials are secure. Do not share your credentials with others or store them in insecure locations.&lt;/div&gt;</summary>
		<author><name>Sbogaart</name></author>
	</entry>
	<entry>
		<id>https://pwiki.pic.es/index.php?title=Hadoop_Distributed_File_System_(HDFS)&amp;diff=1223</id>
		<title>Hadoop Distributed File System (HDFS)</title>
		<link rel="alternate" type="text/html" href="https://pwiki.pic.es/index.php?title=Hadoop_Distributed_File_System_(HDFS)&amp;diff=1223"/>
		<updated>2025-05-14T13:13:32Z</updated>

		<summary type="html">&lt;p&gt;Sbogaart: /* Introduction */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Introduction =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
PIC offers a WebDAV service over WebHDFS for seamless file transfer and management. This service is designed for users who need to upload, download, and manage large files on HDFS through a familiar file management interface. The WebDAV protocol allows users to access the HDFS in a way that mimics the experience of working with local file systems, offering the flexibility and ease of use of tools like Finder, File Explorer, and rclone, among others.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
WebDAV-compatible clients allow for the management of large datasets and offer multithreaded operations for enhanced performance. This makes the WebDAV service ideal for handling large-scale data uploads and downloads, with support for efficient file transfers.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The service also provides a simple and effective way to interface with HDFS, especially for users who prefer a file-system-like experience for managing their data, rather than relying on more technical methods like command-line tools.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= How to connect to the service =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To connect to the WebDAV service, follow the steps below:&lt;br /&gt;
&lt;br /&gt;
# '''Use a WebDAV-compatible client:'''&lt;br /&gt;
#*Finder (macOS)&lt;br /&gt;
#*File Explorer (Windows)&lt;br /&gt;
#*Linux File System (Linux)&lt;br /&gt;
#*rclone (CLI)&lt;br /&gt;
#*Cyberduck (macOS, Windows)&lt;br /&gt;
#*CrossFTP (macOS, Windows, Linux)&lt;br /&gt;
# '''Mount the WebDAV endpoint:'''&lt;br /&gt;
#: The WebDAV server is accessible via the following URL: https://webdav.pic.es/ [https://webdav.pic.es/]&lt;br /&gt;
# '''Authenticate with your PIC credentials:'''&lt;br /&gt;
#: When prompted, enter your PIC user credentials to authenticate your session. This ensures that only authorized users have access to the HDFS.&lt;br /&gt;
# '''Browse, upload, and download files:'''&lt;br /&gt;
#: Once connected, you will be able to manage your files in the same way you would with any local file system. You can drag and drop files, create directories, and manage large datasets directly on the HDFS.&lt;br /&gt;
**''Tip'': Using a multithreaded client like rclone or Cyberduck will help improve upload and download speeds for large files.&lt;br /&gt;
&lt;br /&gt;
= Usage Guidelines=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* File Uploads:&lt;br /&gt;
&lt;br /&gt;
** The WebDAV service allows you to upload files of any size to HDFS.&lt;br /&gt;
** For large datasets, it is recommended to use a multithreaded client, as this will optimize the upload process and make it faster and more efficient.&lt;br /&gt;
&lt;br /&gt;
* File Downloads:&lt;br /&gt;
&lt;br /&gt;
** You can also download files from the HDFS to your local machine.&lt;br /&gt;
** Multithreading is supported, allowing you to download large files quickly.&lt;br /&gt;
&lt;br /&gt;
* File Management:&lt;br /&gt;
&lt;br /&gt;
** Files and directories can be created, deleted, or renamed directly within your WebDAV-compatible client.&lt;br /&gt;
&lt;br /&gt;
** You can also move files around within your HDFS storage, making file management easier.&lt;br /&gt;
&lt;br /&gt;
=Best Practices=&lt;br /&gt;
'''Optimize large file transfers:'''&lt;br /&gt;
&lt;br /&gt;
: To improve performance, use tools like rclone or Cyberduck for multithreaded file transfers. This helps manage the upload and download of large files or large quantities of files more efficiently.&lt;br /&gt;
&lt;br /&gt;
=Troubleshooting=&lt;br /&gt;
&lt;br /&gt;
'''Cannot Connect to the WebDAV Service:'''&lt;br /&gt;
:If you're unable to connect to the WebDAV server, ensure that you're using the correct URL and that your PIC credentials are entered properly. Also, verify that your WebDAV-compatible client is configured correctly.&lt;br /&gt;
&lt;br /&gt;
'''Slow Uploads or Downloads:'''&lt;br /&gt;
:If you experience slow file transfer speeds, check if your client supports multithreaded transfers and consider using a tool like rclone or Cyberduck, which support multiple threads for faster transfers.&lt;br /&gt;
&lt;br /&gt;
'''File Uploads Fail'''&lt;br /&gt;
:If file uploads fail, try splitting large files into smaller chunks or use an alternative tool that supports better error handling and retry capabilities, such as rclone.&lt;br /&gt;
&lt;br /&gt;
=Security and Data Management=&lt;br /&gt;
&lt;br /&gt;
'''PIC Credentials:'''&lt;br /&gt;
: Always ensure that your PIC credentials are secure. Do not share your credentials with others or store them in insecure locations.&lt;/div&gt;</summary>
		<author><name>Sbogaart</name></author>
	</entry>
	<entry>
		<id>https://pwiki.pic.es/index.php?title=Hadoop_Distributed_File_System_(HDFS)&amp;diff=1222</id>
		<title>Hadoop Distributed File System (HDFS)</title>
		<link rel="alternate" type="text/html" href="https://pwiki.pic.es/index.php?title=Hadoop_Distributed_File_System_(HDFS)&amp;diff=1222"/>
		<updated>2025-05-14T12:47:13Z</updated>

		<summary type="html">&lt;p&gt;Sbogaart: The WebDAV service over WebHDFS offered by PIC provides a user-friendly and efficient way to interact with your data on HDFS. Whether you're uploading large files, managing your datasets, or downloading files, this service simplifies the process by integrating with common file management tools.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Introduction =&lt;br /&gt;
&lt;br /&gt;
PIC offers a WebDAV service over WebHDFS for seamless file transfer and management. This service is designed for users who need to upload, download, and manage large files on HDFS through a familiar file management interface. The WebDAV protocol allows users to access the HDFS in a way that mimics the experience of working with local file systems, offering the flexibility and ease of use of tools like Finder, File Explorer, and rclone, among others.&lt;br /&gt;
&lt;br /&gt;
WebDAV-compatible clients allow for the management of large datasets and offer multithreaded operations for enhanced performance. This makes the WebDAV service ideal for handling large-scale data uploads and downloads, with support for efficient file transfers.&lt;br /&gt;
&lt;br /&gt;
The service also provides a simple and effective way to interface with HDFS, especially for users who prefer a file-system-like experience for managing their data, rather than relying on more technical methods like command-line tools.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== How to connect to the service ==&lt;br /&gt;
To connect to the WebDAV service, follow the steps below:&lt;br /&gt;
&lt;br /&gt;
1. Use a WebDAV-compatible client:&lt;br /&gt;
Finder (macOS)&lt;br /&gt;
File Explorer (Windows)&lt;br /&gt;
Linux File System (Linux)&lt;br /&gt;
rclone (CLI)&lt;br /&gt;
Cyberduck (macOS, Windows)&lt;br /&gt;
CrossFTP (macOS, Windows, Linux)&lt;br /&gt;
&lt;br /&gt;
2. Mount the WebDAV endpoint:&lt;br /&gt;
The WebDAV server is accessible via the following URL: [https://webdav.pic.es/hdfs]&lt;br /&gt;
&lt;br /&gt;
3. Authenticate with your PIC credentials:&lt;br /&gt;
When prompted, enter your PIC user credentials to authenticate your session. This ensures that only authorized users have access to the HDFS.&lt;br /&gt;
&lt;br /&gt;
4. Browse, upload, and download files:&lt;br /&gt;
Once connected, you will be able to manage your files in the same way you would with any local file system. You can drag and drop files, create directories, and manage large datasets directly on the HDFS.&lt;br /&gt;
Tip: Using a multithreaded client like rclone or Cyberduck will help improve upload and download speeds for large files.&lt;br /&gt;
&lt;br /&gt;
==Usage Guidelines==&lt;br /&gt;
- File Uploads:&lt;br /&gt;
· The WebDAV service allows you to upload files of any size to HDFS.&lt;br /&gt;
· For large datasets, it is recommended to use a multithreaded client, as this will optimize the upload process and make it faster and more efficient.&lt;br /&gt;
- File Downloads:&lt;br /&gt;
· You can also download files from the HDFS to your local machine.&lt;br /&gt;
· Multithreading is supported, allowing you to download large files quickly.&lt;br /&gt;
- File Management:&lt;br /&gt;
· Files and directories can be created, deleted, or renamed directly within your WebDAV-compatible client.&lt;br /&gt;
· You can also move files around within your HDFS storage, making file management easier.&lt;br /&gt;
&lt;br /&gt;
==Best Practices==&lt;br /&gt;
1. Optimize large file transfers:&lt;br /&gt;
To improve performance, use tools like rclone or Cyberduck for multithreaded file transfers. This helps manage the upload and download of large files or large quantities of files more efficiently.&lt;br /&gt;
&lt;br /&gt;
==Troubleshooting==&lt;br /&gt;
&lt;br /&gt;
1. Cannot Connect to the WebDAV Service:&lt;br /&gt;
If you're unable to connect to the WebDAV server, ensure that you're using the correct URL and that your PIC credentials are entered properly. Also, verify that your WebDAV-compatible client is configured correctly.&lt;br /&gt;
2. Slow Uploads or Downloads:&lt;br /&gt;
If you experience slow file transfer speeds, check if your client supports multithreaded transfers and consider using a tool like rclone or Cyberduck, which support multiple threads for faster transfers.&lt;br /&gt;
3. File Uploads Fail:&lt;br /&gt;
If file uploads fail, try splitting large files into smaller chunks or use an alternative tool that supports better error handling and retry capabilities, such as rclone.&lt;br /&gt;
&lt;br /&gt;
==Security and Data Management==&lt;br /&gt;
&lt;br /&gt;
- PIC Credentials:&lt;br /&gt;
Always ensure that your PIC credentials are secure. Do not share your credentials with others or store them in insecure locations.&lt;/div&gt;</summary>
		<author><name>Sbogaart</name></author>
	</entry>
	<entry>
		<id>https://pwiki.pic.es/index.php?title=Main_Page&amp;diff=1221</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://pwiki.pic.es/index.php?title=Main_Page&amp;diff=1221"/>
		<updated>2025-05-14T12:27:44Z</updated>

		<summary type="html">&lt;p&gt;Sbogaart: /* Services */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Getting started ==&lt;br /&gt;
* [[PIC description|PIC in an image]]&lt;br /&gt;
* [[PIC account|Get a PIC account]]&lt;br /&gt;
* [[PIC_User_Manual | User manual]]&lt;br /&gt;
* [[faq| Frequently asked questions]]&lt;br /&gt;
&lt;br /&gt;
== Services ==&lt;br /&gt;
* [[HTCondor]]&lt;br /&gt;
* [[Storage]]&lt;br /&gt;
* [[JupyterHub]]&lt;br /&gt;
* [[Gitlab]]&lt;br /&gt;
* [[CosmoHub]]&lt;br /&gt;
* [[Hadoop Distributed File System (HDFS)]]&lt;br /&gt;
* Spark:&lt;br /&gt;
** [[Spark on Hadoop|on Hadoop]]&lt;br /&gt;
** [[Spark_on_farm|on HTCondor]]&lt;br /&gt;
* [[Transferring data to/from PIC]]&lt;br /&gt;
&lt;br /&gt;
== Experiments ==&lt;br /&gt;
&lt;br /&gt;
* [[Euclid]]&lt;br /&gt;
* [[AGN ICE]]&lt;br /&gt;
* [[ICFO]]&lt;br /&gt;
&lt;br /&gt;
== More technical information ==&lt;br /&gt;
&lt;br /&gt;
* [[Storage Department]]&lt;/div&gt;</summary>
		<author><name>Sbogaart</name></author>
	</entry>
</feed>